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NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 



1. CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part application of U.S. Provisional Application 
5 Serial No. 60/339,739 filed December 10, 2001 entitled "Novel Nucleic Acids and Secreted 
Polypeptides", Attorney Docket No. 811; U.S. Application Serial No. 10/128,558 filed April 
22, 2002 entitled "Novel Nucleic Acids and Polypeptides", Attorney Docket No. 812A, 
which in turn claims the benefit of U.S. Provisional Application Serial No. 60/339,453 filed 
December 11, 2001 entitled "Novel Nucleic Acids and Polypeptides", Attorney Docket No. 

10 812; U.S. Provisional Application Serial No. 60/365,384 filed March 14, 2002 entitled 
"Novel Nucleic Acids and Secreted Polypeptides", Attorney Docket No. 814; U.S. 
Provisional Application Serial No. 60/365,091 filed March 14, 2002 entitled "Novel Nucleic 
Acids and Polypeptides", Attorney Docket No. 815; U.S. Provisional Application Serial No. 
60/372,615 filed April 12, 2002 entitled "Novel Nucleic Acids and Secreted Polypeptides", 

15 Attorney Docket No. 817; U.S. Provisional Application Serial No. 60/376,045 filed April 24, 
2002 entitled "Novel Nucleic Acids and Secreted Polypeptides", Attorney Docket No. 
817CIP; U.S. Provisional Application Serial No. 60/372,381 filed April 12, 2002 entitled 
"Novel Nucleic Acids and Polypeptides", Attorney Docket No. 818; PCT Application Serial 
No. PCT/US00/35017 filed December 22, 2000 entitled "Novel Contigs Obtained from 

20 Various Libraries", Attorney Docket No. 784CIP3A/PCT, which in turn is a continuation-in- 
part application of U.S. Application Serial No. 09/552,317 filed April 25, 2000 entitled 
"Novel Contigs Obtained from Various Libraries", Attorney Docket No. 784CIP, which in 
turn is a continuation-in-part application of U.S. Application Serial No. 09/488,725 filed 
January 21, 2000 entitled "Novel Contigs Obtained from Various Libraries", Attorney 

25 Docket No. 784; PCT Application Serial No. PCT/US0 1/02623 filed January 25, 2001 
entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 
785CIP3/PCT, which in turn is a continuation-in-part application of U.S. Application Serial 
No. 09/491,404 filed January 25, 2000 entitled "Novel Contigs Obtained from Various 
Libraries", Attorney Docket No. 785; PCT Application Serial No. PCT/US0 1/03 800 filed 

30 February 5, 2001 entitled "Novel Contigs Obtained from Various Libraries", Attorney 
Docket No. 787CIP3/PCT, which in turn is a continuation-in-part application of U.S. 
Application Serial No. 09/560,875 filed April 27, 2000 entitled "Novel Contigs Obtained 
from Various Libraries", Attorney Docket No. 787CIP, which in turn is a continuation-in- 
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part application of U.S. Application Serial No. 09/496,914 filed February 03, 2000 entitled 
"Novel Contigs Obtained from Various Libraries", Attorney Docket No. 787; PCT 
Application Serial No. PCT/US0 1/04927 filed February 26, 2001 entitled "Novel Contigs 
Obtained from Various Libraries", Attorney Docket No. 788CIP3/PCT, which in turn is a 
5 continuation-in-part application of U.S. Application Serial No. 09/577,409 filed May 1 8, 
2000 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 
788CIP, which in turn is a continuation-in-part application of U.S. Application Serial No. 
09/515,126 filed February 28, 2000 entitled "Novel Contigs Obtained from Various 
Libraries", Attorney Docket No. 788; PCT Application Serial No. PCTVUS0 1/04941 filed 

10 March 5, 2001 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket 
No. 789CIP3/PCT, which in turn is a continuation-in-part application of U.S. Application 
Serial No. 09/574,454 filed May 19, 2000 entitled "Novel Contigs Obtained from Various 
Libraries", Attorney Docket No. 789CIP, which in turn is a continuation-in-part application 
of U.S. Application Serial No. 09/519,705 filed March 07, 2000 entitled "Novel Contigs 

1 5 Obtained from Various Libraries", Attorney Docket No. 789; PCT Application Serial No. 
PCT/US01/0863 1 filed March 30, 2001 entitled "Novel Contigs Obtained from Various 
Libraries", Attorney Docket No. 790CIP3/PCT, which in turn is a continuation-in-part 
application of U.S. Application Serial No. 09/649,167 filed August 23, 2000 entitled "Novel 
Contigs Obtained from Various Libraries", Attorney Docket No. 790CIP, which in turn is a 

20 continuation-in-part application of U.S. Application Serial No. 09/540,217 filed March 31, 

2000 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 790; 
PCT Application Serial No. PCT/US01/08656 filed April 18, 2001 entitled "Novel Contigs 
Obtained from Various Libraries", Attorney Docket No. 791CIP3/PCT, which in turn is a 
continuation-in-part application of U.S. Application Serial No. 09/770,160 filed January 26, 

25 2001 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 

791CIP, which is in turn a continuation-in-part application of U.S. Application Serial No. 
09/552,929 filed April 18, 2000 entitled "Novel Contigs Obtained from Various Libraries", 
Attorney Docket No. 791; and PCT Application Serial No. PCT/US0 1/14827 filed May 16, 

2001 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 

30 792CIP3/PCT, which in turn is a continuation-in-part application of U.S. Application Serial 
No. 09/577,408 filed May 18, 2000 entitled "Novel Contigs Obtained from Various 
Libraries", Attorney Docket No. 792; all of which are incorporated herein by reference in 
their entirety, specifically including, but not limited to , incorporation by reference of the 
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tables in each application displaying sequence information, ematrix signatures, pfam 
signatures, signal peptide information, transmembrane domain information, chromosomal 
localization and tissue distribution information, and/or 3-dimensional structural information. 

5 2. BACKGROUND OF THE INVENTION 

2.1 TECHNICAL FIELD 

The present invention provides novel polynucleotides and proteins encoded by such 
polynucleotides, along with uses for these polynucleotides and proteins, for example in 
therapeutic, diagnostic and research methods. 

10 

2.2 BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, such 
as lymphokines, interferons, circulating soluble factors, chemokines, and interleukins) has 
matured rapidly over the past decade. The now routine hybridization cloning and expression 

1 5 cloning techniques clone novel polynucleotides "directly" in the sense that they rely on 

information directly related to the discovered protein (i.e., partial DNA/amino acid sequence 
of the protein in the case of hybridization cloning; activity of the protein in the case of 
expression cloning). More recent "indirect" cloning techniques such as signal sequence 
cloning, which isolates DNA sequences based on the presence of a now well-recognized 

20 secretory leader sequence motif, as well as various PCR-based or low stringency 

hybridization-based cloning techniques, have advanced the state of the art by making 
available large numbers of DNA/amino acid sequences for proteins that are known to have 
biological activity, for example, by virtue of their secreted nature in the case of leader 
sequence cloning, by virtue of their cell or tissue source in the case of PCR-based 

25 techniques, or by virtue of structural similarity to other genes of known biological activity. 

Identified polynucleotide and polypeptide sequences have numerous applications in, 
for example, diagnostics, forensics, gene mapping; identification of mutations responsible 
for genetic disorders or other traits, to assess biodiversity, and to produce many other types 
of data and products dependent on DNA and amino acid sequences. 



30 



3. SUMMARY OF THE INVENTION 

The compositions of the present invention include novel isolated polypeptides, novel 
isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules. 
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cloned genes or degenerate variants thereof, especially naturally occurring variants such as 
allelic variants, antisense polynucleotide molecules, and antibodies that specifically recognize 
one or more epitopes present on such polypeptides, as well as hybridomas producing such 
antibodies. 

5 The compositions of the present invention additionally include vectors, including 

expression vectors, containing the polynucleotides of the invention, cells genetically engineered 
to contain such polynucleotides and cells genetically engineered to express such 
polynucleotides. 

The present invention relates to a collection or library of at least one novel nucleic acid 

1 0 sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by 
hybridization (SBH), and in some cases, sequences obtained from one or more public 
databases. The invention relates also to the proteins encoded by such polynucleotides, along 
with therapeutic, diagnostic and research utilities for these polynucleotides and proteins. These 
nucleic acid sequences are designated as SEQ ID NO: 1-911, or 1 823-2478 and are provided in 

1 5 the Sequence Listing. In the nucleic acids provided in the Sequence Listing, A is adenine; C is 
cytosine; G is guanine; T is thymine; and N is any of the four bases or unknown. In the amino 
acids provided in the Sequence Listing, * corresponds to the. stop codon. 

The nucleic acid sequences of the present invention also include, nucleic acid sequences 
that hybridize to the complement of SEQ ID NO: 1-91 1, or 1823-2478 under stringent 

20 hybridization conditions; nucleic acid sequences which are allelic variants or species 

homologues of any of the nucleic acid sequences recited above, or nucleic acid sequences that 
encode a peptide comprising a specific domain or truncation of the peptides encoded by SEQ 
ID NO: 1-911, or 1823-2478. A polynucleotide comprising a nucleotide sequence having at 
least 90% identity to an identifying sequence of SEQ ID NO: 1-911, or 1823-2478 or a 

25 degenerate variant or fragment thereof. The identifying sequence can be 1 00 base pairs in 
length. 

The nucleic acid sequences of the present invention also include the sequence 
information from the nucleic acid sequences of SEQ ID NO: 1-911, or 1823-2478. The 
sequence information can be a segment of any one of SEQ ID NO: 1-911, or 1823-2478 that 
30 uniquely identifies or represents the sequence information of SEQ ID NO: 1-91 1, or 1 823-2478. 

A collection as used in this application can be a collection of only one polynucleotide. 
The collection of sequence information or identifying information of each sequence can be 
provided on a nucleic acid array. In one embodiment, segments of sequence information are 
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provided on a nucleic acid array to detect the polynucleotide that contains the segment. The 
array can be designed to detect full-match or mismatch to the polynucleotide that contains the 
segment The collection can also be provided in a computer-readable format. 

This invention also includes the reverse or direct complement of any of the nucleic acid 
sequences recited above; cloning or expression vectors containing the nucleic acid sequences; 
and host cells or organisms transformed with these expression vectors. Nucleic acid sequences 
(or their reverse or direct complements) according to the invention have numerous applications 
in a variety of techniques known to those skilled in the art of molecular biology, such as use as 
hybridization probes, use as primers for PGR, use in an array, use in computer-readable media, 
use in sequencing full-length genes, use for chromosome and gene mapping, use in the 
recombinant production of protein, and use in the generation of anti-sense DNA or RNA, their 

chemical analogs and the like. 

In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1-91 1, or 1823- 
2478 or novel segments or parts of the nucleic acids of the invention are used as primers in 
expression assays that are well known in the art. In a particularly preferred embodiment, the 
nucleic acid sequences of SEQ ID NO: 1-911, or 1823-2478 or novel segments or parts of the 
nucleic acids provided herein are used in diagnostics for identifying expressed genes or, as well 
known in the art and exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed 
sequence tags for physical mapping of the human genome. 

The isolated polynucleotides of the invention include, but are not limited to, a 
polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1-911, 
or 1823-2478; a polynucleotide comprising any of the full length protein coding sequences of 
SEQ ID NO: 1-91 1, or 1823-2478; and a polynucleotide comprising any of the nucleotide 
sequences of the mature protein coding sequences of SEQ ID NO: 1-911, or 1823-2478. The 
polynucleotides of the present invention also include, but are not limited to, a polynucleotide 
that hybridizes under stringent hybridization conditions to (a) the complement of any one of the 
nucleotide sequences set forth in SEQ ID NO: 1-9 1 1 , or 1 823-2478; (b) a nucleotide sequence 
encoding any one of the amino acid sequences set forth in SEQ ID NO: 1-911, or 1823-2478; 
(c) a polynucleotide which is an allelic variant of any polynucleotides recited above; (d) a 
polynucleotide which encodes a species homologue (e.g. orthologs) of any of the proteins 
recited above; or (e) a polynucleotide that encodes a polypeptide comprising a specific domain 
or truncation of any of the polypeptides comprising an amino acid sequence set forth in SEQ ID 
NO: 912-1822, or 2479-3134, or Tables 3 A, 3B, 5, or 6. 
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The isolated polypeptides of the invention include, but are not limited to, a polypeptide 
comprising any of the amino acid sequences set forth in the Sequence Listing; or the 
corresponding full length or mature protein. Polypeptides of the invention (SEQ ID NO: 912- 
1 822, or 2479-3 134) also include polypeptides with biological activity that are encoded by (a) 
5 any of the polynucleotides having a nucleotide sequence set forth in SEQ ID NO: 1 -91 1 , or 
1823-2478; or (b) polynucleotides that hybridize to the complement of the polynucleotides of 
(a) under stringent hybridization conditions. Biologically active variants of any of the 
polypeptide sequences in the Sequence Listing, and "substantial equivalents" thereof (e.g., with 
at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% amino acid sequence 
10 identity) that preferably retain biological activity are also contemplated. The polypeptides of the 
invention may be wholly or partially chemically synthesized but are preferably produced by 
recombinant means using the genetically engineered cells (e.g. host cells) of the invention. 

The invention also provides compositions comprising a polypeptide of the invention. 
Polypeptide compositions of the invention may further comprise an acceptable carrier, such 
15 as a hydrophilic, e.g., pharmaceutically acceptable, carrier. 

The invention also provides host cells transformed or transfected with a 
polynucleotide of the invention. 

The invention also relates to methods for producing a polypeptide of the invention 
comprising growing a culture of the host cells of the invention in a suitable culture medium 
20 under conditions permitting expression of the desired polypeptide, and purifying the 

polypeptide from the culture or from the host cells. Preferred embodiments include those in 
which the protein produced by such processes is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a variety 
of techniques known to those skilled in the art of molecular biology. These techniques 
25 include use as hybridization probes, use as oligomers, or primers, for PCR, use for 

chromosome and gene mapping, use in the recombinant production of protein, and use in 
generation of anti-sense DNA or ENA, their chemical analogs and the like. For example, 
when the expression of an mRNA is largely restricted to a particular cell or tissue type, 
polynucleotides of the invention can be used as hybridization probes to detect the presence 
30 of the particular cell or tissue mRNA in a sample using, e.g., in situ hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
expressed sequence tags for identifying expressed genes or, as well known in the art and 
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exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for 
physical mapping of the human genome. 

The polypeptides according to the invention can be used in a variety of conventional 
procedures and methods that are currently applied to other proteins. For example, a 
5 polypeptide of the invention can be used to generate an antibody that specifically binds the 
polypeptide. Such antibodies, particularly monoclonal antibodies, are useful for detecting or 
quantitating the polypeptide in tissue. The polypeptides of the invention can also be used as 
molecular weight markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical 

10 condition which comprises the step of administering to a mammalian subject a 

therapeutically effective amount of a composition comprising a polypeptide of the present 
invention and a pharmaceutically acceptable carrier. 

In particular, the polypeptides and polynucleotides of the invention can be utilized, 
for example, in methods for the prevention and/or treatment of disorders involving aberrant 

15 protein expression or biological activity. 

The present invention further relates to methods for detecting the presence of the 
polynucleotides or polypeptides of the invention in a sample. Such methods can, for 
example, be utilized as part of prognostic and diagnostic evaluation of disorders as recited 
herein and for the identification of subjects exhibiting a predisposition to such conditions. 

20 The invention provides a method for detecting the polynucleotides of the invention in a 
sample, comprising contacting the sample with a compound that binds to and forms a 
complex with the polynucleotide of interest for a period sufficient to form the complex and 
under conditions sufficient to form a complex and detecting the complex such that if a 
complex is detected, the polynucleotide of interest is detected. The invention also provides a 

25 method for detecting the polypeptides of the invention in a sample comprising contacting the 
sample with a compound that binds to and forms a complex with the polypeptide under 
conditions and for a period sufficient to form the complex and detecting the formation of the 
complex such that if a complex is formed, the polypeptide is detected. 

The invention also provides kits comprising polynucleotide probes and/or 

30 monoclonal antibodies, and optionally quantitative standards, for carrying out methods of the 
invention. Furthermore, the invention provides methods for evaluating the efficacy of drugs, 
and monitoring the progress of patients, involved in clinical trials for the treatment of 
disorders as recited above. 
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The invention also provides methods for the identification of compounds that 
modulate (i.e., increase or decrease) the expression or activity of the polynucleotides and/or 
polypeptides of the invention. Such methods can be utilized, for example, for the 
identification of compounds that can ameliorate symptoms of disorders as recited herein. 
5 Such methods can include, but are not limited to, assays for identifying compounds and 
other substances that interact with (e.g., bind to) the polypeptides of the invention. The 
invention provides a method for identifying a compound that binds to the polypeptides of the 
invention comprising contacting the compound with a polypeptide of the invention in a cell 
for a time sufficient to form a polypeptide/compound complex, wherein the complex drives 

1 0 expression of a reporter gene sequence in the cell; and detecting the complex by detecting 

the reporter gene sequence expression such that if expression of the reporter gene is detected 
the compound that binds to a polypeptide of the invention is identified. 

The methods of the invention also provide methods for treatment which involve the 
administration of the polynucleotides or polypeptides of the invention to individuals 

15 exhibiting symptoms or tendencies. In addition, the invention encompasses methods for 
treating diseases or disorders as recited herein comprising administering compounds and 
other substances that modulate the overall activity of the target gene products. Compounds 
and other substances can affect such modulation either on the level of target gene/protein 
expression or target protein activity. 

20 The polypeptides of the present invention (e.g. SEQ ID NO: 912-1822, or 2479- 

3134) and the polynucleotides encoding them (e.g. SEQ ID NO: 1-911, or 1823-2478) are 
also useful for the same functions known to one of skill in the art as the polypeptides and 
polynucleotides to which they have homology (set forth in Tables 2A and 2B); for which 
they have a signature region (as set forth in Tables 3 A and 3B); or for which they have 

25 homology to a gene family (as set forth in Tables 4 A and 4B). If no homology is set forth 
for a sequence, then the polypeptides and polynucleotides of the present invention are useful 
for a variety of applications, as described herein, including use in arrays for detection. 

4. DETAILED DESCRIPTION OF THE INVENTION 
30 4.1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular forms 
"a", "an" and "the" include plural references unless the context clearly dictates otherwise. 
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The term "active" refers to those forms of the polypeptide which retain the biologic 
and/or immunologic activities of any naturally occurring polypeptide. According to the 
invention, the terms "biologically active" or "biological activity" refer to a protein or peptide 
having structural, regulatory or biochemical functions of a naturally occurring molecule. 
5 Likewise "immunologically active" or "immunological activity" refers to the capability of 
the natural, recombinant or synthetic polypeptide to induce a specific immune response in 
appropriate animals or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application are those cells which are 
engaged in extracellular or intracellular membrane trafficking, including the export of 
10 secretory or enzymatic molecules as part of a normal or disease process. 

The terms "complementary" or "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence 5 5 -AGT-3 ' binds to the 
complementary sequence 3'-TCA-5\ Complementarity between two single-stranded 
molecules may be "partial" such that only certain portion(s) of the nucleic acids bind or it 
15 may be "complete" such that total complementarity exists between the single stranded 

molecules. The degree of complementarity between the nucleic acid strands has significant 
effects on the efficiency and strength of the hybridization between the nucleic acid strands. 

The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 
differentiated cell types in an embryo or an adult, including the germ cells. The term "germ 
20 line stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a 
steady and continuous source of germ cells for the production of gametes. The term 
"primordial germ cells (PGCs)" refers to a small population of cells set aside from other cell 
lineages particularly from the yolk sac, mesenteries, or gonadal ridges during embryogenesis 
that have the potential to differentiate into germ cells and other cells. PGCs are the source 
25 from which GSCs and ES cells are derived. The PGCs, the GSCs and the ES cells are 

capable of self-renewal. Thus these cells not only populate the germ line and give rise to a 
plurality of terminally differentiated cells that comprise the adult specialized organs, but are 
able to regenerate themselves. 

The term "expression modulating fragment," EMF, means a series of nucleotides 
30 which modulates the expression of an operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked 
sequence" when the expression of the sequence is altered by the presence of the EMF. 
EMFs include, but are not limited to, promoters, and promoter modulating sequences 
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(inducible elements). One class of EMFs are nucleic acid fragments which induce the 
expression of an operably linked ORF in response to a specific regulatory factor or 

physiological event. 

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 
5 "oligonucleotide" are used interchangeably and refer to a heteropolymer of nucleotides or 
the sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the 
sense or the antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like 
material. In the sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and 

10 N is A, C, G, or T (U) or unknown. It is contemplated that where the polynucleotide is 
RNA, the T (thymine) in the sequences provided herein is substituted with U (uracil). 
Generally, nucleic acid segments provided by this invention may be assembled from 
fragments of the genome and short oligonucleotide linkers, or from a series of 
oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic acid which is 

1 5 capable of being expressed in a recombinant transcriptional unit comprising regulatory 
elements derived from a microbial or viral operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or 
"segment" or "probe" or "primer" are used interchangeably and refer to a sequence of 
nucleotide residues which are at least about 5 nucleotides, more preferably at least about 7 

20 nucleotides, more preferably at least about 9 nucleotides, more preferably at least about 1 1 
nucleotides and most preferably at least about 17 nucleotides. The fragment is preferably 
less than about 500 nucleotides, preferably less than about 200 nucleotides, more preferably 
less than about 100 nucleotides, more preferably less than about 50 nucleotides and most 
preferably less than 30 nucleotides. Preferably the probe is from about 6 nucleotides to 

25 about 200 nucleotides, preferably from about 1 5 to about 50 nucleotides, more preferably 
from about 17 to 30 nucleotides and most preferably from about 20 to 25 nucleotides. 
Preferably the fragments can be used in polymerase chain reaction (PCR), various 
hybridization procedures or microarray procedures to identify or amplify identical or related 
parts of mRNA or DNA molecules. A fragment or segment may uniquely identify each 

30 polynucleotide sequence of the present invention. Preferably the fragment comprises a 
sequence substantially similar to any one of SEQ ID NO: 1-911, or 1823-2478. 

Probes may, for example, be used to determine whether specific mRNA molecules 
are present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal 
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DNA as described by Walsh et al. (Walsh, P.S. et al., 1992, PCR Methods Appl 1 :241-250). 
They may be labeled by nick translation, Klenow fill-in reaction, PCR, or other methods 
well known in the art. Probes of the present invention, their preparation and/or labeling are 
elaborated in Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual, Cold 
5 Spring Harbor Laboratory, NY; or Ausubel, F.M. et al., 1989, Current Protocols in 

Molecular Biology, John Wiley & Sons, New York NY, both of which are incorporated 
herein by reference in their entirety. 

The nucleic acid sequences of the present invention also include the sequence 
information from the nucleic acid sequences of SEQ ID NO: 1-91 1, or 1823-2478. The 

10 sequence information can be a segment of any one of SEQ ID NO: 1-91 1, or 1823-2478 that 
uniquely identifies or represents the sequence information of that sequence of SEQ ID NO: 
1-91 1, or 1823-2478, or those segments identified in Tables 3A, 3B, 5, or 6. One such 
segment can be a twenty-mer nucleic acid sequence because the probability that a twenty- 
mer is fully matched in the human genome is 1 in 300. In the human genome, there are three 

15 billion base pairs in one set of chromosomes. Because 4 20 possible twenty-mers exist, there 
are 300 times more twenty-mers than there are base pairs in a set of human chromosomes. 
LTsing the same analysis, the probability for a seventeen-mer to be fully matched in the 
human genome is approximately 1 in 5. When these segments are used in arrays for 
expression studies, fifteen-mer segments can be used. The probability that the fifteen-mer is 

20 fully matched in the expressed sequences is also approximately one in five because 

expressed sequences comprise less than approximately 5% of the entire genome sequence. 

Similarly, when using sequence information for detecting a single mismatch, a segment 
can be a twenty-five mer. The probability that the twenty-five mer would appear in a human 
genome with a single mismatch is calculated by multiplying the probability for a full match 

25 (1h-4 25 ) times the increased probability for mismatch at each nucleotide position (3 x 25). The 
probability that an eighteen mer with a single mismatch can be detected in an array for 
expression studies is approximately one in five. The probability that a twenty-mer with a single 
mismatch can be detected in a human genome is approximately one in five. 

The term "open reading frame," ORF, means a series of nucleotide triplets coding for 

30 amino acids without any termination codons and is a sequence translatable into protein. 

The terms "operably linked" or "operably associated" refer to functionally related 
nucleic acid sequences. For example, a promoter is operably associated or operably linked 
with a coding sequence if the promoter controls the transcription of the coding sequence. 
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While operably linked nucleic acid sequences can be contiguous and in the same reading 
frame, certain genetic elements e.g. repressor genes are not contiguously linked to the coding 
sequence but still control transcription/translation of the coding sequence. 

The term "pluripotent" refers to the capability of a cell to differentiate into a number 
5 of differentiated cell types that are present in an adult organism. A pluripotent cell is 
restricted in its differentiation capability in comparison to a totipotent cell. 

The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an 
oligopeptide, peptide, polypeptide or protein sequence or fragment thereof and to naturally 
occurring or synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a 

10 stretch of amino acid residues of at least about 5 amino acids, preferably at least about 7 

amino acids, more preferably at least about 9 amino acids and most preferably at least about 
17 or more amino acids. The peptide preferably is not greater than about 200 amino acids, 
more preferably less than 150 amino acids and most preferably less than 100 amino acids. 
Preferably the peptide is from about 5 to about 200 amino acids. To be active, any 

15 polypeptide must have sufficient length to display biological and/or immunological activity. 

The term "naturally occurring polypeptide" refers to polypeptides produced by cells 
that have not been genetically engineered and specifically contemplates various polypeptides 
arising from post-translational modifications of the polypeptide including, but not limited to, 
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 

20 The term "translated protein coding portion" means a sequence which encodes for the 

full-length protein which may include any leader sequence or any processing sequence. 

The term "mature protein coding sequence" means a sequence which encodes a 
peptide or protein without a signal or leader sequence. The "mature protein portion" means 
that portion of the protein which does not include a signal or leader sequence. The peptide 

25 may have been produced by processing in the cell which removes any leader/signal 

sequence. The mature protein portion may or may not include the initial methionine residue. 
The methionine residue may be removed from the protein during processing in the cell. The 
peptide may be produced synthetically or the protein may have been produced using a 
polynucleotide only encoding for the mature protein coding sequence. 

30 The term "derivative" refers to polypeptides chemically modified by such techniques 

as ubiquitination, labeling (e.g., with radionuclides or various en2ymes), covalent polymer 
attachment such as pegylation (derivatization with polyethylene glycol) and insertion or 



WO 03/054152 PCT/US02/39555 

13 

substitution by chemical synthesis of amino acids such as ornithine, which do not normally 

occur in human proteins. 

The term "variant"(or "analog") refers to any polypeptide differing from naturally 
occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, 
5 e g., recombinant DNA techniques. Guidance in determining which amino acid residues 
may be replaced, added or deleted without abolishing activities of interest, may be found by 
comparing the sequence of the particular polypeptide with that of homologous peptides and 
minimizing the number of amino acid sequence changes made in regions of high homology 
(conserved regions) or by replacing amino acids with consensus sequence. 
10 Alternatively, recombinant variants encoding these same or similar polypeptides may 

be synthesized or selected by making use of the "redundancy" in the genetic code. Various 
codon substitutions, such as the silent changes which produce various restriction sites, may 
be introduced to optimize cloning into a plasmid or viral vector or expression in a particular 
prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be 
1 5 reflected in the polypeptide or domains of other peptides added to the polypeptide to modify 
the properties of any part of the polypeptide, to change characteristics such as ligand-binding 
affinities, interchain affinities, or degradation/turnover rate. 

Preferably, amino acid "substitutions" are the result of replacing one amino acid with 
another amino acid having similar structural and/or chemical properties, i.e., conservative 
20 amino acid replacements. "Conservative" amino acid substitutions may be made on the 

basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the 
amphipathic nature of the residues involved. For example, nonpolar (hydrophobic) amino 
acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and 
methionine; polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, 
25 asparagine, and glutamine; positively charged (basic) amino acids include arginine, lysine, 
and histidine; and negatively charged (acidic) amino acids include aspartic acid and glutamic 
acid. "Insertions" or "deletions" are preferably in the range of about 1 to 20 amino acids, 
more preferably 1 to 10 amino acids. The variation allowed may be experimentally 
determined by systematically making insertions, deletions, or substitutions of amino acids in 
30 a polypeptide molecule using recombinant DNA techniques and assaying the resulting 
recombinant variants for activity. 

Alternatively, where alteration of function is desired, insertions, deletions or 
non-conservative alterations can be engineered to produce altered polypeptides. Such 
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alterations can, for example, alter one or more of the biological functions or biochemical 
characteristics of the polypeptides of the invention. For example, such alterations may 
change polypeptide characteristics such as ligand-binding affinities, interchain affinities, or 
degradation/turnover rate. Further, such alterations can be selected so as to generate 
5 polypeptides that are better suited for expression, scale up and the like in the host cells 
chosen for expression. For example, cysteine residues can be deleted or substituted with 
another amino acid residue in order to eliminate disulfide bridges. 

The terms "purified" or "substantially purified" as used herein denotes that the 
indicated nucleic acid or polypeptide is present in the substantial absence of other biological 
10 macromolecules, e.g., polynucleotides, proteins, and the like. In one embodiment, the 
polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, 
more preferably at least 99% by weight, of the indicated biological macromolecules present 
(but water, buffers, and other small molecules, especially molecules having a molecular 
weight of less than 1000 daltons, can be present). 
1 5 The term "isolated" as used herein refers to a nucleic acid or polypeptide separated 

ftom at least one other component (e.g., nucleic acid or polypeptide) present with the nucleic 
acid or polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide 
is found in the presence of (if anything) only a solvent, buffer, ion, or other component 
normally present in a solution of the same. The terms "isolated" and "purified" do not 
20 encompass nucleic acids or polypeptides present in their natural source. 

The term "recombinant," when used herein to refer to a polypeptide or protein, means 
that a polypeptide or protein is derived from recombinant {e.g., microbial, insect, or 
mammalian) expression systems. "Microbial" refers to recombinant polypeptides or proteins 
made in bacterial or fungal (e.g., yeast) expression systems. As a product, "recombinant 
25 microbial" defines a polypeptide or protein essentially free of native endogenous substances 
and unaccompanied by associated native glycosylation. Polypeptides or proteins expressed 
in most bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; 
polypeptides or proteins expressed in yeast will have a glycosylation pattern in general 
different from those expressed in mammalian cells. 
30 The term "recombinant expression vehicle or vector" refers to a plasmid or phage or 

virus or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression 
vehicle can comprise a transcriptional unit comprising an assembly of (1) a genetic element 
or elements having a regulatory role in gene expression, for example, promoters or 
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enhancers, (2) a structural or coding sequence which is transcribed into mRNA and 
translated into protein, and (3) appropriate transcription initiation and termination sequences. 
Structural units intended for use in yeast or eukaryotic expression systems preferably include 
a leader sequence enabling extracellular secretion of translated protein by a host cell. 
5 Alternatively, where recombinant protein is expressed without a leader or transport 

sequence, it may include an amino terminal methionine residue. This residue may or may 
not be subsequently cleaved from the expressed recombinant protein to provide a final 
product. 

The term "recombinant expression system" means host cells which have stably 
10 integrated a recombinant transcriptional unit into chromosomal DNA or carry the 

recombinant transcriptional unit extrachromosomally. Recombinant expression systems as 
defined herein will express heterologous polypeptides or proteins upon induction of the 
regulatory elements linked to the DNA segment or synthetic gene to be expressed. This term 
also means host cells which have stably integrated a recombinant genetic element or 
1 5 elements having a regulatory role in gene expression, for example, promoters or enhancers. 
Recombinant expression systems as defined herein will express polypeptides or proteins 
endogenous to the cell upon induction of the regulatory elements linked to the endogenous 
DNA segment or gene to be expressed. The cells can be prokaryotic or eukaryotic. 
The term "secreted" includes a protein that is transported across or through a 
20 membrane, including transport as a result of signal sequences in its amino acid sequence 
when it is expressed in a suitable host cell. "Secreted" proteins include without limitation 
proteins secreted wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell in 
which they are expressed. "Secreted" proteins also include without limitation proteins that 
are transported across the membrane of the endoplasmic reticulum. "Secreted" proteins are 

i 

25 also intended to include proteins containing non-typical signal sequences (e.g. Interleukin-1 
Beta, see Krasney, P. A. and Young, P.R. (1992) Cytokine 4(2): 134 -143) and factors 

r 

released from damaged cells (e.g. Interleukin-1 Receptor Antagonist, see Arend, W.P. et. al. 
(1998) Annu. Rev. Immunol. 16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or leader 
30 sequence" which will direct the polypeptide through the membrane of a cell. Such a 

sequence may be naturally present on the polypeptides of the present invention or provided 
from heterologous protein sources by recombinant DNA techniques. 



WO 03/054152 PCT/US02/39555 

16 

The term "stringent" is used to refer to conditions that are commonly understood in 
the art as stringent. Stringent conditions can include highly stringent conditions (i.e., 
hybridization to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 
mM EDTA at 65°C, and washing in 0.1X SSC/0.1% SDS at 68°C), and moderately stringent 
5 conditions (i.e., washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary hybridization 
conditions are described herein in the examples. 

In instances of hybridization of deoxyoligonucleotides, additional exemplary 
stringent hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate 
at 37°C (for 14-base oligonucleotides), 48°C (for 17-base oligonucleotides), 55°C (for 20- 

10 base oligonucleotides), and 60°C (for 23-base oligonucleotides). 

As used herein, "substantially equivalent" or "substantially similar" can refer both to 
nucleotide and amino acid sequences, for example a mutant sequence, that varies from a 
reference sequence by one or more substitutions, deletions, or additions, the net effect of 
which does not result in an adverse functional dissimilarity between the reference and 

15 subject sequences. Typically, such a substantially equivalent sequence varies from one of 
those listed herein by no more than about 35% {i.e., the number of individual residue 
substitutions, additions, and/or deletions in a substantially equivalent sequence, as compared 
to the corresponding reference sequence, divided by the total number of residues in the 
substantially equivalent sequence is about 0.35 or less). Such a sequence is said to have 

20 65% sequence identity to the listed sequence. In one embodiment, a substantially 

equivalent, e.g., mutant, sequence of the invention varies from a listed sequence by no more 
than 30% (70% sequence identity); in a variation of this embodiment, by no more than 25% 
(75% sequence identity); and in a further variation of this embodiment, by no more than 
20% (80% sequence identity) and in a further variation of this embodiment, by no more than 

25 10% (90% sequence identity) and in a further variation of this embodiment, by no more that 
5% (95% sequence identity). Substantially equivalent, e.g., mutant, amino acid sequences 
according to the invention preferably have at least 80% sequence identity with a listed amino 
acid sequence, more preferably at least 85% sequence identity, more preferably at least 90% 
sequence identity, more preferably at least 95% sequence identity, more preferably at least 

30 98% sequence identity, and most preferably at least 99% sequence identity. Substantially 
equivalent nucleotide sequence of the invention can have lower percent sequence identities, 
taking into account, for example, the redundancy or degeneracy of the genetic code. 
Preferably, the nucleotide sequence has at least about 65% identity, more preferably at least 
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about 75% identity, more preferably at least about 80% sequence identity, more preferably at 
least 85%> sequence identity, more preferably at least 90% sequence identity, more preferably 
at least about 95% sequence identity, more preferably at least 98% sequence identity, and 
most preferably at least 99% sequence identity. For the purposes of the present invention, 
5 sequences having substantially equivalent biological activity and substantially equivalent 
expression characteristics are considered substantially equivalent. For the purposes of 
determining equivalence, truncation of the mature sequence (e.g., via a mutation which 
creates a new stop codon) should be disregarded. Sequence identity may be determined, 
e.g., using the Jotun Hein method (Hein, J. (1990) Methods Enzymol. 183:626-645). 
10 Identity between sequences can also be determined by other methods known in the art, e.g. 
by varying hybridization conditions. 

The term "totipotent" refers to the capability of a cell to differentiate into all of the 

cell types of an adult organism. 

The term "transformation" means introducing DNA into a suitable host cell so that 
1 5 the DNA is replicable, either as an extrachromosomal element, or by chromosomal 

integration. The term "transfection" refers to the taking up of an expression vector by a 
suitable host cell, whether or not any coding sequences are in fact expressed. The term 
"infection" refers to the introduction of nucleic acids into a suitable host cell by use of a 
virus or viral vector. 

20 As used herein, an "uptake modulating fragment," UMF, means a series of 

nucleotides which mediate the uptake of a linked DNA fragment into a cell. UMFs can be 
readily identified using known UMFs as a target sequence or target motif with the 
computer-based systems described below. The presence and activity of a UMF can be 
confirmed by attaching the suspected UMF to a marker sequence. The resulting nucleic acid 

25 molecule is then incubated with an appropriate host under appropriate conditions and the 

uptake of the marker sequence is determined. As described above, a UMF will increase the 
frequency of uptake of a linked marker sequence. 

Each of the above terms is meant to encompass all that is described for each, unless 
the context dictates otherwise. 

30 

4.2 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 
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The isolated polynucleotides of the invention include a polynucleotide comprising 
the nucleotide sequences of SEQ ID NO: 1-911, or 1823-2478; a polynucleotide encoding 
any one of the peptide sequences of SEQ ID NO: 1-911, or 1 823-2478; and a polynucleotide 
-comprising the nucleotide sequence encoding the mature protein coding sequence of the 
5 polynucleotides of any one of SEQ ID NO : 1 -9 1 1 , or 1 823-2478. The polynucleotides of the 
present invention also include, but are not limited to, a polynucleotide that hybridizes under 
stringent conditions to (a) the complement of any of the nucleotides sequences of SEQ ID 
NO: 1-911, or 1823-2478; (b) nucleotide sequences encoding any one of the amino acid 
sequences set forth in the Sequence Listing, ; (c) a polynucleotide which is an allelic variant 

10 of any polynucleotide recited above; (d) a polynucleotide which encodes a species 

homologue of any of the proteins recited above; or (e) a polynucleotide that encodes a 
polypeptide comprising a specific domain or truncation of the polypeptides of SEQ ID NO: 
912-1822, or 2479-3134 (for example, as set forth in Tables 3A, 3B, 5, or 6). Domains of 
interest may depend on the nature of the encoded polypeptide; e.g., domains in receptor-like 

15 polypeptides include ligand-binding, extracellular, transmembrane, or cytoplasmic domains, 
or combinations thereof; domains in immunoglobulin-like proteins include the variable 
immunoglobulin-like domains; domains in enzyme-like polypeptides include catalytic and 
substrate binding domains; and domains in ligand polypeptides include receptor-binding 
domains. 

20 The polynucleotides of the invention include naturally occurring or wholly or 

partially synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The 
polynucleotides may include entire coding region of the cDNA or may represent a portion of 
the coding region of the cDNA. 

The present invention also provides genes corresponding to the cDNA sequences 

25 disclosed herein. The corresponding genes can be isolated in accordance with known methods 
using the sequence information disclosed herein. Such methods include the preparation of 
probes or primers from the disclosed sequence information for identification and/or 
amplification of genes in appropriate genomic libraries or other sources of genomic materials. 
Further 5' and 3* sequence can be obtained using methods known in the art. For example, full 

30 length cDNA or genomic DNA that corresponds to any of the polynucleotides of SEQ ID NO: 
1-91 1 ? or 1823-2478 can be obtained by screening appropriate cDNA or genomic DNA 
libraries under suitable hybridization conditions using any of the polynucleotides of SEQ ID 
NO: 1-91 1, or 1823-2478 or a portion thereof as a probe. Alternatively, the polynucleotides of 
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SEQ ID NO: 1-91 1, or 1823-2478 may be used as the basis for suitable primer(s) that allow 
identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries. 

The nucleic acid sequences of the invention can be assembled from ESTs and sequences 
(including cDNA and genomic sequences) obtained from one or more public databases, such as 
5 dbEST, gbpri, and UmGene. The EST sequences can provide identifying sequence 

information, representative fragment or segment information, or novel segment information for 

the full-length gene. 

The polynucleotides of the invention also provide polynucleotides including 
nucleotide sequences that are substantially equivalent to the polynucleotides recited above. 

10 Polynucleotides according to the invention can have, e.g., at least about 65%, at least about 
70%, at least about 75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least 
about 85%, 86%, 87%, 88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, 
and even more typically at least about 95%, 96%, 97%, 98%, 99% sequence identity to a 
polynucleotide recited above. 

1 5 Included within the scope of the nucleic acid sequences of the invention are nucleic 

acid sequence fragments that hybridize under stringent conditions to any of the nucleotide 
sequences of SEQ ID NO: 1-911, or 1823-2478, or complements thereof, which fragment is 
greater than about 5 nucleotides, preferably 7 nucleotides, more preferably greater than 9 
nucleotides and most preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, or 20 

20 nucleotides or more that are selective for (i.e. specifically hybridize to) any one of the 
polynucleotides of the invention are contemplated. Probes capable of specifically 
hybridizing to a polynucleotide can differentiate polynucleotide sequences of the invention 
from other polynucleotide sequences in the same family of genes or can differentiate human 
genes from genes of other species, and are preferably based on unique nucleotide sequences. 

25 The sequences falling within the scope of the present invention are not limited to these 

specific sequences, but also include allelic and species variations thereof. Allelic and species 
variations can be routinely determined by comparing the sequence provided in SEQ ID NO: 1- 
91 1, or 1823-2478, a representative fragment thereof, or a nucleotide sequence at least 90% 
identical, preferably 95% identical, to SEQ ID NO: 1-91 1, or 1823-2478 with a sequence from 

30 another isolate of the same species. Furthermore, to accommodate codon variability, the 

invention includes nucleic acid molecules coding for the same amino acid sequences as do the 
specific ORFs disclosed herein. In other words, in the coding region of an ORF, substitution of 
one codon for another codon that encodes the same amino acid is expressly contemplated. 



WO 03/054152 PCT/US02/39555 

20 

The nearest neighbor or homology results for the nucleic acids of the present invention, 
including SEQ ID NO: 1-91 1, or 1823-2478 can be obtained by searching a database using an 
algorithm or a program. Preferably, a BLAST (Basic Local Alignment Search Tool) program is 
used to search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 (1993) and 
5 Altschul S.F. et al. J. Mol. Biol. 21 :403-410 (1990)). Alternatively a FASTA version 3 search 
against Genpept, using FASTXY algorithm may be performed. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are 
also provided by the present invention. Species homologs may be isolated and identified by 
making suitable probes or primers from the sequences provided herein and screening a 
10 suitable nucleic acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides or 
proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide which 
also encode proteins which are identical, homologous or related to that encoded by the 
polynucleotides. 

15 The nucleic acid sequences of the invention are further directed to sequences which 

encode variants of the described nucleic acids. These amino acid sequence variants may be 
prepared by methods known in the art by introducing appropriate nucleotide changes into a 
native or variant polynucleotide. There are two variables in the construction of amino acid 
sequence variants: the location of the mutation and the nature of the mutation. Nucleic 

20 acids encoding the amino acid sequence variants are preferably constructed by mutating the 
polynucleotide to encode an amino acid sequence that does not occur in nature. These 
nucleic acid alterations can be made at sites that differ in the nucleic acids from different 
species (variable positions) or in highly conserved regions (constant regions). Sites at such 
locations will typically be modified in series, e.g., by substituting first with conservative 

25 choices (e.g. , hydrophobic amino acid to a different hydrophobic amino acid) and then with 
more distant choices (e.g., hydrophobic amino acid to a charged amino acid), and then 
deletions or insertions may be made at the target site. Amino acid sequence deletions 
generally range from about 1 to 30 residues, preferably about 1 to 10 residues, and are 
typically contiguous. Amino acid insertions include amino- and/or carboxyl-terminal 

30 fusions ranging in length from one to one hundred or more residues, as well as intrasequence 
insertions of single or multiple amino acid residues. Intrasequence insertions may range 
generally from about 1 to 10 amino residues, preferably from 1 to 5 residues. Examples of 
terminal insertions include the heterologous signal sequences necessary for secretion or for 
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intracellular targeting in different host cells and sequences such as FLAG or poly-histidine 
sequences useful for purifying the expressed protein. 

In a preferred method, polynucleotides encoding the novel amino acid sequences are 
changed via site-directed mutagenesis. This method uses oligonucleotide sequences to alter 
5 a polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent 

nucleotides on both sides of the changed amino acid to form a stable duplex on either side of 
the site of being changed. In general, the techniques of site-directed mutagenesis are well 
known to those of skill in the art and this technique is exemplified by publications such as, 
Edelman et al., DNA 2: 183 (1983). A versatile and efficient method for producing 

10 site-specific changes in a polynucleotide sequence was published by Zoller and Smith, 
Nucleic Acids Res. 10:6487-6500 (1982). PGR may also be used to create amino acid 
sequence variants of the novel nucleic acids. When small amounts of template DNA are 
used as starting material, primer(s) that differs slightly in sequence from the corresponding 
region in the template DNA can generate the desired amino acid variant. PGR amplification 

1 5 results in a population of product DNA fragments that differ from the polynucleotide 

template encoding the polypeptide at the position specified by the primer. The product DNA 
fragments replace the corresponding region in the plasmid and this gives a polynucleotide 
encoding the desired amino acid variant. 

A further technique for generating amino acid variants is the cassette mutagenesis 

20 technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques 
well known in the art, such as, for example, the techniques in Sambrook et al., supra, and 
Current Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of 
the genetic code, other DNA sequences which encode substantially the same or a 
functionally equivalent amino acid sequence may be used in the practice of the invention for 

25 the cloning and expression of these novel nucleic acids. Such DNA sequences include those 
which are capable of hybridizing to the appropriate novel nucleic acid sequence under 
stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention could be 
used to generate polynucleotides encoding chimeric or fusion proteins comprising one or 
30 more domains of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any of 
the polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, 
amplified, or synthetic) or RNA. Methods and algorithms for obtaining such 
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polynucleotides are well known to those of skill in the art and can include, for example, 
methods for determining hybridization conditions that can routinely isolate polynucleotides 

of the desired sequence identities. 

In accordance with the invention, polynucleotide sequences comprising the mature 
5 protein coding sequences corresponding to any one of SEQ ID NO: 1-91 1, or 1823-2478, or 
functional equivalents thereof, may be used to generate recombinant DNA molecules that 
direct the expression of that nucleic acid, or a functional equivalent thereof, in appropriate 
host cells. Also included are the cDNA inserts of any of the clones identified herein. 

A polynucleotide according to the invention can be joined to any of a variety of other 

10 nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et 
al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). 
Useful nucleotide sequences for joining to polynucleotides include an assortment of vectors, 
e.g., plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well 
known in the art. Accordingly, the invention also provides a vector including a 

15 polynucleotide of the invention and a host cell containing the polynucleotide. In general, the 
vector contains an origin of replication functional in at least one organism, convenient 
restriction endonuclease sites, and a selectable marker for the host cell. Vectors according to 
the invention include expression vectors, replication vectors, probe generation vectors, and 
sequencing vectors. A host cell according to the invention can be a prokaryotic or 

20 eukaryotic cell and can be a unicellular organism or part of a multicellular organism. 

The present invention further provides recombinant constructs comprising a nucleic 
acid having any of the nucleotide sequences of SEQ ID NO: 1-911, or 1823-2478 or a 
fragment thereof or any other polynucleotides of the invention. In one embodiment, the 
recombinant constructs of the present invention comprise a vector, such as a plasmid or viral 

25 vector, into which a nucleic acid having any of the nucleotide sequences of SEQ ID NO: 1- 
9 1 1 , or 1 823-2478 or a fragment thereof is inserted, in a forward or reverse orientation. In 
the case of a vector comprising one of the OKFs of the present invention, the vector may 
further comprise regulatory sequences, including for example, a promoter, operably linked to 
the ORF. Large numbers of suitable vectors and promoters are known to those of skill in the 

30 art and are commercially available for generating the recombinant constructs of the present 
invention. The following vectors are provided by way of example: Bacterial: pBs, 
phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, pNH16a, pNH18a, pNH46a 
(Stratagene), pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia); Eukaryotic: 
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pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an expression 
control sequence such as the pMT2 or pED expression vectors disclosed in Kaufinan et al., 
5 Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinantly. 
Many suitable expression control sequences are known in the art. General methods of 
expressing recombinant proteins are also known and are exemplified in R. Kaufman, 
Methods in Enzymology 185, 537-566 (1990). As defined herein "operably linked" means 
that the isolated polynucleotide of the invention and an expression control sequence are 
1 0 situated within a vector or cell in such a way that the protein is expressed by a host cell 

which has been transformed (transfected) with the ligated polynucleotide/expression control 
sequence. 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 

1 5 appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters include 
lad, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic promoters include CMV immediate 
early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse 
metallothionein-I. Selection of the appropriate vector and promoter is well within the leyel 
of ordinary skill in the art. Generally, recombinant expression vectors will include origins of 

20 replication and selectable markers permitting transformation of the host cell, e.g. , the 

ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived 
from a highly expressed gene to direct transcription of a downstream structural sequence. 
Such promoters can be derived from operons encoding glycolytic enzymes such as 3- 
phosphoglycerate kinase (PGK), a-factor, acid phosphatase, or heat shock proteins, among 

25 others. The heterologous structural sequence is assembled in appropriate phase with 

translation initiation and termination sequences, and preferably, a leader sequence capable of 
directing secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an amino 
terminal identification peptide imparting desired characteristics, e.g., stabilization or 

3 0 simplified purification of expressed recombinant product. Useful expression vectors for 
bacterial use are constructed by inserting a structural DNA sequence encoding a desired 
protein together with suitable translation initiation and termination signals in operable 
reading phase with a functional promoter. The vector will comprise one or more phenotypic 
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selectable markers and an origin of replication to ensure maintenance of the vector and to, if 
desirable, provide amplification within the host. Suitable prokaryotic hosts for 
transformation include E. coli, Bacillus subtilis, Salmonella typhimurium and various species 
within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may 
5 also be employed as a matter of choice. 

As a representative but non-limiting example, useful expression vectors for bacterial 
use can comprise a selectable marker and bacterial origin of replication derived from 
commercially available plasmids comprising genetic elements of the well known cloning 
vector pBR322 (ATCC 37017). Such commercial vectors include, for example, pKK223-3 

10 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, 
USA). These pBR322 "backbone" sections are combined with an appropriate promoter and 
the structural sequence to be expressed. Following transformation of a suitable host strain 
and growth of the host strain to an appropriate cell density, the selected promoter is induced 
or derepressed by appropriate means (e.g., temperature shift or chemical induction) and cells 

1 5 are cultured for an additional period. Cells are typically harvested by centrifugation, 

disrupted by physical or chemical means, and the resulting crude extract retained for further 
purification. 

Polynucleotides of the invention can also be used to induce immune responses. For 
example, as described in Fan et al., Nat. Biotech 17, 870-872 (1999), incorporated herein by 
20 reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies 
against the encoded polypeptide following topical administration of naked plasmid DNA or 
following injection, and preferably intra-muscular injection of the DNA. The nucleic acid 
sequences are preferably inserted in a recombinant expression vector and may be in the form 
of naked DNA. 

25 

43 ANTISENSE 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules 
that are hybridizable to or complementary to the nucleic acid molecule comprising the 
nucleotide sequence of SEQ ID NO: 1-911, or 1823-2478, or fragments, analogs or 
30 derivatives thereof. An "antisense" nucleic acid comprises a nucleotide sequence that is 
complementary to a "sense" nucleic acid encoding a protein, e.g., complementary to the 
coding strand of a double-stranded cDNA molecule or complementary to an mRNA 
sequence. In specific aspects, antisense nucleic acid molecules are provided that comprise a 
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sequence complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an 
entire coding strand, or to only a portion thereof. Nucleic acid molecules encoding 
fragments, homologs, derivatives and analogs of a protein of any of SEQ ID NO : 1 -9 1 1 , or 
1823-2478 or antisense nucleic acids complementary to a nucleic acid sequence of SEQ ID 
NO: 1-911, or 1823-2478 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence of the invention. The term "coding 
region" refers to the region of the nucleotide sequence comprising codons which are 
translated into amino acid residues. In another embodiment, the antisense nucleic acid 
molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence 
of the invention. The term "noncoding region" refers to 5' and 3' sequences that flank the 
coding region that are not translated into amino acids (z.e., also referred to as 5' and 3' 
untranslated regions). 

Given the coding strand sequences encoding a nucleic acid disclosed herein {e.g., 
SEQ ID NO: 1-911, or 1 823-2478, antisense nucleic acids of the invention can be designed 
according to the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic 
acid molecule can be complementary to the entire coding region of an mRNA, but more 
preferably is an oligonucleotide that is antisense to only a portion of the coding or noncoding 
region of an mRNA. For example, the antisense oligonucleotide can be complementary to 
the region surrounding the translation start site of an mRNA. An antisense oligonucleotide 
can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An 
antisense nucleic acid of the invention can be constructed using chemical synthesis or 
enzymatic ligation reactions using procedures known in the art. For example, an antisense 
nucleic acid {e.g., an antisense oligonucleotide) can be chemically synthesized using 
naturally occurring nucleotides or variously modified nucleotides designed to increase the 
biological stability of the molecules or to increase the physical stability of the duplex formed 
between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and 
acridine substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense nucleic 
acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, 
xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 
carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 
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1- methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3- 
methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 

methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil ? beta-D-mannosylqueosine, 
S'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, 
5 uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl- 

2- thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, 
uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, 
(acp3)w, and 2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced 
biologically using an expression vector into which a nucleic acid has been subcloned in an 

10 antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of interest, described further in the following 
subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 

i 

1 5 genomic DNA encoding a protein according to the invention to thereby inhibit expression of 
the protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the 
case of an antisense nucleic acid molecule that binds to DNA duplexes, through specific 
interactions in the major groove of the double helix. An example of a route of 

20 administration of antisense nucleic acid molecules of the invention includes direct injection 
at a tissue site. Alternatively, antisense nucleic acid molecules can be modified to target 
selected cells and then administered systemically. For example, for systemic administration, 
antisense molecules can be modified such that they specifically bind to receptors or antigens 
expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules to 

25 peptides or antibodies that bind to cell surface receptors or antigens. The antisense nucleic 
acid molecules can also be delivered to cells using the vectors described herein. To achieve 
sufficient intracellular concentrations of antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the control of a strong pol II or pol III 
promoter are preferred. 

30 In yet another embodiment, the antisense nucleic acid molecule of the invention is an 

ot-anomeric nucleic acid molecule. An ce-anomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual ct-units, 
the strands run parallel to each other (Gaultier et al (1987) Nucleic Acids Res 15: 
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6625-6641). The antisense nucleic acid molecule can also comprise a 
2'-o-methylribonucleotide (Inoue etal (1987) Nucleic Acids Res 15: 6131-6148) ora 
chimeric RNA -DNA analogue (Inoue etal (1987) FEBS Lett 215: 327-330). 



5 4.4 RIBOZYMES AND PNA MOIETIES 

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of 
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 
complementary region. Thus, ribozymes {e.g., hammerhead ribozymes (described in 

10 Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave 

mRNA transcripts to thereby inhibit translation of an mRNA. A ribozyme having specificity 
for a nucleic acid of the invention can be designed based upon the nucleotide sequence of a 
DNA disclosed herein {i.e., SEQ ID NO: 1-911, or 1823-2478). For example, a derivative of 
Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the 

1 5 active site is complementary to the nucleotide sequence to be cleaved in a mRNA. See, e.g. , 
Cech et al U.S. Pat. No. 4,987,071 ; and Cech et ah U.S. Pat. No. 5,1 16,742. Alternatively, 
mRNA of the invention can be used to select a catalytic RNA having a specific ribonuclease 
activity from a pool of RNA molecules. See, e.g., Bartel et al., (1993) Science 
261:1411-1418. 

20 Alternatively, gene expression can be inhibited by targeting nucleotide sequences 

complementary to the regulatory region {e.g., promoter and/or enhancers) to form triple 
helical structures that prevent transcription of the gene in target cells. See generally, Helene. 
(1991) Anticancer Drug Des. 6: 569-84; Helene. et al (1992) Ann. N. Y. Acad. Sci. 
660:27-36; andMaher (1992) Bioassays 14: 807-15. 

25 In various embodiments, the nucleic acids of the invention can be modified at the 

base moiety, sugar moiety or phosphate backbone to improve, e.g. , the stability, 
hybridization, or solubility of the molecule. For example, the deoxyribose phosphate 
backbone of the nucleic acids can be modified to generate peptide nucleic acids (see Hyrup 
et al (1996) Bio org Med Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" 

30 or "PNAs" refer to nucleic acid mimics, e.g. , DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural 
nucleobases are retained. The neutral backbone of PNAs has been shown to allow for 
specific hybridization to DNA and RNA under conditions of low ionic strength. The 
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synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis 
protocols as described in Hyrup et al (1996) above; Perry-O'Keefe et al (1996) PNAS 93: 
14670-675. 

PNAs of the invention can be used in therapeutic and diagnostic applications. For 
5 example, PNAs can be used as antisense or antigene agents for sequence-specific modulation 
of gene expression by, e.g., inducing transcription or translation arrest or inhibiting 
replication. PNAs of the invention can also be used, e.g., in the analysis of single base pair 
mutations in a gene by, e.g., PNA directed PGR clamping; as artificial restriction enzymes 
when used in combination with other enzymes, e.g., SI nucleases (Hyrup B. (1996) above); 

10 or as probes or primers for DNA sequence and hybridization (Hyrup et al (1 996), above; 
Perry-O'Keefe (1996), above). 

In another embodiment, PNAs of the invention can be modified, e.g., to enhance 
their stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by 
the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 

1 5 delivery known in the art. For example, PNA-DNA chimeras can be generated that may 
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA 
portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of 

20 base stacking, number of bonds between the nucleobases, and orientation (Hyrup (1996) 
above). The synthesis of PNA-DNA chimeras can be performed as described in Hyrup 
(1996) above and Finn et al. (1996) Nucl Acids Res 24: 3357-63. For example, a DNA chain 
can be synthesized on a solid support using standard phosphoramidite coupling chemistry, 
and modified nucleoside analogs, e.g., 5'-(4-methoxytrityl)ammo-5 , -deoxy-thymidine 

25 phosphoramidite, can be used between the PNA and the 5' end of DNA (Mag et al (1989) 
Nucl Acid Res 17: 5973-88). PNA monomers are then coupled in a stepwise manner to 
produce a chimeric molecule with a 5' PNA segment and a V DNA segment (Finn et al. 
(1996) above). Alternatively, chimeric molecules can be synthesized with a 5 ! DNA 
segment and a 3' PNA segment. See, Petersen et al. (1975) Bioorg Med Chem Lett 5: 

30 1119-11124. 

In other embodiments, the oligonucleotide may include other appended groups such 
as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport 
across the cell membrane (see, e.g., Letsinger et al, 1989, Proc. Natl Acad. Set U.S.A. 
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86:6553-6556; Lemaitre et aU 1987, Proc. Natl Acad. Set. 84:648-652; PCT Publication 
No. WO88/O9810) or the blood-brain barrier (see, e.g., PCT Publication No. W089/10134). 
In addition, oligonucleotides can be modified with hybridization triggered cleavage agents 
(See, e.g. 9 Krol et al., 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., 
5 Zon, 1988, Pharm. Res. 5: 539-549). To this end, the oligonucleotide may be conjugated to 
another molecule, e.g., a peptide, a hybridization triggered cross-linking agent, a transport 
agent, a hybridization-triggered cleavage agent, etc. 



4.5 HOSTS 

1 0 The present invention further provides host cells genetically engineered to contain 

the polynucleotides of the invention. For example, such host cells may contain nucleic acids 
of the invention introduced into the host cell using known transformation, transfection or 
infection methods. The present invention still further provides host cells genetically 
engineered to express the polynucleotides of the invention, wherein such polynucleotides are 

15 in operative association with a regulatory sequence heterologous to the host cell which 
drives expression of the polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, or 
increase, expression of endogenous polypeptide. Cells can be modified (e.g., by 
homologous recombination) to provide increased polypeptide expression by replacing, in 

20 whole or in part, the naturally occurring promoter with all or part of a heterologous promoter 
so that the cells express the polypeptide at higher levels. The heterologous promoter is 
inserted in such a manner that it is operatively linked to the encoding sequences. See, for 
example, PCT International Publication No. WO94/12650, PCT International Publication 
No. WO92/20808, and PCT International Publication No. WO91/09955. It is also 

25 contemplated that, in addition to heterologous promoter DNA, amplifiable marker DNA 
(e.g., ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate 
synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron DNA may be 
inserted along with the heterologous promoter DNA. If linked to the coding sequence, 
amplification of the marker DNA by standard selection methods results in co-amplification 

30 of the desired protein coding sequences in the cells. 

The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower 
eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the recombinant construct into the host cell can be effected by 
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calcium phosphate transfection, DEAE, dextran mediated transfection, or electroporation 
(Davis, L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one 
of the polynucleotides of the invention, can be used in conventional manners to produce the 
gene product encoded by the isolated fragment (in the case of an ORF) or can be used to 
5 produce a heterologous protein under the control of the EMF. 

Any host/vector system can be used to express one or more of the ORFs of the 
present invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, 
Cv-1 cell, COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. coli and 
B. subtilis. The most preferred cells are those which do not normally express the particular 

10 polypeptide or protein or which expresses the polypeptide or protein at low natural level. 
Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under 
the control of appropriate promoters. Cell-free translation systems can also be employed to 
produce such proteins using RNAs derived from the DNA constructs of the present 
invention. Appropriate cloning and expression vectors for use with prokaryotic and 

15 eukaryotic hosts are described by Sambrook, et ah, in Molecular Cloning: A Laboratory 
Manual, Second Edition, Cold Spring Harbor, New York (1989), the disclosure of which is 
hereby incorporated by reference. 

Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS-7 lines 

20 of monkey kidney fibroblasts, described by Gluzman, Cell 23: 175 (1981). Other cell lines 
capable of expressing a compatible vector are, for example, the CI 27, monkey COS cells, 
Chinese Hamster Ovary (CHO) cells, human kidney 293 cells, human epidermal A43 1 cells, 
human Colo205 cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal 
diploid cells, cell strains derived from in vitro culture of primary tissue, primary explants, 

25 HeLa cells, mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. Mammalian expression 
vectors will comprise an origin of replication, a suitable promoter and also any necessary 
ribosome binding sites, polyadenylation site, splice donor and acceptor sites, transcriptional 
termination sequences, and 5' flanking nontranscribed sequences. DNA sequences derived 
from the SV40 viral genome, for example, SV40 origin, early promoter, enhancer, splice, 

30 and polyadenylation sites may be used to provide the required nontranscribed genetic 

elements. Recombinant polypeptides and proteins produced in bacterial culture are usually 
isolated by initial extraction from cell pellets, followed by one or more salting-out, aqueous 
ion exchange or size exclusion chromatography steps. Protein refolding steps can be used, 
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as necessary, in completing configuration of the mature protein. Finally, high performance 
liquid chromatography (HPLC) can be employed for final purification steps. Microbial cells 
employed in expression of proteins can be disrupted by any convenient method, including 
freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents. 
5 Alternatively, it may be possible to produce the protein in lower eukaryotes such as 

yeast or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, 
or any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 
strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 
10 strain capable of expressing heterologous proteins. If the protein is made in yeast or 
bacteria, it may be necessary to modify the protein produced therein, for example by 
phosphorylation or glycosylation of the appropriate sites, in order to obtain the functional 
protein. Such covalent attachments may be accomplished using known chemical or 
enzymatic methods. 

1 5 In another embodiment of the present invention, cells and tissues may be engineered 

to express an endogenous gene comprising the polynucleotides of the invention under the 
control of inducible regulatory elements, in which case the regulatory sequences of the 
endogenous gene may be replaced by homologous recombination. As described herein, gene 
targeting can be used to replace a gene's existing regulatory region with a regulatory 

20 sequence isolated from a different gene or a novel regulatory sequence synthesized by 

genetic engineering methods. Such regulatory sequences may be comprised of promoters, 
enhancers, scaffold-attachment regions, negative regulatory elements, transcriptional 
initiation sites, and regulatory protein binding sites or combinations of said sequences. 
Alternatively, sequences which affect the structure or stability of the RNA or protein 

25 produced may be replaced, removed, added, or otherwise modified by targeting. These 
sequence include polyadenylation signals, mRNA stability elements, splice sites, leader 
sequences for enhancing or modifying transport or secretion properties of the protein, or 
other sequences which alter or improve the function or stability of protein or RNA 
molecules. 

30 The targeting event may be a simple insertion of the regulatory sequence, placing the 

gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple 
deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory 
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element. Alternatively, the targeting event may replace an existing element; for example, a 
tissue-specific enhancer can be replaced by an enhancer that has broader or different 
cell-type specificity than the naturally occurring elements. Here, the naturally occurring 
sequences are deleted and new sequences are added. In all cases, the identification of the 
5 targeting event may be facilitated by the use of one or more selectable marker genes that are 
contiguous with the targeting DNA, allowing for the selection of cells in which the 
exogenous DNA has integrated into the host cell genome. The identification of the targeting 
event may also be facilitated by the use of one or more marker genes exhibiting the property 
of negative selection, such that the negatively selectable marker is linked to the exogenous 

1 0 DNA, but configured such that the negatively selectable marker flanks the targeting 

sequence, and such that a correct homologous recombination event with sequences in the 
host cell genome does not result in the stable integration of the negatively selectable marker. 
Markers useful for this purpose include the Herpes Simplex Virus thymidine kinase (TK) 
gene or the bacterial xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

1 5 The gene targeting or gene activation techniques which can be used in accordance 

with this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 
to Chappel; U.S. Patent No. 5,578,461 to Sherwin et aL; International Application No. 
PCT7US92/09627 (WO93/09222) by Selden et al.; and International Application No. 
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by 

20 reference herein in its entirety. 



4.6 POLYPEPTIDES OF THE INVENTION 

The isolated polypeptides of the invention include, but are not limited to, a 
polypeptide comprising: the amino acid sequences set forth as any one of SEQ ID NO: 912- 

25 1 822, or 2479-3 1 34 or an amino acid sequence encoded by any one of the nucleotide 

sequences SEQ ID NO: 1-91 1, or 1823-2478 or the corresponding full length or mature 
protein. Polypeptides of the invention also include polypeptides preferably with biological or 
immunological activity that are encoded by: (a) a polynucleotide having any one of the 
nucleotide sequences set forth in SEQ ID NO: 1-911, or 1823-2478 or (b) polynucleotides 

30 encoding any one of the amino acid sequences set forth as SEQ ID NO: 912-1822, or 2479- 
3 134 or (c) polynucleotides that hybridize to the complement of the polynucleotides of either 
(a) or (b) under stringent hybridization conditions. The invention also provides biologically 
active or immunologically active variants of any of the amino acid sequences set forth as 
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SEQ ID NO: 912-1 822, or 2479-3 134 or the corresponding full length or mature protein; and 
"substantial equivalents" thereof (e.g., with at least about 65%, at least about 70%, at least 
about 75%, at least about 80%, at least about 85%, 86%, 87%, 88%, 89%, at least about 
90%, 91%, 92%, 93%, 94%, typically at least about 95%, 96%, 97%, more typically at least 
5 about 98%, or most typically at least about 99% amino acid identity) that retain biological 
activity. Polypeptides encoded by allelic variants may have a similar, increased, or 
decreased activity compared to polypeptides comprising SEQ ID NO: 912-1822, or 2479- 
3134. 

Fragments of the proteins of the present invention which are capable of exhibiting 

1 0 biological activity are also encompassed by the present invention. Fragments of the protein 
may be in linear form or they may be cyclized using known methods, for example, as 
described in H. U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. 
McDowell, et al., J. Amer. Chem. Soc. 1 14, 9245-9253 (1992), both of which are 
incorporated herein by reference. Such fragments may be fused to carrier molecules such as 

1 5 immunoglobulins for many purposes, including increasing the valency of protein binding 
sites. Fragments are also identified in Tables 3A, 3B, 5, or 6. 

The present invention also provides both full-length and mature forms (for example, 
without a signal sequence or precursor sequence) of the disclosed proteins. The protein 
coding sequence is identified in the sequence listing by translation of the disclosed 

20 nucleotide sequences. The predicted signal sequence is set forth in Table 6. The mature 
form of such protein may be obtained and confirmed by expression of a full-length 
polynucleotide in a suitable mammalian cell or other host cell and sequencing of the cleaved 
product. One of skill in the art will recognize that the actual cleavage site may be different 
than that predicted in Table 6. The sequence of the mature form of the protein is also 

25 determinable from the amino acid sequence of the full-length form. Where proteins of the 
present invention are membrane bound, soluble forms of the proteins are also provided. In 
such forms, part or all of the regions causing the proteins to be membrane bound are deleted 
so that the proteins are fully secreted from the cell in which they are expressed (See, e.g., 
Sakal et al., Prep. Biochem. Biotechnol. (2000), 30(2), pp. 107-23, incorporated herein by 

30 reference). 

Protein compositions of the present invention may further comprise an acceptable 
carrier, such as a hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 
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The present invention further provides isolated polypeptides encoded by the nucleic 
acid fragments of the present invention or by degenerate variants of the nucleic acid 
fragments of the present invention. By "degenerate variant" is intended nucleotide 
fragments which differ from a nucleic acid fragment of the present invention (e.g., an ORF) 
5 by nucleotide sequence but, due to the degeneracy of the genetic code, encode an identical 
polypeptide sequence. Preferred nucleic acid fragments of the present invention are the 

ORFs that encode proteins. 

A variety of methodologies known in the art can be utilized to obtain any one of the 
isolated polypeptides or proteins of the present invention. At the simplest level, the amino 

10 acid sequence can be synthesized using commercially available peptide synthesizers. The 
synthetically-constructed protein sequences, by virtue of sharing primary, secondary or 
tertiary structural and/or conformational characteristics with proteins may possess biological 
properties in common therewith, including protein activity. This technique is particularly 
useful in producing small peptides and fragments of larger polypeptides. Fragments are 

1 5 useful, for example, in generating antibodies against the native polypeptide. Thus, they may 
be employed as biologically active or immunological substitutes for natural, purified 
proteins in screening of therapeutic compounds and in immunological processes for the 

development of antibodies. 

The polypeptides and proteins of the present invention can alternatively be purified 

20 from cells which have been altered to express the desired polypeptide or protein. As used 
herein, a cell is said to be altered to express a desired polypeptide or protein when the cell, 
through genetic manipulation, is made to produce a polypeptide or protein which it normally 
does not produce or which the cell normally produces at a lower level. One skilled in the art 
can readily adapt procedures for introducing and expressing either recombinant or synthetic 

25 sequences into eukaryotic or prokaryotic cells in order to generate a cell which produces one 
of the polypeptides or proteins of the present invention. 

The invention also relates to methods for producing a polypeptide comprising 
growing a culture of host cells of the invention in a suitable culture medium, and purifying 
the protein from the cells or the culture in which the cells are grown. For example, the 

30 methods of the invention include a process for producing a polypeptide in which a host cell 
containing a suitable expression vector that includes a polynucleotide of the invention is 
cultured under conditions that allow expression of the encoded polypeptide. The 
polypeptide can be recovered from the culture, conveniently from the culture medium, or 
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from a lysate prepared from the host cells and further purified. Preferred embodiments 
include those in which the protein produced by such process is a full length or mature form 
of the protein. 

In an alternative method, the polypeptide or protein is purified from bacterial cells 
5 which naturally produce the polypeptide or protein. One skilled in the art can readily follow 
known methods for isolating polypeptides and proteins in order to obtain one of the isolated 
polypeptides or proteins of the present invention. These include, but are not limited to, 
immunoctoomatography, HPLC, size-exclusion chromatography, ion-exchange 
chromatography, and immuno-affinity chromatography. See, e.g., Scopes, Protein 

10 Purification: Principles and Practice, Springer-Verlag (1994); Sambrook, et al., in 

Molecular Cloning: A Laboratory Manual; Ausubel et al., Current Protocols in Molecular 
Biology. Polypeptide fragments that retain biological/immunological activity include 
fragments comprising greater than about 100 amino acids, or greater than about 200 amino 
acids, and fragments that encode specific protein domains. 

1 5 The purified polypeptides can be used in in vitro binding assays which are well 

known in the art to identify molecules which bind to the polypeptides. These molecules 
include but are not limited to, for e.g., small molecules, molecules from combinatorial 
libraries, antibodies or other proteins. The molecules identified in the binding assay are then 
tested for antagonist or agonist activity in in vivo tissue culture or animal models that are 

20 well known in the art. In brief, the molecules are titrated into a plurality of cell cultures or 
animals and then tested for either cell/animal death or prolonged survival of the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the 
peptides may be complexed with toxins, e.g., ricin or cholera, or with other compounds that 
are toxic to cells. The toxin-binding molecule complex is then targeted to a tumor or other 

25 cell by the specificity of the binding molecule for SEQ ID NO: 912-1822, or 2479-3134. 

The protein of the invention may also be expressed as a product of transgenic 
animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are 
characterized by somatic or germ cells containing a nucleotide sequence encoding the 
protein. 

30 The proteins provided herein also include proteins characterized by amino acid 

sequences similar to those of purified proteins but into which modification are naturally 
provided or deliberately engineered. For example, modifications, in the peptide or DNA 
sequence, can be made by those skilled in the art using known techniques. Modifications of 



WO 03/054152 PCT/US02/39555 

36 

interest in the protein sequences may include the alteration, substitution, replacement, 
insertion or deletion of a selected amino acid residue in the coding sequence. For example, 
one or more of the cysteine residues may be deleted or replaced with another amino acid to 
alter the conformation of the molecule. Techniques for such alteration, substitution, 
5 replacement, insertion or deletion are well known to those skilled in the art (see, e.g., U.S. 
Pat. No. 4,518,584). Preferably, such alteration, substitution, replacement, insertion or 
deletion retains the desired activity of the protein. Regions of the protein that are important 
for the protein function can be determined by various methods known in the art including the 
alanine-scanning method which involved systematic substitution of single or strings of 

10 amino acids with alanine, followed by testing the resulting alanine-containing variant for 

biological activity. This type of analysis determines the importance of the substituted amino 
acid(s) in biological activity. Regions of the protein that are important for protein function 
may be determined by the eMATRIX program. 

Other fragments and derivatives of the sequences of proteins which would be 

1 5 expected to retain protein activity in whole or in part and are useful for screening or other 
immunological methodologies may also be easily made by those skilled in the art given the 
disclosures herein. Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of 
the invention to suitable control sequences in one or more insect expression vectors, and 

20 employing an insect expression system. Materials and methods for baculovirus/insect cell 
expression systems are commercially available in kit form from, e.g., Invitrogen, San Diego, 
Calif., U.S.A. (the MaxBat™ kit), and such methods are well known in the art, as described 
in Summers and Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), 
incorporated herein by reference. As used herein, an insect cell capable of expressing a 

25 polynucleotide of the present invention is "transformed." 

The protein of the invention may be prepared by culturing transformed host cells 
under culture conditions suitable to express the recombinant protein. The resulting 
expressed protein may then be purified from such culture (i.e., from culture medium or cell 
extracts) using known purification processes, such as gel filtration and ion exchange 

30 chromatography. The purification of the protein may also include an affinity column 

containing agents which will bind to the protein; one or more column steps over such affinity 
resins as concanavalin A-agarose, heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™; 
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one or more steps involving hydrophobic interaction chromatography using such resins as 
phenyl ether, butyl ether, or propyl ether; or immunoaffinity chromatography. 

Alternatively, the protein of the invention may also be expressed in a form which will 
facilitate purification. For example, it may be expressed as a fusion protein, such as those of 
5 maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as 
a His tag. Kits for expression and purification of such fusion proteins are commercially 
available from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and 
Invitrogen, respectively. The protein can also be tagged with an epitope and subsequently 
purified by using a specific antibody directed to such epitope. One such epitope ("FLAG®") 

10 is commercially available from Kodak (New Haven, Conn.). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- 
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant 
methyl or other aliphatic groups, can be employed to further purify the protein. Some or all 
of the foregoing purification steps, in various combinations, can also be employed to provide 

15 a substantially homogeneous isolated recombinant protein. The protein thus purified is 

substantially free of other mammalian proteins and is defined in accordance with the present 
invention as an "isolated protein." 

The polypeptides of the invention include analogs (variants). This embraces 
fragments, as well as peptides in which one or more amino acids has been deleted, inserted, 

20 or substituted. Also, analogs of the polypeptides of the invention embrace fusions of the 

polypeptides or modifications of the polypeptides of the invention, wherein the polypeptide 
or analog is fused to another moiety or moieties, e.g., targeting moiety or another therapeutic 
agent. Such analogs may exhibit improved properties such as activity and/or stability. 
Examples of moieties which may be fused to the polypeptide or an analog include, for 

25 example, targeting moieties which provide for the delivery of polypeptide to pancreatic cells, 
e.g., antibodies to pancreatic cells, antibodies to immune cells such as T-cells, monocytes, 
dendritic cells, granulocytes, etc., as well as receptor and ligands expressed on pancreatic or 
immune cells. Other moieties which may be fused to the polypeptide include therapeutic 
agents which are used for treatment, for example, immunosuppressive drugs such as 

30 cyclosporin, SK506, azathioprine, CD3 antibodies and steroids. Also, polypeptides may be 
fused to immune modulators, and other cytokines such as alpha or beta interferon. 
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4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE 
IDENTITY AND SIMILARITY 

Preferred identity and/or similarity are designed to give the largest match between 
the sequences tested. Methods to determine identity and similarity are codified in computer 
5 programs including, but are not limited to, the QCG program package, including GAP 

(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, 
University of Wisconsin, Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, 
S.F. et al., J. Molec. Biol. 215:403-410 (1990), PSI-BLAST (Altschul S.F. et al., Nucleic 
Acids Res. vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu 

10 et al., J. Comp. Biol., Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif 
software (Nevill-Manning et al, ISMB-97, Vol. 4, pp. 202-209, herein incorporated by 
reference), Pfam software (Sonnhammer et al., Nucleic Acids Res., Vol. 26(1), pp. 320-322 
(1998), herein incorporated by reference) and the Kyte-Doolittle hydrophobocity prediction 
algorithm (J. Mol Biol, 157, pp. 105-31 (1982), the GeneAtlas software (Molecular 

15 Simulations Inc. (MSI), San Diego, CA) (Sanchez and Sali (1998) Proc. Natl. Acad. Sci., 95, 
13597-13602; Kitson DH et al, (2000) "Remote homology detection using structural 
modeling - an evaluation" Submitted; Fischer and Eisenberg (1996) Protein Sci. 5, 947- 
955), Neural Network SignalP VI. 1 program (from Center for Biological Sequence 
Analysis, The Technical University of Denmark) incorporated herein by reference). 

20 Polypeptide sequences were examined by a proprietary algorithm, SeqLoc that separates the 
proteins into three sets of locales: intracellular, membrane, or secreted. This prediction is 
based upon three characteristics of each polypeptide, including percentage of cysteine 
residues, Kyte-Doolittle scores for the first 20 amino acids of each protein, and Kyte- 
Doolittle scores to calculate the longest hydrophobic stretch of the said protein. Values of 

25 predicted proteins are compared against the values from a set of 592 proteins of known 

cellular localization from the Swissprot database (http://www.expasv.ch/sprot) . Predictions 
are based upon the maximum likelihood estimation. 

Pesence of transmembrane region(s) was detected using the TMpred program 
rtitt p : //www . ch. embnet. or g/software/TMPRED form .html) . 

30 The BLAST programs are publicly available from the National Center for 

Biotechnology Information (NCBI) and other sources (BLAST Manual, Altschul, S., et al. 
NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. Biol. 215:403-410 
(1990). 
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4.7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a "chimeric 
protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to 
another polypeptide. Within a fusion protein the polypeptide according to the invention can 
5 correspond to all or a portion of a protein according to the invention. In one embodiment, a 
fusion protein comprises at least one biologically active portion of a protein according to the 
invention. In another embodiment, a fusion protein comprises at least two biologically 
active portions of a protein according to the invention. Within the fusion protein, the term 
"operatively linked" is intended to indicate that the polypeptide according to the invention 
10 and the other polypeptide are fused in-frame to each other. The polypeptide can be fused to 
the N-terminus or C-terminus, or to the middle. 

For example, in one embodiment a fusion protein comprises a polypeptide according 
to the invention operably linked to the extracellular domain of a second protein. 

In another embodiment, the fusion protein is a GST-fusion protein in which the 
15 polypeptide sequences of the invention are fused to the C-terminus of the GST (i.e., 

glutathione S-transferase) sequences. 

In another embodiment, the fusion protein is an immunoglobulin fusion protein in 
which the polypeptide sequences according to the invention comprise one or more domains 
fused to sequences derived from a member of the immunoglobulin protein family. The 

20 immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical 
compositions and administered to a subject to inhibit an interaction between a ligand and a 
protein of the invention on the surface of a cell, to thereby suppress signal transduction in 
vivo. The immunoglobulin fusion proteins can be used to affect the bioavailability of a 
cognate ligand. Inhibition of the ligand/protein interaction may be useful therapeutically for 

25 both the treatment of proliferative and differentiative disorders, e.g., cancer as well as 

modulating {e.g., promoting or inhibiting) cell survival. Moreover, the immunoglobulin 
fusion proteins of the invention can be used as immunogens to produce antibodies in a 
subject, to purify ligands, and in screening assays to identify molecules that inhibit the 
interaction of a polypeptide of the invention with a ligand. 

30 A chimeric or fusion protein of the invention can be produced by standard 

recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 
techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction 
enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as 
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appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic 
ligation. In another embodiment, the fusion gene can be synthesized by conventional 
techniques including automated DNA synthesizers. Alternatively, PCR amplification of 
gene fragments can be carried out using anchor primers that give rise to complementary 
5 overhangs between two consecutive gene fragments that can subsequently be annealed and 
reamplified to generate a chimeric gene sequence (see, for example, Ausubel et al. (eds.) 
Current Protocols in Molecular Biology, John Wiley & Sons, 1992). Moreover, 
many expression vectors are commercially available that already encode a fusion moiety 
(e.g., a GST polypeptide). A nucleic acid encoding a polypeptide of the invention can be 
10 cloned into such an expression vector such that the fusion moiety is linked in-frame to the 
protein of the invention. 

4.8 GENE THERAPY 

Mutations in the polynucleotides of the invention gene may result in loss of normal 

1 5 function of the encoded protein. The invention thus provides gene therapy to restore normal 
activity of the polypeptides of the invention; or to treat disease states involving polypeptides 
of the invention. Delivery of a functional gene encoding polypeptides of the invention to 
appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more 
particularly viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo 

20 by use of physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for 
example, Anderson, Nature, supplement to vol. 392, no. 6679, pp.25-20 (1998). For 
additional reviews of gene therapy technology see Friedmann, Science, 244: 1275-1281 
(1989); Verma, Scientific American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). 
Introduction of any one of the nucleotides of the present invention or a gene encoding the 

25 polypeptides of the present invention can also be accomplished with extrachromosomal 
substrates (transient expression) or artificial chromosomes (stable expression). Cells may 
also be cultured ex vivo in the presence of proteins of the present invention in order to 
proliferate or to produce a desired effect on or activity in such cells. Treated cells can then 
be introduced in vivo for therapeutic purposes. Alternatively, it is contemplated that in other 

30 human disease states, preventing the expression of or inhibiting the activity of polypeptides 
of the invention will be useful in treating the disease states. It is contemplated that antisense 
therapy or gene therapy could be applied to negatively regulate the expression of 
polypeptides of the invention. 
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Other methods inhibiting expression of a protein include the introduction of antisense 
molecules to the nucleic acids of the present invention, their complements, or their translated 
RNA sequences, by methods known in the art. Further, the polypeptides of the present 
invention can be inhibited by using targeted deletion methods, or the insertion of a negative 
5 regulatory element such as a silencer, which is tissue specific. 

The present invention still further provides cells genetically engineered in vivo to 
express the polynucleotides of the invention, wherein such polynucleotides are in operative 
association with a regulatory sequence heterologous to the host cell which drives expression of 
the polynucleotides in the cell. These methods can be used to increase or decrease the 

1 0 expression of the polynucleotides of the present invention. 

Knowledge of DNA sequences provided by the invention allows for modification of 
cells to permit, increase, or decrease, expression of endogenous polypeptide. Cells can be 
modified (e.g., by homologous recombination) to provide increased polypeptide expression by 
replacing, in whole or in part, the naturally occurring promoter with all or part of a heterologous 

1 5 promoter so that the cells express the protein at higher levels. The heterologous promoter is 

inserted in such a manner that it is operatively linked to the desired protein encoding sequences. 
See, for example, PCT International Publication No. WO 94/12650, PCT International 
Publication No. WO 92/20808, and PCT International Publication No. WO 91/09955. It is also 
contemplated that, in addition to heterologous promoter DNA, amplifiable marker DNA (e.g., 

20 ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate synthase, 

aspartate transcarbamylase, and dihydroorotase) and/or intron DNA may be inserted along with 
the heterologous promoter DNA. If linked to the desired protein coding sequence, 
amplification of the marker DNA by standard selection methods results in co-amplification of 
the desired protein coding sequences in the cells. 

25 In another embodiment of the present invention, cells and tissues may be engineered to 

express an endogenous gene comprising the polynucleotides of the invention under the control 
of inducible regulatory elements, in which case the regulatory sequences of the endogenous 
gene may be replaced by homologous recombination. As described herein, gene targeting can 
be used to replace a gene's existing regulatory region with a regulatory sequence isolated from 

30 a different gene or a novel regulatory sequence synthesized by genetic engineering methods. 
Such regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment 
regions, negative regulatory elements, transcriptional initiation sites, regulatory protein binding 
sites or combinations of said sequences. Alternatively, sequences which affect the structure or 
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stability of the RNA or protein produced may be replaced, removed, added, or otherwise 
modified by targeting. These sequences include polyadenylation signals, mRNA stability 
elements, splice sites, leader sequences for enhancing or modifying transport or secretion 
properties of the protein, or other sequences which alter or improve the function or stability of 

5 protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the 
gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple 
deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory 

1 0 element. Alternatively, the targeting event may replace an existing element; for example, a 
tissue-specific enhancer can be replaced by an enhancer that has broader or different cell-type 
specificity than the naturally occurring elements. Here, the naturally occurring sequences are 
deleted and new sequences are added. In all cases, the identification of the targeting event may 
be facilitated by the use of one or more selectable marker genes that are contiguous with the 

1 5 targeting DNA, allowing for the selection of cells in which the exogenous DNA has integrated 
into the cell genome. The identification of the targeting event may also be facilitated by the use 
of one or more marker genes exhibiting the property of negative selection, such that the 
negatively selectable marker is linked to the exogenous DNA, but configured such that the 
negatively selectable marker flanks the targeting sequence, and such that a correct homologous 

20 recombination event with sequences in the host cell genome does not result in the stable 

integration of the negatively selectable marker. Markers useful for this purpose include the 
Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with 

25 this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to 
Chappel; U.S. Patent No. 5,578,461 to Sherwinet al.; International Application No. 
PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No. 
PCT/US90/06436 (W09 1/06667) by Skoultchi et al., each of which is incorporated by 
reference herein in its entirety. 

30 

4.9 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed or 
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inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination 
5 are referred to as "knockout" animals. Knockout animals, preferably non-human mammals, 
can be prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. 
Transgenic animals are useful to determine the roles polypeptides of the invention play in 
biological processes, and preferably in disease states. Transgenic animals are useful as model 
systems to identify compounds that modulate lipid metabolism. Transgenic animals, 

10 preferably non-human mammals, are produced using methods as described in U.S. Patent No 
5,489,743 and PCT Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of a promoter of the 
polynucleotides of the invention is either activated or inactivated to alter the level of 
expression of the polypeptides of the invention. Inactivation can be carried out using 

1 5 homologous recombination methods described above. Activation can be achieved by 

supplementing or even replacing the homologous promoter to provide for increased protein 
expression. The homologous promoter can be supplemented by insertion of one or more 
heterologous enhancer elements known to confer promoter activation in a particular tissue. 
The polynucleotides of the present invention also make possible the development, 

20 through, e.g., homologous recombination or knock out strategies, of animals that fail to 

express polypeptides of the invention or that express a variant polypeptide. Such animals are 
useful as models for studying the in vivo activities of polypeptide as well as for studying 
modulators of the polypeptides of the invention. 

In preferred methods to determine biological functions of the polypeptides of the 

25 invention in vivo, one or more genes provided by the invention are either over expressed or 
inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination 

30 are referred to as "knockout" animals. Knockout animals, preferably non-human mammals, 
can be prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. 
Transgenic animals are useful to determine the roles polypeptides of the invention play in 
biological processes, and preferably in disease states. Transgenic animals are useful as model 
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systems to identify compounds that modulate lipid metabolism. Transgenic animals, 
preferably non-human mammals, are produced using methods as described in U.S. Patent No 
5,489,743 and PCT Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of the polynucleotides of the 
5 invention promoter is either activated or inactivated to alter the level of expression of the 
polypeptides of the invention. Inactivation can be carried out using homologous 
recombination methods described above. Activation can be achieved by supplementing or 
even replacing the homologous promoter to provide for increased protein expression. The 
homologous promoter can be supplemented by insertion of one or more heterologous 
10 enhancer elements known to confer promoter activation in a particular tissue. 



4.10 USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit one 
or more of the uses or biological activities (including those associated with assays cited 

15 herein) identified herein. Uses or activities described for proteins of the present invention 
may be provided by administration or use of such proteins or of polynucleotides encoding 
such proteins (such as, for example, in gene therapies or vectors suitable for introduction of 
DNA). The mechanism underlying the particular condition or pathology will dictate whether 
the polypeptides of the invention, the polynucleotides of the invention or modulators 

20 (activators or inhibitors) thereof would be beneficial to the subject in need of treatment. 

Thus, "therapeutic compositions of the invention" include compositions comprising isolated 
polynucleotides (including recombinant DNA molecules, cloned genes and degenerate 
variants thereof) or polypeptides of the invention (including full length protein, mature 
protein and truncations or domains thereof), or compounds and other substances that 

25 modulate the overall activity of the target gene products, either at the level of target 

gene/protein expression or target protein activity. Such modulators include polypeptides, 
analogs, (variants), including fragments and fusion proteins, antibodies and other binding 
proteins; chemical compounds that directly or indirectly activate or inhibit the polypeptides 
of the invention (identified, e.g., via drug screening assays as described herein); antisense 

30 polynucleotides and polynucleotides suitable for triple helix formation; and in particular 
antibodies or other binding partners that specifically recognize one or more epitopes of the 
polypeptides of the invention. 
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The polypeptides of the present invention may likewise be involved in cellular 
activation or in one of the other physiological pathways described herein. 

4.10.1 RESEARCH USES AND UTILITIES 

5 The polynucleotides provided by the present invention can be used by the research 

community for various purposes. The polynucleotides can be used to express recombinant 
protein for analysis, characterization or therapeutic use; as markers for tissues in which the 
corresponding protein is preferentially expressed (either constitutively or at a particular stage 
of tissue differentiation or development or in disease states); as molecular weight markers on 

10 gels; as chromosome markers or tags (when labeled) to identify chromosomes or to map 
related gene positions; to compare with endogenous DNA sequences in patients to identify 
potential genetic disorders; as probes to hybridize and thus discover novel, related DNA 
sequences; as a source of information to derive PGR primers for genetic fingerprinting; as a 
probe to "subtract-out" known sequences in the process of discovering other novel 

1 5 polynucleotides; for selecting and making oligomers for attachment to a "gene chip" or other 
support, including for examination of expression patterns; to raise anti-protein antibodies 
using DNA immunization techniques; and as an antigen to raise anti-DNA antibodies or 
elicit another immune response. Where the polynucleotide encodes a protein which binds or 
potentially binds to another protein (such as, for example, in a receptor-ligand interaction), 

20 the polynucleotide can also be used in interaction trap assays (such as, for example, that 

described in Gyuris et al., Cell 75:791-803 (1993)) to identify polynucleotides encoding the 
other protein with which binding occurs or to identify inhibitors of the binding interaction. 

The polypeptides provided by the present invention can similarly be used in assays to 
determine biological activity, including in a panel of multiple proteins for high-throughput 

25 screening; to raise antibodies or to elicit another immune response; as a reagent (including 
the labeled reagent) in assays designed to quantitatively determine levels of the protein (or 
its receptor) in biological fluids; as markers for tissues in which the corresponding 
polypeptide is preferentially expressed (either constitutively or at a particular stage of tissue 
differentiation or development or in a disease state); and, of course, to isolate correlative 

30 receptors or ligands. Proteins involved in these binding interactions can also be used to 
screen for peptide or small molecule inhibitors or agonists of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent 
grade or kit format for commercialization as research products. 
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Methods for performing the uses listed above are well known to those skilled in the 
art. References disclosing such methods include without limitation "Molecular Cloning: A 
Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. 
Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular 
5 Cloning Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987. 

4.10.2 NUTRITIONAL USES 

Polynucleotides and polypeptides of the present invention can also be used as 
nutritional sources or supplements. Such uses include without limitation use as a protein or 

10 amino acid supplement, use as a carbon source, use as a nitrogen source and use as a source of 
carbohydrate. In such cases the polypeptide or polynucleotide of the invention can be added to 
the feed of a particular organism or can be administered as a separate solid or liquid 
preparation, such as in the form of powder, pills, solutions, suspensions or capsules. In the case 
of microorganisms, the polypeptide or polynucleotide of the invention can be added to the 

1 5 medium in or on which the microorganism is cultured. 

4.10.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 
ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, cell 
20 proliferation (either inducing or inhibiting) or cell differentiation (either inducing or 

inhibiting) activity or may induce production of other cytokines in certain cell populations. 
A polynucleotide of the invention can encode a polypeptide exhibiting such attributes. 
Many protein factors discovered to date, including all known cytokines, have exhibited 
activity in one or more factor-dependent cell proliferation assays, and hence the assays serve 
25 as a convenient confirmation of cytokine activity. The activity of therapeutic compositions 
of the present invention is evidenced by any one of a number of routine factor dependent cell 
proliferation assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, 
B9/1 1, BaF3, MC9/G, M+(preB M+), 2E8, RB5, DAI, 123, Tl 165, HT2, CTLL2, TF-1, 
Mo7e, CMK, HUVEC, and Caco. Therapeutic compositions of the invention can be used in 

30 the following: 

Assays for T-cell or thymocyte proliferation include without limitation those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H, 
Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
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Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Takai et al. 5 J. Immunol. 137:3494-3500, 1986; 
Bertagnolli et al., J. Immunol. 145: 1706-1712, 1990; Bertagnolli et al., Cellular Immunology 
133:327-341, 1991; Bertagnolli, et al., I. Immunol. 149:3778-3783, 1992; Bowman et al., I. 

5 Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node cells 
or thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, 
Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan 
eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of 

1 0 mouse and human interleukm-y, Schreiber, R. D. In Current Protocols in Immunology. J. E. 
e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells 
include, without limitation, those described in: Measurement of Human and Murine 
Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current 

15 Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and 
Sons, Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., 
Nature 336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 
1983; Measurement of mouse and human interleukin 6-Nordan, R. In Current Protocols in 
Immunology. J. E. Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; 

20 Smith et al., Proc. Natl. Aced. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human 

Interleukin 1 1 --Bennett, F., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols 
in Immunology. J. E. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991; 
Measurement of mouse and human Interleukin 9— Ciarletta, A., Giannotti, J., Clark, S. C. 
and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.13.1, 

25 John Wiley and Sons, Toronto. 1991. 

Assays for T-cell clone responses to antigens (which will identify, among others, 
proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring 
proliferation and cytokine production) include, without limitation, those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, 

30 E. M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Interscience 

(Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and their 
cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et al., Proc. 
Natl. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et al., Eur. J. Immun. 11:405-411, 
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1981; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 
1988. 



4.10.4 STEM CELL GROWTH FACTOR ACTIVITY 

5 A polypeptide of the present invention may exhibit stem cell growth factor activity 

and be involved in the proliferation, differentiation and survival of pluripotent and totipotent 
stem cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells 
and/or germ line stem cells. Administration of the polypeptide of the invention to stem cells 
in vivo or ex vivo is expected to maintain and expand cell populations in a totipotential or 

10 pluripotential state which would be useful for re-engineering damaged or diseased tissues, 
transplantation, manufacture of bio-pharmaceuticals and the development of bio-sensors. 
The ability to produce large quantities of human cells has important working applications for 
the production of human proteins which currently must be obtained from non-human sources 
or donors, implantation of cells to treat diseases such as Parkinson's, Alzheimer's and other 

15 neurodegenerative diseases; tissues for grafting such as bone marrow, skin, cartilage, 
tendons, bone, muscle (including cardiac muscle), blood vessels, cornea, neural cells, 
gastrointestinal cells and others; and organs for transplantation such as kidney, liver, 
pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or cytokines 

20 may be administered in combination with the polypeptide of the invention to achieve the 

desired effect, including any of the growth factors listed herein, other stem cell maintenance 
factors, and specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), 
Flt-3 ligand (Flt-3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL- 
6, macrophage inflammatory protein 1 -alpha (MIP-1 -alpha), G-CSF, GM-CSF, 

25 thrombopoietin (TPO), platelet factor 4 (PF-4), platelet-derived growth factor (PDGF), 
neural growth factors and basic fibroblast growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, expansion 
of these cells in culture will facilitate the production of large quantities of mature cells. 
Techniques for culturing stem cells are known in the art and administration of polypeptides 

30 of the invention, optionally with other growth factors and/or cytokines, is expected to 
enhance the survival and proliferation of the stem cell populations. This can be 
accomplished by direct administration of the polypeptide of the invention to the culture 
medium. Alternatively, stroma cells transfected with a polynucleotide that encodes for the 



WO 03/054152 PCT/US02/39555 

49 

polypeptide of the invention can be used as a feeder layer for the stem cell populations in 
culture or in vivo. Stromal support cells for feeder layers may include embryonic bone 
marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or cultured embryonic 
fibroblasts (see U.S. Patent No. 5,690,926). 
5 Stem cells themselves can be transfected with a polynucleotide of the invention to 

induce autocrine expression of the polypeptide of the invention. This will allow for 
generation of undifferentiated totipotential/pluripotential stem ce^l lines that are useful as is 
or that can then be differentiated into the desired mature cell types. These stable cell lines 
can also serve as a source of undifferentiated totipotential/pluripotential mRNA to create 

10 cDNA libraries and templates for polymerase chain reaction experiments. These studies 

would allow for the isolation and identification of differentially expressed genes in stem cell 
populations that regulate stem cell proliferation and/or maintenance. 

Expansion and maintenance of totipotent stem cell populations will be useful in the 
treatment of many pathological conditions. For example, polypeptides of the present 

1 5 invention may be used to manipulate stem cells in culture to give rise to neuroepithelial cells 
that can be used to augment or replace cells damaged by illness, autoimmune disease, 
accidental damage or genetic disorders. The polypeptide of the invention may be useful for 
inducing the proliferation of neural cells and for the regeneration of nerve and brain tissue, 
i.e. for the treatment of central and peripheral nervous system diseases and neuropathies, as 

20 well as mechanical and traumatic disorders which involve degeneration, death or trauma to 
neural cells or nerve tissue. In addition, the expanded stem cell populations can also be 
genetically altered for gene therapy purposes and to decrease host rejection of replacement 
tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can also be 

25 manipulated to achieve controlled differentiation of the stem cells into more differentiated 
cell types. A broadly applicable method of obtaining pure populations of a specific 
differentiated cell type from undifferentiated stem cell populations involves the use of a cell- 
type specific promoter driving a selectable marker. The selectable marker allows only cells 
of the desired type to survive. For example, stem cells can be induced to differentiate into 

30 cardiomyocytes (Wobus et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. 

Invest, 98(1): 216-224, (1998)) or skeletal muscle cells (Browder, L. W. In: Principles of 
Tissue Engineering eds. Lanza et al., Academic Press (1997)). Alternatively, directed 
differentiation of stem cells can be accomplished by culturing the stem cells in the presence 
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of a differentiation factor such as retinoic acid and an antagonist of the polypeptide of the 
invention which would inhibit the effects of endogenous stem cell factor activity and allow 
differentiation to proceed. 

In vitro cultures of stem cells can be used to determine if the polypeptide of the 
5 invention exhibits stem cell growth factor activity. Stem cells are isolated from any one of 
various cell sources (including hematopoietic stem cells and embryonic stem cells) and 
cultured on a feeder layer, as described by Thompson et al. Proc. Natl. Acad. Sci, U.S.A., 
92: 7844-7848 (1995), in the presence of the polypeptide of the invention alone or in 
combination with other growth factors or cytokines. The ability of the polypeptide of the 
1 0 invention to induce stem cells proliferation is determined by colony formation on semi-solid 
support e.g. as described by Bernstein et al., Blood, 77: 2316-2321 (1991). 

4.10.5 HEMATOPOIESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of 

1 5 hematopoiesis and, consequently, in the treatment of myeloid or lymphoid cell disorders. 
Even marginal biological activity in support of colony forming cells or of factor-dependent 
cell lines indicates involvement in regulating hematopoiesis, e.g. in supporting the growth 
and proliferation of erythroid progenitor cells alone or in combination with other cytokines, 
thereby indicating utility, for example, in treating various anemias or for use in conjunction 

20 with irradiation/chemotherapy to stimulate the production of erythroid precursors and/or 
erythroid cells; in supporting the growth and proliferation of myeloid cells such as 
granulocytes and monocytes/macrophages (i.e., traditional CSF activity) useful, for example, 
in conjunction with chemotherapy to prevent or treat consequent myelo-suppression; in 
supporting the growth and proliferation of megakaryocytes and consequently of platelets 

25 thereby allowing prevention or treatment of various platelet disorders such as 

thrombocytopenia, and generally for use in place of or complimentary to platelet 
transfusions; and/or in supporting the growth and proliferation of hematopoietic stem cells 
which are capable of maturing to any and all of the above-mentioned hematopoietic cells and 
therefore find therapeutic utility in various stem cell disorders (such as those usually treated 

30 with transplantation, including, without limitation, aplastic anemia and paroxysmal nocturnal 
hemoglobinuria), as well as in repopulating the stem cell compartment post 
irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow 
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transplantation or with peripheral progenitor cell transplantation (homologous or 
heterologous)) as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 
Suitable assays for proliferation and differentiation of various hematopoietic lines are 
5 cited above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 
proteins that influence embryonic differentiation hematopoiesis) include, without limitation, 
those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., 
Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 81:2903-2915, 
10 1993. 

Assays for stem cell survival and differentiation (which will identify, among others, 
proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: 
Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. 
R. I. Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; 

15 Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-591 1, 1992; Primitive hematopoietic 
colony forming cells with high proliferative potential, McNiece, I. K. and Briddell, R. A. In 
Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., 
New York, N.Y. 1994; Neben et al., Experimental Hematology 22:353-359, 1994; 
Cobblestone area forming cell assay, Ploemacher, R. E. In Culture of Hematopoietic Cells. 

20 R. I. Freshney, et al. eds. Vol pp. 1-21, Wiley-Liss, Inc., New York, N.Y. 1994; Long term 
bone marrow cultures in the presence of stromal cells, Spooncer, E., Dexter, M. and Allen, 
T. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, 
Inc., New York, N.Y. 1994; Long term culture initiating cell assay, Sutherland, H. J. In 
Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 139-162, Wiley-Liss, Inc., 

25 New York, N.Y. 1 994. 



4.10.6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, 
tendon, ligament and/or nerve tissue growth or regeneration, as well as in wound healing and 
30 tissue repair and replacement, and in healing of burns, incisions and ulcers. 

A polypeptide of the present invention which induces cartilage and/or bone growth in 
circumstances where bone is not normally formed, has application in the healing of bone 
fractures and cartilage damage or defects in humans and other animals. Compositions of a 
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polypeptide, antibody, binding partner, or other modulator of the invention may have 
prophylactic use in closed as well as open fracture reduction and also in the improved 
fixation of artificial joints. De novo bone formation induced by an osteogenic agent 
contributes to the repair of congenital, trauma induced, or oncologic resection induced 
5 craniofacial defects, and also is useful in cosmetic plastic surgery. 

A polypeptide of this invention may also be involved in attracting bone-forming 
cells, stimulating growth of bone-forming cells, or inducing differentiation of progenitors of 
bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 
periodontal disease, such as through stimulation of bone and/or cartilage repair or by 

1 0 blocking inflammation or processes of tissue destruction (collagenase activity, osteoclast 
activity, etc.) mediated by inflammatory processes may also be possible using the 
composition of the invention. 

Another category of tissue regeneration activity that may involve the polypeptide of 
the present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue 

15 or other tissue formation in circumstances where such tissue is not normally formed, has 
application in the healing of tendon or ligament tears, deformities and other tendon or 
ligament defects in humans and other animals. Such a preparation employing a 
tendon/ligament-like tissue inducing protein may have prophylactic use in preventing 
damage to tendon or ligament tissue, as well as use in the improved fixation of tendon or 

20 ligament to bone or other tissues, and in repairing defects to tendon or ligament tissue. De 
novo tendon/ligament-like tissue formation induced by a composition of the present 
invention contributes to the repair of congenital, trauma induced, or other tendon or ligament 
defects of other origin, and is also useful in cosmetic plastic surgery for attachment or repair 
of tendons or ligaments. The compositions of the present invention may provide 

25 environment to attract tendon- or ligament-forming cells, stimulate growth of tendon- or 

ligament-forming cells, induce differentiation of progenitors of tendon- or ligament-forming 
cells, or induce growth of tendon/ligament cells or progenitors ex vivo for return in vivo to 
effect tissue repair. The compositions of the invention may also be useful in the treatment of 
tendinitis, carpal tunnel syndrome and other tendon or ligament defects. The compositions 

30 may also include an appropriate matrix and/or sequestering agent as a carrier as is well 
known in the art. 

The compositions of the present invention may also be useful for proliferation of 
neural cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central 
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and peripheral nervous system diseases and neuropathies, as well as mechanical and 
traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve 
tissue. More specifically, a composition maybe used in the treatment of diseases of the 
peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and 
5 localized neuropathies, and central nervous system diseases, such as Alzheimer's, 

Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy~Drager 
syndrome. Further conditions which may be treated in accordance with the present invention 
include mechanical and traumatic disorders, such as spinal cord disorders, head trauma and 
cerebrovascular diseases such as stroke. Peripheral neuropathies resulting from 
1 0 chemotherapy or other medical therapies may also be treatable using a composition of the 
invention. 

Compositions of the invention may also be useful to promote better or faster closure 
of non-healing wounds, including without limitation pressure ulcers, ulcers associated with 
vascular insufficiency, surgical and traumatic wounds, and the like. 

1 5 Compositions of the present invention may also be involved in the generation or 

regeneration of other tissues, such as organs (including, for example, pancreas, liver, 
intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular 
(including vascular endothelium) tissue, or for promoting the growth of cells comprising 
such tissues. Part of the desired effects may be by inhibition or modulation of fibrotic 

20 scarring may allow normal tissue to regenerate. A polypeptide of the present invention may 

also exhibit angiogenic activity. 

A composition of the present invention may also be useful for gut protection or 

regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and 

conditions resulting from systemic cytokine damage. 
25 A composition of the present invention may also be useful for promoting or 

inhibiting differentiation of tissues described above from precursor tissues or cells; or for 

inhibiting the growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 
Assays for tissue generation activity include, without limitation, those described in: 
30 International Patent Publication No. WQ95/1 6035 (bone, cartilage, tendon); International 

Patent Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. 

WQ9 1/07491 (skin, endothelium). 
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Assays for wound healing activity include, without limitation, those described in: 
Winter, Epidermal Wound Healing, pps. 71-1 12 (Maibach, H. I. and Rovee, D. T., eds.), 
Year Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. 
Dermatol 71:382-84 (1978). 

5 

4.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or 
immune suppressing activity, including without limitation the activities for which assays are 
described herein. A polynucleotide of the invention can encode a polypeptide exhibiting 

10 such activities. A protein may be useful in the treatment of various immune deficiencies and 
disorders (including severe combined immunodeficiency (SCID)), e.g., in regulating (up or 
down) growth and proliferation of T and/or B lymphocytes, as well as effecting the cytolytic 
activity of NK cells and other cell populations. These immune deficiencies may be genetic or 
be caused by viral (e.g., HIV) as well as bacterial or fungal infections, or may result from 

15 autoimmune disorders. More specifically, infectious diseases causes by viral, bacterial, 

fungal or other infection may be treatable using a protein of the present invention, including 
infections by HIV, hepatitis viruses, herpes viruses, mycobacteria, Leishmania spp., malaria 
spp. and various fungal infections such as candidiasis. Of course, in this regard, proteins of 
the present invention may also be useful where a boost to the immune system generally may 

20 be desirable, i.e., in the treatment of cancer. 

Autoimmune disorders which may be treated using a protein of the present invention 
include, for example, connective tissue disease, multiple sclerosis, systemic lupus 
erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre 
syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, 

25 graft-versus-host disease and autoimmune inflammatory eye disease. Such a protein (or 
antagonists thereof, including antibodies) of the present invention may also to be useful in 
the treatment of allergic reactions and conditions (e.g., anaphylaxis, serum sickness, drug 
reactions, food allergies, insect venom allergies, mastocytosis, allergic rhinitis, 
hypersensitivity pneumonitis, urticaria, angioedema, eczema, atopic dermatitis, allergic 

30 contact dermatitis, erythema multiforme, Stevens- Johnson syndrome, allergic conjunctivitis, 
atopic keratoconjunctivitis, venereal keratoconjunctivitis, giant papillary conjunctivitis and 
contact allergies), such as asthma (particularly allergic asthma) or other respiratory 
problems. Other conditions, in which immune suppression is desired (including, for 
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example, organ transplantation), may also be treatable using a protein (or antagonists 
thereof) of the present invention. The therapeutic effects of the polypeptides or antagonists 
thereof on allergic reactions can be evaluated by in vivo animals models such as the 
cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66, 1998), skin 
5 prick test (Hoffmann et al., Allergy 54: 446-54, 1 999), guinea pig skin sensitization test 
(Vohr et al., Arch, Joxocol. 73: 501-9), and murine local lymph node assay (Kimber et al., 
J. Toxicol. Environ. Health 53: 563-79). 

Using the proteins of the invention it may also be possible to modulate immune 
responses, in a number of ways. Down regulation may be in the form of inhibiting or 

10 blocking an immune response already in progress or may involve preventing the induction of 
an immune response. The functions of activated T cells may be inhibited by suppressing T 
cell responses or by inducing specific tolerance in T cells, or both. Immunosuppression of T 
cell responses is generally an active, non-antigen-specific, process which requires continuous 
exposure of the T cells to the suppressive agent. Tolerance, which involves inducing 

15 non-responsiveness or anergy in T cells, is distinguishable from immunosuppression in that 
it is generally antigen-specific and persists after exposure to the tolerizing agent has ceased. 
Operationally, tolerance can be demonstrated by the lack of a T cell response upon 
reexposure to specific antigen in the absence of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 

20 limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high 
level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin 
and organ transplantation and in graft-versus-host disease (GVHD). For example, blockage 
of T cell function should result in reduced tissue destruction in tissue transplantation. 
Typically, in tissue transplants, rejection of the transplant is initiated through its recognition 

25 as foreign by T cells, followed by an immune reaction that destroys the transplant. The 

administration of a therapeutic composition of the invention may prevent cytokine synthesis 
by immune cells, such as T cells, and thus acts as an immunosuppressant. Moreover, a lack 
of costimulation may also be sufficient to anergize the T cells, thereby inducing tolerance in 
a subject. Induction of long-term tolerance by B lymphocyte antigen-blocking reagents may 

30 avoid the necessity of repeated administration of these blocking reagents. To achieve 

sufficient immunosuppression or tolerance in a subject, it may also be necessary to block the 
function of a combination of B lymphocyte antigens. 
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The efficacy of particular therapeutic compositions in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been 
5 used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as 

described in Lenschow et al. 3 Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. 
Sci USA, 89: 1 1 102-1 1 105 (1992). In addition, murine models of GVHD (see Paul ed., 
Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to 
determine the effect of therapeutic compositions of the invention on the development of that 
10 disease. 

Blocking antigen function may also be therapeutically useful for treating 
autoimmune diseases. Many autoimmune disorders are the result of inappropriate activation 
of T cells that are reactive against self-tissue and which promote the production of cytokines 
and autoantibodies involved in the pathology of the diseases. Preventing the activation of 

15 autoreactive T cells may reduce or eliminate disease symptoms. Administration of reagents 
which block stimulation of T cells can be used to inhibit T cell activation and prevent 
production of autoantibodies or T cell-derived cytokines which may be involved in the 
disease process. Additionally, blocking reagents may induce antigen-specific tolerance of 
autoreactive T cells which could lead to long-term relief from the disease. The efficacy of 

20 blocking reagents in preventing or alleviating autoimmune disorders can be determined 
using a number of well-characterized animal models of human autoimmune diseases. 
Examples include murine experimental autoimmune encephalitis, systemic lupus 
erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune collagen 
arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental myasthenia 

25 gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 
840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a 
means of up regulating immune responses, may also be useful in therapy. Upregulation of 
immune responses may be in the form of enhancing an existing immune response or eliciting 
30 an initial immune response. For example, enhancing an immune response may be useful in 
cases of viral infection, including systemic viral diseases such as influenza, the common 
cold, and encephalitis. 
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Alternatively, anti-viral immune responses may be enhanced in an infected patient by 
removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed 
APCs either expressing a peptide of the present invention or together with a stimulatory 
form of a soluble peptide of the present invention and reintroducing the in vitro activated T 
5 cells into the patient. Another method of enhancing anti-viral immune responses would be to 
isolate infected cells from a patient, transfect them with a nucleic acid encoding a protein of 
the present invention as described herein such that the cells express all or a portion of the 
protein on their surface, and reintroduce the transfected cells into the patient. The infected 
cells would now be capable of delivering a costimulatory signal to, and thereby activate, T 
10 cells in vivo. 

A polypeptide of the present invention may provide the necessary stimulation signal 
to T cells to induce a T cell mediated immune response against the transfected tumor cells. 
In addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to 
reexpress sufficient mounts of MHC class I or MHC class II molecules, can be transfected 

15 with nucleic acid encoding all or a portion of (e.g., a cytoplasmic-domain truncated portion) 
of an MHC class I alpha chain protein and p 2 microglobulin protein or an MHC class II 
alpha chain protein and an MHC class II beta chain protein to thereby express MHC class I 
or MHC class II proteins on the cell surface. Expression of the appropriate class I or class II 
MHC in conjunction with a peptide having the activity of a B lymphocyte antigen (e.g., 

20 B7-1 , B7-2, B7-3) induces a T cell mediated immune response against the transfected tumor 
cell. Optionally, a gene encoding an antisense construct which blocks expression of an MHC 
class II associated protein, such as the invariant chain, can also be cotransfected with a DNA 
encoding a peptide having the activity of a B lymphocyte antigen to promote presentation of 
tumor associated antigens and induce tumor specific immunity. Thus, the induction of a T 

25 cell mediated immune response in a human subject may be sufficient to overcome 
tumor-specific tolerance in the subject. 

The activity of a protein of the invention may, among other means, be measured by 

the following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, 
30 those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, 
D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 
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78:2488-2492, 1981; Herrmann et al. s J. Immunol. 128:1968-1974, 1982; Handa et al., J. 
Immunol. 135:1564-1572, 1985; Takai et al., L Immunol. 137:3494-3500, 1986; Takai et al., 
J. Immunol. 140:508-512, 1988; Bowman etal., J. Virology 61:1992-1998; Bertagnolli et 
al., Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 
5 1994. 

Assays for T-cell-dependent immunoglobulin responses and isotype switching 
(which will identify, among others, proteins that modulate T-cell dependent antibody 
responses and that affect Thl/Th2 profiles) include, without limitation, those described in: 
Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro 

10 antibody production, Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. 
E. e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, 
proteins that generate predominantly Thl and CTL responses) include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, 

15 D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 

Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; 
Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 
1992. 

20 Dendritic cell-dependent assays (which will identify, among others, proteins 

expressed by dendritic cells that activate naive T-cells) include, without limitation, those 
described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of 
Experimental Medicine 173:549-559, 1991; Macatonia et al., Journal of Immunology 
154:5071-5079, 1995; Porgador et al., Journal of Experimental Medicine 182:255-260, 

25 1995; Nair et al., Journal of Virology 67:4062-4069, 1993; Huang et al., Science 

264:961-965, 1994; Macatonia et al., Journal of Experimental Medicine 169:1255-1264, 
1989; Bhardwaj et al., Journal of Clinical Investigation 94:797-807, 1994; and Inaba et al., 
Journal of Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, 

30 proteins that prevent apoptosis after superantigen induction and proteins that regulate 

lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz et 
al., Cytometry 13:795-808, 1992; Gorcizyca et al., Leukemia 7:659-670, 1993; Gorczyca et 
al., Cancer Research 53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, 
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Journal of Immunology 145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; 
Gorczyca et al., International Journal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and development 
include, without limitation, those described in: Antica et al., Blood 84: 111-1 17, 1994; Fine 
5 et al., Cellular Immunology 155: 1 1 1-122, 1994; Galy et aL, Blood 85:2770-2778, 1995; 
Toki et ah, Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 

4.10.8 ACTIVIN/INIIIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related 
10 activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating homione (FSH), while activins and are characterized by their ability to stimulate 
the release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present 
invention, alone or in heterodimers with a member of the inhibin family, may be useful as a 
15 contraceptive based on the ability of inhibins to decrease fertility in female mammals and 
decrease spermatogenesis in male mammals. Administration of sufficient amounts of other 
inhibins can induce infertility in these mammals. Alternatively, the polypeptide of the 
invention, as a homodimer or as a heterodimer with other protein subunits of the inhibin 
group, may be useful as a fertility inducing therapeutic, based upon the ability of activin 

i 

20 molecules in stimulating FSH release from cells of the anterior pituitary. See, for example, 

U.S. Pat. No. 4,798,885. A polypeptide of the invention may also be useful for advancement 
of the onset of fertility in sexually immature mammals, so as to increase the lifetime 
reproductive performance of domestic animals such as, but not limited to, cows, sheep and 
pigs. 

25 The activity of a polypeptide of the invention may, among other means, be measured 

by the following methods. 

Assays for activin/inhibin activity include, without limitation, those described in: 
Vale et al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et 
al., Nature 321 :776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et aL, Proc. 
30 Natl. Acad. Sci. USA 83:3091-3095, 1986. 



4.10.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY 
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A polypeptide of the present invention may be involved in chemotactic or 
chemokinetic activity for mammalian cells, including, for example, monocytes, fibroblasts, 
neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A 
polynucleotide of the invention can encode a polypeptide exhibiting such attributes. 
5 Chemotactic and chemokinetic receptor activation can be used to mobilize or attract a 

desired cell population to a desired site of action. Chemotactic or chemokinetic compositions 
(e.g. proteins, antibodies, binding partners, or modulators of the invention) provide particular 
advantages in treatment of wounds and other trauma to tissues, as well as in treatment of 
localized infections. For example, attraction of lymphocytes, monocytes or neutrophils to 
10 tumors or sites of infection may result in improved immune responses against the tumor or 
infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it can 
stimulate, directly or indirectly, the directed orientation or movement of such cell 
population. Preferably, the protein or peptide has the ability to directly stimulate directed 
1 5 movement of cells. Whether a particular protein has chemotactic activity for a population of 
cells can be readily determined by employing such protein or peptide in any known assay for 
cell chemotaxis. 

Therapeutic compositions of the invention can be used in the following: 

Assays for chemotactic activity (which will identify proteins that induce or prevent 

20 chemotaxis) consist of assays that measure the ability of a protein to induce the migration of 
cells across a membrane as well as the ability of a protein to induce the adhesion of one cell 
population to another cell population. Suitable assays for movement and adhesion include, 
without limitation, those described in: Current Protocols in Immunology, Ed by J. E. 
Coligan, A. M. Kruisbeek, D. H. Marguiles, E. M. Shevach, W. Strober, Pub. Greene 

25 Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta 
Chemokines 6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. 
APMIS 103:140-146, 1995; Muller et al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of 
Immunol. 152:5860-5867, 1994; Johnston et al. J. of Immunol. 153:1762-1768, 1994. 



30 4.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostatis or thrombolysis or 
thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such 
attributes. Compositions may be useful in treatment of various coagulation disorders 
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(including hereditary disorders, such as hemophilias) or to enhance coagulation and other 
hemostatic events in treating wounds resulting from trauma, surgery or other causes. A 
composition of the invention may also be useful for dissolving or inhibiting formation of 
thromboses and for treatment and prevention of conditions resulting therefrom (such as, for 
5 example, infarction of cardiac and central nervous system vessels (e.g., stroke). 

Therapeutic compositions of the invention can be used in the following: 
Assay for hemostatic and thrombolytic activity include, without limitation, those 
described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et aL, Thrombosis 

i 

Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, 
10 Prostaglandins 35:467-474, 1988. 

4.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, proliferation 
or metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the 

1 5 invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. 
For example, the presence or increased expression of a polynucleotide/polypeptide of the 
invention may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing 
malignancy. Conversely, a defect in the gene or absence of the polypeptide may be 
associated with a cancer condition. Identification of single nucleotide polymorphisms 

20 associated with cancer or a predisposition to cancer may also be useful for diagnosis or 
prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 
inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor 
growth) and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. 

25 Therapeutic compositions of the invention may be effective in adult and pediatric oncology 
including in solid phase tumors/malignancies, locally advanced tumors, human soft tissue 
sarcomas, metastatic cancer, including lymphatic metastases, blood cell malignancies 
including multiple myeloma, acute and chronic leukemias, and lymphomas, head and neck 
cancers including mouth cancer, larynx cancer and thyroid cancer, lung cancers including 

30 small cell carcinoma and non-small cell cancers, breast cancers including small cell 

carcinoma and ductal carcinoma, gastrointestinal cancers including esophageal cancer, 
stomach cancer, colon cancer, colorectal cancer and polyps associated with colorectal 
neoplasia, pancreatic cancers, liver cancer, urologic cancers including bladder cancer and 
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prostate cancer, malignancies of the female genital tract including ovarian carcinoma, uterine 
(including endometrial) cancers, and solid tumor in the ovarian follicle, kidney cancers 
including renal cell carcinoma, brain cancers including intrinsic brain tumors, 
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central 
5 nervous system, bone cancers including osteomas, skin cancers including malignant 

melanoma, tumor progression of human skin keratinocytes, squamous cell carcinoma, basal 
cell carcinoma, hemangiopericytoma and Karposi's sarcoma. 

Polypeptides, polynucleotides, or modulators of polypeptides of the invention 
(including inhibitors and stimulators of the biological activity of the polypeptide of the 

10 invention) may be administered to treat cancer. Therapeutic compositions can be 

administered in therapeutically effective dosages alone or in combination with adjuvant 
cancer therapy such as surgery, chemotherapy, radiotherapy, thermotherapy, and laser 
therapy, and may provide a beneficial effect, e.g. reducing tumor size, slowing rate of tumor 
growth, inhibiting metastasis, or otherwise improving overall clinical condition, without 

1 5 necessarily eradicating the cancer. 

The composition can also be administered in therapeutically effective amounts as a 
portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or 
modulator of the invention with one or more anti-cancer drugs in addition to a 
pharmaceutically acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer 

20 treatment is routine. Anti-cancer drugs that are well known in the art and can be used as a 
treatment in combination with the polypeptide or modulator of the invention include: 
Actinomycin D, Aminoglutethimide, Asparaginase, Bleomycin, Busulfan, Carboplatin, 
Carmustine, Chlorambucil, Cisplatin (cis-DDP), Cyclophosphamide, Cytarabine HC1 
(Cytosine arabinoside), Dacarbazine, Dactinomycin, Daunorubicin HC1, Doxorubicin HC1, 

25 Estramustine phosphate sodium, Etoposide (VI 6-2 13), Floxuridine, 5-Fluorouracil (5-Fu), 
Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, Interferon Alpha-2a, Interferon 
Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), Lomustine, Mechlorethamine 
HC1 (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, Methotrexate (MTX), 
Mitomycin, Mitoxantrone HC1, Octreotide, Plicamycin, Procarbazine HC1, Streptozocin, 

30 Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate, 

Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin, 
Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for prophylactic 
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treatment of cancer. There are hereditary conditions and/or environmental situations (e.g. 
exposure to carcinogens) known in the art that predispose an individual to developing 
cancers. Under these circumstances, it may be beneficial to treat these individuals with 
therapeutically effective doses of the polypeptide of the invention to reduce the risk of 
5 developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of the 
invention as a potential cancer treatment. These in vitro models include proliferation assays 
of cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1987) 
Culture of Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, NY Ch 18 

10 and Ch 21), tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst., 
52: 921-30 (1974), mobility and invasive potential of tumor cells in Boyden Chamber assays 
as described in Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis 
assays such as induction of vascularization of the chick chorioallantoic membrane or 
induction of vascular endothelial cell migration as described in Ribatta et al., Intl. J. Dev. 

15 Biol., 40: 1 189-97 (1999) and Li et al., Clin. Exp. Metastasis, 17:423-9 (1999) s respectively. 
Suitable tumor cells lines are available, e.g. from American Type Tissue Culture Collection 
catalogs. 

4.10.12 RECEPTOR/LIGAND ACTIVITY 

20 A polypeptide of the present invention may also demonstrate activity as receptor, 

receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of 
the invention can encode a polypeptide exhibiting such characteristics. Examples of such 
receptors and ligands include, without limitation, cytokine receptors and their ligands, 
receptor kinases and their ligands, receptor phosphatases and their ligands, receptors 

25 involved in cell-cell interactions and their ligands (including without limitation, cellular 

adhesion molecules (such as selectins, integrins and their ligands) and receptor/ligand pairs 
involved in antigen presentation, antigen recognition and development of cellular and 
humoral immune responses. Receptors and ligands are also useful for screening of potential 
peptide or small molecule inhibitors of the relevant receptor/ligand interaction. A protein of 

30 the present invention (including, without limitation, fragments of receptors and ligands) may 
themselves be useful as inhibitors of receptor/ligand interactions. 

The activity of a polypeptide of the invention may, among other means, be measured 
by the following methods: 
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Suitable assays for receptor-ligand activity include without limitation those described 
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. 
Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static conditions 
5 7.28. 1- 7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., 
J. Exp. Med. 168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 169:149-160 1989; 
Stoltenborg et al., J. Immunol. Methods 175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995. 

By way of example, the polypeptides of the invention may be used as a receptor for a 
ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be 

10 identified through binding assays, affinity chromatography, dihybrid screening assays, 
BIAcore assays, gel overlay assays, or other methods known in the art. 

Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or 
a partial antagonist require the use of other proteins as competing ligands. The polypeptides 
of the present invention or ligand(s) thereof may be labeled by being coupled to 

15 radioisotopes, colorimetric molecules or a toxin molecules by conventional methods. 

("Guide to Protein Purification" Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 
(1990) Academic Press, Inc. San Diego). Examples of radioisotopes include, but are not 
limited to, tritium and carbon- 14 . Examples of colorimetric molecules include, but are not 
limited to, fluorescent molecules such as fluorescamine, or rhodamine or other colorimetric 

20 molecules. Examples of toxins include, but are not limited, to ricin. 

4.10.13 DRUG SCREENING 

This invention is particularly useful for screening chemical compounds by using the 
novel polypeptides or binding fragments thereof in any of a variety of drug screening 

25 techniques. The polypeptides or fragments employed in such a test may either be free in 
solution, affixed to a solid support, borne on a cell surface or located intracellularly. One 
method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the polypeptide or a fragment 
thereof. Drugs are screened against such transformed cells in competitive binding assays. 

30 Such cells, either in viable or fixed form, can be used for standard binding assays. One may 
measure, for example, the formation of complexes between polypeptides of the invention or 
fragments and the agent being tested or examine the diminution in complex formation 
between the novel polypeptides and an appropriate cell line, which are well known in the art. 
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Sources for test compounds that may be screened for ability to bind to or modulate 
(i.e., increase or decrease) the activity of polypeptides of the invention include (1) inorganic 
and organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of either random or mimetic peptides, oligonucleotides or organic molecules. 
5 Chemical libraries may be readily synthesized or purchased from a number of 

commercial sources, and may include structural analogs of known compounds or compounds 
that are identified as "hits" or "leads" via natural product screening. 

The sources of natural product libraries are microorganisms (including bacteria and 
fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for 

1 0 screening may be created by: (1) fermentation and extraction of broths from soil, plant or 
marine microorganisms or (2) extraction of the organisms themselves. Natural product 
libraries include polyketides, non-ribosomal peptides, and (non-naturally occurring) variants 
thereof. For a review, see Science 252:63-68 (1998). 

Combinatorial libraries are composed of large numbers of peptides, oligonucleotides 

15 or organic compounds and can be readily prepared by traditional automated synthesis 

methods, PGR, cloning or proprietary synthetic methods. Of particular interest are peptide 
and oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, 
protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide 
libraries. For a review of combinatorial chemistry and libraries created therefrom, see 

20 Myers, Curr. Opiru Biotechnol 8:701-707 (1997). For reviews and examples of 

peptidomimetic libraries, see Al-Obeidi et al, Mol Biotechnol, 9(3):205-23 (1998); Hruby 
et al., Curr Opin Chem Biol, 1(1): 1 14-19 (1997); Domer et al., BioorgMed Chem, 
4(5):709-15 (1996) (alkylated dipeptides). 

Identification of modulators through use of the various libraries described herein 

25 permits modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" 
to bind a polypeptide of the invention. The molecules identified in the binding assay are then 
tested for antagonist or agonist activity in in vivo tissue culture or animal models that are 
well known in the art. In brief, the molecules are titrated into a plurality of cell cultures or 
animals and then tested for either cell/animal death or prolonged survival of the animal/cells. 

* 

30 The binding molecules thus identified may be complexed with toxins, e.g., ricin or 

cholera, or with other compounds that are toxic to cells such as radioisotopes. The 
toxin-binding molecule complex is then targeted to a tumor or other cell by the specificity of 
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the binding molecule for a polypeptide of the invention. Alternatively, the binding 
molecules may be complexed with imaging agents for targeting and imaging purposes. 

4.10.14 ASSAY FOR RECEPTOR ACTIVITY 

5 The invention also provides methods to detect specific binding of a polypeptide e.g. a 

ligand or a receptor. The art provides numerous assays particularly useful for identifying 
previously unknown binding partners for receptor polypeptides of the invention. For 
example, expression cloning using mammalian or bacterial cells, or dihybrid screening 
assays can be used to identify polynucleotides encoding binding partners. As another 

1 0 example, affinity chromatography with the appropriate immobilized polypeptide of the 
invention can be used to isolate polypeptides that recognize and bind polypeptides of the 
invention. There are a number of different libraries used for the identification of 
compounds, and in particular small molecules, that modulate (i.e., increase or decrease) 
biological activity of a polypeptide of the invention. Ligands for receptor polypeptides of the 

1 5 invention can also be identified by adding exogenous ligands, or cocktails of ligands to two 
cells populations that are genetically identical except for the expression of the receptor of the 
invention: one cell population expresses the receptor of the invention whereas the other does 
not. The responses of the two cell populations to the addition of ligands(s) are then 
compared. Alternatively, an expression library can be co-expressed with the polypeptide of 

20 the invention in cells and assayed for an autocrine response to identify potential ligand(s). As 
still another example, BIAcore assays, gel overlay assays, or other methods known in the art 
can be used to identify binding partner polypeptides, including, (1) organic and inorganic 
chemical libraries, (2) natural product libraries, and (3) combinatorial libraries comprised of 
random peptides, oligonucleotides or organic molecules. 

25 The role of downstream intracellular signaling molecules in the signaling cascade of 

the polypeptide of the invention can be determined. For example, a chimeric protein in 
which the cytoplasmic domain of the polypeptide of the invention is fused to the 
extracellular portion of a protein, whose ligand has been identified, is produced in a host 
cell. The cell is then incubated with the ligand specific for the extracellular portion of the 

30 chimeric protein, thereby activating the chimeric receptor. Known downstream proteins 
involved in intracellular signaling can then be assayed for expected modifications i.e. 
phosphorylation. Other methods known to those in the art can also be used to identify 
signaling molecules involved in receptor activity. 
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4.10.15 ANTIINFLAMMATORY ACTIVITY 

Compositions of the present invention may also exhibit anti-inflammatory activity. 
The anti-inflammatory activity may be achieved by providing a stimulus to cells involved in 
5 the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for 
example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the 
inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or 
suppressing production of other factors which more directly inhibit or promote an 
inflammatory response. Compositions with such activities can be used to treat inflammatory 

10 conditions including chronic or acute conditions), including without limitation intimation 
associated with infection (such as septic shock, sepsis or systemic inflammatory response 
syndrome (SIRS)), ischemia-reperfusion injury, endotoxin lethality, arthritis, 
complement-mediated hyperacute rejection, nephritis, cytokine or chemokine-induced lung 
injury, inflammatory bowel disease, Crohn's disease or resulting from over production of 

1 5 cytokines such as TNF or IL-1 . Compositions of the invention may also be useful to treat 

anaphylaxis and hypersensitivity to an antigenic substance or material. Compositions of this 
invention may be utilized to prevent or treat conditions such as, but not limited to, sepsis, 
acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid arthritis, chronic 
inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1 , graft versus 

20 host disease, inflammatory bowel disease, inflamation associated with pulmonary disease, 
other autoimmune disease or inflammatory disease, an antiproliferative agent such as for 
acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to 
intrauterine infections. 

25 4.10.16 LEUKEMIAS 

Leukemias and related disorders may be treated or prevented by administration of a 
therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of 
the invention. Such leukemias and related disorders include but are not limited to acute 
leukemia, acute lymphocytic leukemia, acute myelocytic leukemia, myeloblasts, 
30 promyelocyte, myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic 

myelocytic (granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such 
disorders, see Fishman et al., 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 
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4.10.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efficacy of 
intervention with compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an indication of 
5 therapeutic utility, include but are not limited to nervous system injuries, and diseases or 
disorders which result in either a disconnection of axons, a diminution or degeneration of 
neurons, or demyelination. Nervous system lesions which may be treated in a patient 
(including human and non-human mammalian patients) according to the invention include 
but are not limited to the following lesions of either the central (including spinal cord, brain) 
10 or peripheral nervous systems : 

(i) traumatic lesions, including lesions caused by physical injury or associated 
with surgery, for example, lesions which sever a portion of the nervous system, or 
compression injuries; 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system 
15 results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 

infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 
injured as a result of infection, for example, by an abscess or associated with infection by 
human immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme 

20 disease, tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or 
injured as a result of a degenerative process including but not limited to degeneration 
associated with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or 
amyotrophic lateral sclerosis; 

25 (v) lesions associated with nutritional diseases or disorders, in which a portion of 

the nervous system is destroyed or injured by a nutritional disorder or disorder of 
metabolism including but not limited to, vitamin B12 deficiency, folic acid deficiency, 
Wernicke disease, tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary 
degeneration of the corpus callosum), and alcoholic cerebellar degeneration; 

30 (vi) neurological lesions associated with systemic diseases including but not 

limited to diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, 
carcinoma, or sarcoidosis; 
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(vii) lesions caused by toxic substances including alcohol, lead, or particular 
neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is destroyed or 
injured by a demyelinating disease including but not limited to multiple sclerosis, human 

5 immunodeficiency virus-associated myelopathy, transverse myelopathy or various 

etiologies, progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 

Therapeutics which are useful according to the invention for treatment of a nervous 
system disorder may be selected by testing for biological activity in promoting the survival 
or differentiation of neurons. For example, and not by way of limitation, therapeutics which 
1 0 elicit any of the following effects may be useful according to the invention: 

(i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, 
e.g., choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 

1 5 (iv) decreased symptoms of neuron dysfunction in vivo. 

Such effects may be measured by any method known in the art. In preferred, 
non-limiting embodiments, increased survival of neurons may be measured by the method 
set forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons 
may be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or 

20 Brown et al. (1981, Ann. Rev. Neurosci. 4: 17-42); increased production of 

neuron-associated molecules may be measured by bioassay, enzymatic assay, antibody 
binding, Northern blot assay, etc., depending on the molecule to be measured; and motor 
neuron dysfunction may be measured by assessing the physical manifestation of motor 
neuron disorder, e.g., weakness, motor neuron conduction velocity, or functional disability. 

25 In specific embodiments, motor neuron disorders that may be treated according to the 

invention include but are not limited to disorders such as infarction, infection, exposure to 
toxin, trauma, surgical damage, degenerative disease or malignancy that may affect motor 
neurons as well as other components of the nervous system, as well as disorders that 
selectively affect neurons such as amyotrophic lateral sclerosis, and including but not limited 

30 to progressive spinal muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, 
infantile and juvenile muscular atrophy, progressive bulbar paralysis of childhood (Fazio- 
Londe syndrome), poliomyelitis and the post polio syndrome, and Hereditary Motorsensory 
Neuropathy (Charcot-Marie-Tooth Disease). 
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4.10.18 OTHER ACTIVITIES 

A polypeptide of the invention may also exhibit one or more of the following 
additional activities or effects: inhibiting the growth, infection or function of, or killing, 
5 infectious agents, including, without limitation, bacteria, viruses, fungi and other parasites; 
effecting (suppressing or enhancing) bodily characteristics, including, without limitation, 
height, weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or 
organ or body part size or shape (such as, for example, breast augmentation or diminution, 
change in bone form or shape); effecting biorhyfhms or circadian cycles or rhythms; 

10 effecting the fertility of male or female subjects; effecting the metabolism, catabolism, 
anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein, 
carbohydrate, vitamins, minerals, co-factors or other nutritional factors or component(s); 
effecting behavioral characteristics, including, without limitation, appetite, libido, stress, 
cognition (including cognitive disorders), depression (including depressive disorders) and 

1 5 violent behaviors; providing analgesic effects or other pain reducing effects; promoting 
differentiation and growth of embryonic stem cells in lineages other than hematopoietic 
lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies of 
the enzyme and treating deficiency-related diseases; treatment of hyperproliferative 
disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for 

20 example, the ability to bind antigens or complement); and the ability to act as an antigen in a 
vaccine composition to raise an immune response against such protein or another material or 
entity which is cross-reactive with such protein. 

4.10.19 IDENTIFICATION OF POLYMORPHISMS 

25 The demonstration of polymorphisms makes possible the identification of such 

polymorphisms in human subjects and the pharmacogenetic use of this information for 
diagnosis and treatment. Such polymorphisms may be associated with, e.g., differential 
predisposition or susceptibility to various disease states (such as disorders involving 
inflammation or immune response) or a differential response to drug administration, and this 

30 genetic information can be used to tailor preventive or therapeutic treatment appropriately. 
For example, the existence of a polymorphism associated with a predisposition to 
inflammation or autoimmune disease makes possible the diagnosis of this condition in 
humans by identifying the presence of the polymorphism. 
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Polymorphisms can be identified in a variety of ways known in the art which all 
generally involve obtaining a sample from a patient, analyzing DNA from the sample, 
optionally involving isolation or amplification of the DNA, and identifying the presence of 
the polymorphism in the DNA. For example, PGR may be used to amplify an appropriate 
5 fragment of genomic DNA which may then be sequenced. Alternatively, the DNA may be 
subjected to allele-specific oligonucleotide hybridization (in which appropriate 
oligonucleotides are hybridized to the DNA under conditions permitting detection of a single 
base mismatch) or to a single nucleotide extension assay (in which an oligonucleotide that 
hybridizes immediately adjacent to the position of the polymorphism is extended with one or 

1 0 more labeled nucleotides). In addition, traditional restriction fragment length polymorphism 
analysis (using restriction enzymes that provide differential digestion of the genomic DNA 
depending on the presence or absence of the polymorphism) may be performed. Arrays with 
nucleotide sequences of the present invention can be used to detect polymorphisms. The 
array can comprise modified nucleotide sequences of the present invention in order to detect 

15 the nucleotide sequences of the present invention. In the alternative, any one of the 

nucleotide sequences of the present invention can be placed on the array to detect changes 
from those sequences. 

Alternatively a polymorphism resulting in a change in the amino acid sequence could 
also be detected by detecting a corresponding change in amino acid sequence of the protein, 

20 e.g., by an antibody specific to the variant sequence. 

4.10.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against 
rheumatoid arthritis is determined in an experimental animal model system. The 

25 experimental model system is adjuvant induced arthritis in rats, and the protocol is described 
by J. Holoshitz, et at, 1983, Science, 219:56, or by B. Waksman et al., 1963, Int. Arch. 
Allergy Appl. Immunol., 23 : 129. Induction of the disease can be caused by a single 
injection, generally intradermally, of a suspension of killed Mycobacterium tuberculosis in 
complete Freund's adjuvant (CFA). The route of injection can vary, but rats may be injected 

30 at the base of the tail with an adjuvant mixture. The polypeptide is administered in phosphate 
buffered solution (PBS) at a dose of about 1-5 mg/kg. The control consists of administering 
PBS only. 
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The procedure for testing the effects of the test compound would consist of 
intradermal^ injecting killed Mycobacterium tuberculosis in CFA followed by immediately 
administering the test compound and subsequent treatment every other day until day 24. At 
14, 15, 18, 20, 22, and 24 days after injection of Mycobacterium CFA, an overall arthritis 
5 score may be obtained as described by J. Holoskitz above. An analysis of the data would 
reveal that the test compound would have a dramatic affect on the swelling of the joints as 
measured by a decrease of the arthritis score. 

4.11 THERAPEUTIC METHODS 

10 The compositions (including polypeptide fragments, analogs, variants and antibodies 

or other binding partners or modulators including antisense polynucleotides) of the invention 
have numerous applications in a variety of therapeutic methods. Examples of therapeutic 
applications include, but are not limited to, those exemplified herein. 

15 4.11.1 EXAMPLE 

One embodiment of the invention is the administration of an effective amount of the 
polypeptides or other composition of the invention to individuals affected by a disease or 
disorder that can be modulated by regulating the peptides of the invention. While the mode 
of administration is not particularly important, parenteral administration is preferred. An 

20 exemplary mode of administration is to deliver an intravenous bolus. The dosage of the 
polypeptides or other composition of the invention will normally be determined by the 
prescribing physician. It is to be expected that the dosage will vary according to the age, 
weight, condition and response of the individual patient. Typically, the amount of 
polypeptide administered per dose will be in the range of about 0.01 |ng/kg to 100 mg/kg of 

25 body weight, with the preferred dose being about 0. 1 |Ug/kg to 1 0 mg/kg of patient body 

weight. For parenteral administration, polypeptides of the invention will be formulated in an 
injectable form combined with a pharmaceutically acceptable parenteral vehicle. Such 
vehicles are well known in the art and examples include water, saline, Ringer's solution, 
dextrose solution, and solutions consisting of small amounts of the human serum albumin. 

30 The vehicle may contain minor amounts of additives that maintain the isotonicity and 

stability of the polypeptide or other active ingredient. The preparation of such solutions is 
within the skill of the art. 
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4.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 

ADMINISTRATION 

A protein or other composition of the present invention (from whatever source 
derived, including without limitation from recombinant and non-recombinant sources and 
5 including antibodies and other binding partners of the polypeptides of the invention) may be 
administered to a patient in need, by itself, or in pharmaceutical compositions where it is 
mixed with suitable carriers or excipient(s) at doses to treat or ameliorate a variety of 
disorders. Such a composition may optionally contain (in addition to protein or other active 
ingredient and a carrier) diluents, fillers, salts, buffers, stabilizers, solubilizers, and other 

10 materials well known in the art. The term "pharmaceutically acceptable" means a non-toxic 
material that does not interfere with the effectiveness of the biological activity of the active 
ingredient(s). The characteristics of the carrier will depend on the route of administration. 
The pharmaceutical composition of the invention may also contain cytokines, lymphokines, 
or other hematopoietic factors such as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, 

15 IL-6, IL-7, IL-8, IL-9, IL-10, IL-1 1, IL-12, IL-13, IL-14, IL-15, IFN, TNFO, TNF1, TNF2, 
G-CSF, Meg-CSF, thrombopoietin, stem cell factor, and erythropoietin. In further 
compositions, proteins of the invention may be combined with other agents beneficial to the 
treatment of the disease or disorder in question. These agents include various growth factors 
such as epidermal growth factor (EGF), platelet-derived growth factor (PDGF), transforming 

20 growth factors (TGF-a and TGF-P), insulin-like growth factor (IGF), as well as cytokines 
described herein. 

The pharmaceutical composition may further contain other agents which either 
enhance the activity of the protein or other active ingredient or complement its activity or 
use in treatment Such additional factors and/or agents may be included in the 

25 pharmaceutical composition to produce a synergistic effect with protein or other active 

ingredient of the invention, or to minimize side effects. Conversely, protein or other active 
ingredient of the present invention may be included in formulations of the particular clotting 
factor, cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic 
factor, or anti- inflammatory agent to minimize side effects of the clotting factor, cytokine, 

30 lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or 

anti-inflammatory agent (such as IL-IRa, IL-1 Hyl, IL-1 Hy2, anti-TNF, corticosteroids, 
immunosuppressive agents). A protein of the present invention may be active in multimers 
(e.g., heterodimers or homodimers) or complexes with itself or other proteins. As a result, 
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pharmaceutical compositions of the invention may comprise a protein of the invention in 
such multimeric or complexed form. 

As an alternative to being included in a pharmaceutical composition of the invention 
including a first protein, a second protein or a therapeutic agent may be concurrently 
5 administered with the first protein (e.g., at the same time, or at differing times provided that 
therapeutic concentrations of the combination of agents is achieved at the treatment site). 
Techniques for formulation and administration of the compounds of the instant application 
may be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA 5 
latest edition. A therapeutically effective dose further refers to that amount of the compound 

10 sufficient to result in amelioration of symptoms, e.g., treatment, healing, prevention or 

amelioration of the relevant medical condition, or an increase in rate of treatment, healing, 
prevention or amelioration of such conditions. When applied to an individual active 1 
ingredient, administered alone, a therapeutically effective dose refers to that ingredient 
alone. When applied to a combination, a therapeutically effective dose refers to combined 

15 amounts of the active ingredients that result in the therapeutic effect, whether administered 
in combination, serially or simultaneously. 

In practicing the method of treatment or use of the present invention, a 
therapeutically effective amount of protein or other active ingredient of the present invention 
is administered to a mammal having a condition to be treated. Protein or other active 

20 ingredient of the present invention may be administered in accordance with the method of 
the invention either alone or in combination with other therapies such as treatments 
employing cytokines, lymphokines or other hematopoietic factors. When co- administered 
with one or more cytokines, lymphokines or other hematopoietic factors, protein or other 
active ingredient of the present invention may be administered either simultaneously with 

25 the cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or 

anti-thrombotic factors, or sequentially. If administered sequentially, the attending physician 
will decide on the appropriate sequence of administering protein or other active ingredient of 
the present invention in combination with cytokine(s), lymphokine(s), other hematopoietic 
factor(s), thrombolytic or anti-thrombotic factors. 

30 

4.12.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, 
transmucosal, or intestinal administration; parenteral delivery, including intramuscular, 
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subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, 
intravenous, intraperitoneal, intranasal, or intraocular injections. Administration of protein 
or other active ingredient of the present invention used in the pharmaceutical composition or 
to practice the method of the present invention can be carried out in a variety of conventional 
5 ways, such as oral ingestion, inhalation, topical application or cutaneous, subcutaneous, 

intraperitoneal, parenteral or intravenous injection. Intravenous administration to the patient 
is preferred. 

Alternately, one may administer the compound in a local rather than systemic 
manner, for example, via injection of the compound directly into a arthritic joints or in 

10 fibrotic tissue, often in a depot or sustained release formulation. In order to prevent the 

scarring process frequently occurring as complication of glaucoma surgery, the compounds 
may be administered topically, for example, as eye drops. Furthermore, one may administer 
the drug in a targeted drug delivery system, for example, in a liposome coated with a specific 
antibody, targeting, for example, arthritic or fibrotic tissue. The liposomes will be targeted 

15 to and taken up selectively by the afflicted tissue. 

The polypeptides of the invention are administered by any route that delivers an 
effective dosage to the desired site of action. The determination of a suitable route of 
administration and an effective dosage for a particular indication is within the level of skill 
in the art. Preferably for wound treatment, one administers the therapeutic compound 

20 directly to the site. Suitable dosage ranges for the polypeptides of the invention can be 
extrapolated from these dosages or from similar studies in appropriate animal models. 
Dosages can then be adjusted as necessary by the clinician to provide maximal therapeutic 
benefit. 

25 4.12.2 COMPOSITIONS/FORMULATIONS 

9 

Pharmaceutical compositions for use in accordance with the present invention thus 
may be formulated in a conventional manner using one or more physiologically acceptable 
carriers comprising excipients and auxiliaries which facilitate processing of the active 
compounds into preparations which can be used pharmaceutically. These pharmaceutical 
30 compositions may be manufactured in a manner that is itself known, e.g. , by means of 
conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, 
encapsulating, entrapping or lyophilizing processes. Proper formulation is dependent upon 
the route of administration chosen. When a therapeutically effective amount of protein or 
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other active ingredient of the present invention is administered orally, protein or other active 
ingredient of the present invention will be in the form of a tablet, capsule, powder, solution 
or elixir. When administered in tablet form, the pharmaceutical composition of the invention 
may additionally contain a solid carrier such as a gelatin or an adjuvant. The tablet, capsule, 
5 and powder contain from about 5 to 95% protein or other active ingredient of the present 
invention, and preferably from about 25 to 90% protein or other active ingredient of the 
present invention. When administered in liquid form, a liquid carrier such as water, 
petroleum, oils of animal or plant origin such as peanut oil, mineral oil, soybean oil, or 
sesame oil, or synthetic oils may be added. The liquid form of the pharmaceutical 

10 composition may further contain physiological saline solution, dextrose or other saccharide 
solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. When 
administered in liquid form, the pharmaceutical composition contains from about 0.5 to 90% 
by weight of protein or other active ingredient of the present invention, and preferably from 
about 1 to 50% protein or other active ingredient of the present invention. 

1 5 When a therapeutically effective amount of protein or other active ingredient of the 

present invention is administered by intravenous, cutaneous or subcutaneous injection, 
protein or other active ingredient of the present invention will be in the form of a 
pyrogen-free, parenterally acceptable aqueous solution. The preparation of such parenterally 
acceptable protein or other active ingredient solutions, having due regard to pH, isotonicity, 

20 stability, and the like, is within the skill in the art. A preferred pharmaceutical composition 
for intravenous, cutaneous, or subcutaneous injection should contain, in addition to protein 
or other active ingredient of the present invention, an isotonic vehicle such as Sodium 
Chloride Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride 
Injection, Lactated Ringer's Injection, or other vehicle as known in the art. The 

i 

25 pharmaceutical composition of the present invention may also contain stabilizers, 

preservatives, buffers, antioxidants, or other additives known to those of skill in the art. For 
injection, the agents of the invention may be formulated in aqueous solutions, preferably in 
physiologically compatible buffers such as Hanks's solution, Ringer's solution, or 
physiological saline buffer. For transmucosal administration, penetrants appropriate to the 

30 barrier to be permeated are used in the formulation. Such penetrants are generally known in 
the art. 

For oral administration, the compounds can be formulated readily by combining the 
active compounds with pharmaceutical^ acceptable carriers well known in the art. Such 
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carriers enable the compounds of the invention to be formulated as tablets, pills, dragees, 
capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a 
patient to be treated. Pharmaceutical preparations for oral use can be obtained from a solid 
excipient, optionally grinding a resulting mixture, and processing the mixture of granules, 
5 after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable 

excipients are, in particular, fillers such as sugars, including lactose, sucrose, mannitol, or 
sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, 
potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, 
sodium carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, 

10 disintegrating agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or 
alginic acid or a salt thereof such as sodium alginate. Dragee cores are provided with 
suitable coatings. For this purpose, concentrated sugar solutions may be used, which may 
optionally contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, 
and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. 

15 Dyestuffs or pigments may be added to the tablets or dragee coatings for identification or to 
characterize different combinations of active compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules made 
of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol 
or sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler 

20 such as lactose, binders such as starches, and/or lubricants such as talc or magnesium 

stearate and, optionally, stabilizers. In soft capsules, the active compounds may be dissolved 
or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene 
glycols. In addition, stabilizers may be added. All formulations for oral administration 
should be in dosages suitable for such administration. For buccal administration, the 

25 compositions may take the form of tablets or lozenges formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
invention are conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., 
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide 

30 or other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined 
by providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin 
for use in an inhaler or insufflator may be formulated containing a powder mix of the 
compound and a suitable powder base such as lactose or starch. The compounds may be 



WO 03/054152 PCT/US02/39555 

78 

formulated for parenteral administration by injection, e.g., by bolus injection or continuous 
infusion. Formulations for injection may be presented in unit dosage form, e.g., in ampules 
or in multi-dose containers, with an added preservative. The compositions may take such 
forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain 
5 formulatory agents such as suspending, stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions 
of the active compounds in water-soluble form. Additionally, suspensions of the active 
compounds may be prepared as appropriate oily injection suspensions. Suitable lipophilic 
solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such 

10 as ethyl oleate or triglycerides, or liposomes. Aqueous injection suspensions may contain 
substances which increase the viscosity of the suspension, such as sodium carboxymethyl 
cellulose, sorbitol, or dextran. Optionally, the suspension may also contain suitable 
stabilizers or agents which increase the solubility of the compounds to allow for the 
preparation of highly concentrated solutions. Alternatively, the active ingredient may be in 

1 5 powder form for constitution with a suitable vehicle, e.g. , sterile pyrogen-free water, before 
use. 

The compounds may also be formulated in rectal compositions such as suppositories 
or retention enemas, e.g., containing conventional suppository bases such as cocoa butter or 
other glycerides. In addition to the formulations described previously, the compounds may 

20 also be formulated as a depot preparation. Such long acting formulations may be 

administered by implantation (for example subcutaneously or intramuscularly) or by 
intramuscular injection. Thus, for example, the compounds may be formulated with suitable 
polymeric or hydrophobic materials (for example as an emulsion in an acceptable oil) or ion 
exchange resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt. 

25 A pharmaceutical carrier for the hydrophobic compounds of the invention is a co- 

solvent system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic 
polymer, and an aqueous phase. The co-solvent system may be the VPD co-solvent system. 
VPD is a solution of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 
80, and 65% w/v polyethylene glycol 300, made up to volume in absolute ethanol. The VPD 

30 co-solvent system (VPD:5W) consists of VPD diluted 1:1 with a 5% dextrose in water 

solution. This co-solvent system dissolves hydrophobic compounds well, and itself produces 
low toxicity upon systemic administration. Naturally, the proportions of a co-solvent system 
may be varied considerably without destroying its solubility and toxicity characteristics. 
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Furthermore, the identity of the co-solvent components may be varied: for example, other 
low-toxicity nonpolar surfactants may be used instead of polysorbate 80; the fraction size of 
polyethylene glycol may be varied; other biocompatible polymers may replace polyethylene 
glycol, e.g. polyvinyl pyrrolidone; and other sugars or polysaccharides may substitute for 
5 dextrose. Alternatively, other delivery systems for hydrophobic pharmaceutical compounds 
may be employed. Liposomes and emulsions are well known examples of delivery vehicles 
or carriers for hydrophobic drugs. Certain organic solvents such as dimethylsulfoxide also 
may be employed, although usually at the cost of greater toxicity. Additionally, the 
compounds may be delivered using a sustained-release system, such as semipermeable 

10 matrices of solid hydrophobic polymers containing the therapeutic agent. Various types of 
sustained-release materials have been established and are well known by those skilled in the 
art. Sustained-release capsules may, depending on their chemical nature, release the 
compounds for a few weeks up to over 100 days. Depending on the chemical nature and the 
biological stability of the therapeutic reagent, additional strategies for protein or other active 

1 5 ingredient stabilization may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase 
carriers or excipients. Examples of such carriers or excipients include but are not limited to 
calcium carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, 
gelatin, and polymers such as polyethylene glycols. Many of the active ingredients of the 

20 invention may be provided as salts with pharmaceutically compatible counter ions. Such 
pharmaceutically acceptable base addition salts are those salts which retain the biological 
effectiveness and properties of the free acids and which are obtained by reaction with 
inorganic or organic bases such as sodium hydroxide, magnesium hydroxide, ammonia, 
trialkylamine, dialkylamine, mono alky lamine , dibasic amino acids, sodium acetate, 

25 potassium benzoate, triethanol amine and the like. 

The pharmaceutical composition of the invention may be in the form of a complex of 
the protein(s) or other active ingredient(s) of present invention along with protein or peptide 
antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T 
lymphocytes. B lymphocytes will respond to antigen through their surface immunoglobulin 

30 receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) 

following presentation of the antigen by MHC proteins. MHC and structurally related 
proteins including those encoded by class I and class II MHC genes on host cells will serve 
to present the peptide antigen(s) to T lymphocytes. The antigen components could also be 
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supplied as purified MHC-peptide complexes alone or with co-stimulatory molecules that 
can directly signal T cells. Alternatively antibodies able to bind surface immunoglobulin 
and other molecules on B cells as well as antibodies able to bind the TCR and other 
molecules on T cells can be combined with the pharmaceutical composition of the invention. 
5 The pharmaceutical composition of the invention may be in the form of a liposome in 

which protein of the present invention is combined, in addition to other pharmaceutically 
acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as 
micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. 
Suitable lipids for liposomal formulation include, without limitation, monoglycerides, 

10 diglycerides, sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like. 

Preparation of such liposomal formulations is within the level of skill in the art, as disclosed, 
for example, in U.S. Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of 
which are incorporated herein by reference. 

The amount of protein or other active ingredient of the present invention in the 

15 pharmaceutical composition of the present invention will depend upon the nature and 
severity of the condition being treated, and on the nature of prior treatments which the 
patient has undergone. Ultimately, the attending physician will decide the amount of protein 
or other active ingredient of the present invention with which to treat each individual patient. 
Initially, the attending physician will administer low doses of protein or other active 

20 ingredient of the present invention and observe the patient's response. Larger doses of 
protein or other active ingredient of the present invention may be administered until the 
optimal therapeutic effect is obtained for the patient, and at that point the dosage is not 
increased further. It is contemplated that the various pharmaceutical compositions used to 
practice the method of the present invention should contain about 0.01 \ig to about 100 mg 

25 (preferably about 0.1 jug to about 10 mg, more preferably about 0.1 \xg to about 1 mg) of 
protein or other active ingredient of the present invention per kg body weight. For 
compositions of the present invention which are useful for bone, cartilage, tendon or 
ligament regeneration, the therapeutic method includes administering the composition 
topically, systematically, or locally as an implant or device. When administered, the 

30 therapeutic composition for use in this invention is, of course, in a pyrogen-ftee, 

physiologically acceptable form. Further, the composition may desirably be encapsulated or 
injected in a viscous form for delivery to the site of bone, cartilage or tissue damage. 
Topical administration may be suitable for wound healing and tissue repair. Therapeutically 
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useful agents other than a protein or other active ingredient of the invention which may also 
optionally be included in the composition as described above, may alternatively or 
additionally, be administered simultaneously or sequentially with the composition in the 
methods of the invention. Preferably for bone and/or cartilage formation, the composition 
5 would include a matrix capable of delivering the protein-containing or other active 

ingredient-containing composition to the site of bone and/or cartilage damage, providing a 
structure for the developing bone and cartilage and optimally capable of being resorbed into 
the body. Such matrices may be formed of materials presently in use for other implanted 
medical applications. 

1 0 The choice of matrix material is based on biocompatibility, biodegradability, 

mechanical properties, cosmetic appearance and interface properties. The particular 
application of the compositions will define the appropriate formulation. Potential matrices 
for the compositions may be biodegradable and chemically defined calcium sulfate, 
tricalcium phosphate, hydroxyapatite, polylactic acid, polyglycolic acid and poly anhydrides. 

1 5 Other potential materials are biodegradable and biologically well-defined, such as bone or 
dermal collagen. Further matrices are comprised of pure proteins or extracellular matrix 
components. Other potential matrices are nonbiodegradable and chemically defined, such as 
sintered hydroxyapatite, bioglass, aluminates, or other ceramics. Matrices may be comprised 
of combinations of any of the above-mentioned types of material, such as polylactic acid and 

20 hydroxyapatite or collagen and tricalcium phosphate. The bioceramics may be altered in 
composition, such as in calcium-aluminate-phosphate and processing to alter pore size, 
particle size, particle shape, and biodegradability. Presently preferred is a 50:50 (mole 
weight) copolymer of lactic acid and glycolic acid in the form of porous particles having 
diameters ranging from 150 to 800 microns. In some applications, it will be useful to utilize 

25 a sequestering agent, such as carboxymethyl cellulose or autologous blood clot, to prevent 
the protein compositions from disassociating from the matrix. 

A preferred family of sequestering agents is cellulosic materials such as 
alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, 
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, 

30 hydroxypropyl-methylcellulose, and carboxymethylcellulose, the most preferred being 
cationic salts of carboxymethylcellulose (CMC). Other preferred sequestering agents 
include hyaluronic acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, 
carboxyvinyl polymer and poly (vinyl alcohol). The amount of sequestering agent useful 
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herein is 0.5-20 wt %, preferably 1-10 wt % based on total formulation weight, which 
represents the amount necessary to prevent desorption of the protein from the polymer 
matrix and to provide appropriate handling of the composition, yet not so much that the 
progenitor cells are prevented from infiltrating the matrix, thereby providing the protein the 
5 opportunity to assist the osteogenic activity of the progenitor cells. In further compositions, 
proteins or other active ingredients of the invention may be combined with other agents 
beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in question. 
These agents include various growth factors such as epidermal growth factor (EGF), platelet 
derived growth factor (PDGF), transforming growth factors (TGF-oc and TGF-P), and 

10 insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary applications. 
Particularly domestic animals and thoroughbred horses, in addition to humans, are desired 
patients for such treatment with proteins or other active ingredients of the present invention. 
The dosage regimen of a protein-containing pharmaceutical composition to be used in tissue 

1 5 regeneration will be determined by the attending physician considering various factors which 
modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site 
of damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue 
(e.g. , bone), the patient's age, sex, and diet, the severity of any infection, time of 
administration and other clinical factors. The dosage may vary with the type of matrix used 

20 in the reconstitution and with inclusion of other proteins in the pharmaceutical composition. 
For example, the addition of other known growth factors, such as IGF I (insulin like growth 
factor I), to the final composition, may also effect the dosage. Progress can be monitored by 
periodic assessment of tissue/bone growth and/or repair, for example, X-rays, 
histomorphometric determinations and tetracycline labeling. 

25 Polynucleotides of the present invention can also be used for gene therapy. Such 

polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other 
known methods for introduction of nucleic acid into a cell or organism (including, without 
limitation, in the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in 

30 the presence of proteins of the present invention in order to proliferate or to produce a 

desired effect on or activity in such cells. Treated cells can then be introduced in vivo for 
therapeutic purposes. 
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4.12.3 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve 
its intended purpose. More specifically, a therapeutically effective amount means an amount 
effective to prevent development of or to alleviate the existing symptoms of the subject 
being treated. Determination of the effective amount is well within the capability of those 
skilled in the art, especially in light of the detailed disclosure provided herein. For any 
compound used in the method of the invention, the therapeutically effective dose can be 
estimated initially from appropriate in vitro assays. For example, a dose can be formulated in 
animal models to achieve a circulating concentration range that can be used to more 
accurately determine useful doses in humans. For example, a dose can be formulated in 
animal models to achieve a circulating concentration range that includes the IC50 as 
determined in cell culture (i.e., the concentration of the test compound which achieves a 
half-maximal inhibition of the protein's biological activity). Such information can be used 
to more accurately determine useful doses in humans. 

A therapeutically effective dose refers to that amount of the compound that results in 
amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic 
efficacy of such compounds can be determined by standard pharmaceutical procedures in 
cell cultures or experimental animals, e.g., for determining the LD50 (the dose lethal to 50% 
of the population) and the ED50 (the dose therapeutically effective in 50% of the population). 
The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be 
expressed as the ratio between LD 50 and ED50. Compounds which exhibit high therapeutic 
indices are preferred. The data obtained from these cell culture assays and animal studies 
can be used in formulating a range of dosage for use in human. The dosage of such 
compounds lies preferably within a range of circulating concentrations that include the ED50 
with little or no toxicity. The dosage may vary within this range depending upon the dosage 
form employed and the route of administration utilized. The exact formulation, route of 
administration and dosage can be chosen by the individual physician in view of the patient's 
condition. See, e.g., Fingl et al., 1975, in "The Pharmacological Basis of Therapeutics", Ch. 
1 p. 1 . Dosage amount and interval may be adjusted individually to provide plasma levels of 
the active moiety which are sufficient to maintain the desired effects, or minimal effective 
concentration (MEC). The MEC will vary for each compound but can be estimated from in 
vitro data. Dosages necessary to achieve the MEC will depend on individual characteristics 
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and route of administration. However, HPLC assays or bioassays can be used to determine 
plasma concentrations. 

Dosage intervals can also be determined using MEC value. Compounds should be 
administered using a regimen which maintains plasma levels above the MEC for 10-90% of 
5 the time, preferably between 30-90% and most preferably between 50-90%). In cases of local 
administration or selective uptake, the effective local concentration of the drug may not be 
related to plasma concentration. 

An exemplary dosage regimen for polypeptides or other compositions of the 
invention will be in the range of about 0.01 p,g/kg to 100 mg/kg of body weight daily, with 
1 0 the preferred dose being about 0. 1 jag/kg to 25 mg/kg of patient body weight daily, varying 
in adults and children. Dosing may be once daily, or equivalent doses may be delivered at 

longer or shorter intervals. 

The amount of composition administered will, of course, be dependent on the subject 
being treated, on the subject's age and weight, the severity of the affliction, the manner of 
1 5 administration and the judgment of the prescribing physician. 



4.12.4 PACKAGING 

The compositions may, if desired, be presented in a pack or dispenser device which 
may contain one or more unit dosage forms containing the active ingredient. The pack may, 
20 for example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser 
device may be accompanied by instructions for administration. Compositions comprising a 
compound of the invention formulated in a compatible pharmaceutical carrier may also be 
prepared, placed in an appropriate container, and labeled for treatment of an indicated 
condition. 

25 

4.13 ANTIBODIES 

Also included in the invention are antibodies to proteins, or fragments of proteins of 
the invention. The term 11 antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that 
30 contain an antigen-binding site that specifically binds (immunoreacts with) an antigen. Such 
antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, 
F ab , F a v and F( ab *)2 fragments, and an F ab expression library. In general, an antibody molecule 
obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ 



WO 03/054152 PCT/US02/39555 

85 

from one another by the nature of the heavy chain present in the molecule. Certain classes 
have subclasses as well, such as IgGi, IgG 2 , and others. Furthermore, in humans, the light 
chain may be a kappa chain or a lambda chain. Reference herein to antibodies includes a 
reference to all such classes, subclasses and types of human antibody species. 
5 An isolated related protein of the invention may be intended to serve as an antigen, or 

a portion or fragment thereof, and additionally can be used as an immunogen to generate 
antibodies that immunospecifically bind the antigen, using standard techniques for 
polyclonal and monoclonal antibody preparation. The full-length protein can be used or, 
alternatively, the invention provides antigenic peptide fragments of the antigen for use as 

10 immunogens. An antigenic peptide fragment comprises at least 6 amino acid residues of the 
amino acid sequence of the full length protein, such as an amino acid sequence shown in 
SEQ ID NO: 912-1822, or 2479-3134, or Tables 3A, 3B, 5, or 6, and encompasses an 
epitope thereof such that an antibody raised against the peptide forms a specific immune 
complex with the full length protein or with any fragment that contains the epitope. 

15 Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least 15 
amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. 
Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are 
located on its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 

20 antigenic peptide is a surface region of the protein, e.g., a hydrophilic region. A 

hydrophobicity analysis of the human related protein sequence will indicate which regions of 
a related protein are particularly hydrophilic and, therefore, are likely to encode surface 
residues useful for targeting antibody production. As a means for targeting antibody 
production, hydropathy plots showing regions of hydrophilicity and hydrophobicity may be 

25 generated by any method well known in the art, including, for example, the Kyte Doolittle or 
the Hopp Woods methods, either with or without Fourier transformation. See, e.g., Hopp and 
Woods, 1981, Proc. Nat. Acad. Sci. USA 78: 3824-3828; Kyte and Doolittle 1982, J. Mol. 
Biol. 157: 105-142, each of which is incorporated herein by reference in its entirety. 
Antibodies that are specific for one or more domains within an antigenic protein, or 

30 derivatives, fragments, analogs or homologs thereof, are also provided herein. 

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 
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The term "specific for" indicates that the variable regions of the antibodies of the 
invention recognize and bind polypeptides of the invention exclusively (i.e., able to 
distinguish the polypeptide of the invention from other similar polypeptides despite sequence 
identity, homology, or similarity found in the family of polypeptides), but may also interact 
with other proteins (for example, S. aureus protein A or other antibodies in ELISA 
techniques) through interactions with sequences outside the variable region of the antibodies, 
and in particular, in the constant region of the molecule. Screening assays to determine 
binding specificity of an antibody of the invention are well known and routinely practiced in 
the art. For a comprehensive discussion of such assays, see Harlow et al. (Eds), Antibodies 
A Laboratory Manual; Cold Spring Harbor Laboratory; Cold Spring Harbor, NY (1988), 
Chapter 6. Antibodies that recognize and bind fragments of the polypeptides of the 
invention are also contemplated, provided that the antibodies are first and foremost specific 
for, as defined above, full-length polypeptides of the invention. As with antibodies that are 
specific for full length polypeptides of the invention, antibodies of the invention that 
recognize fragments are those which can distinguish polypeptides from the same family of 
polypeptides despite inherent sequence identity, homology, or similarity found in the family 
of proteins. 

Antibodies of the invention are useful for, for example, therapeutic purposes (by 
modulating activity of a polypeptide of the invention), diagnostic purposes to detect or 
quantitate a polypeptide of the invention, as well as purification of a polypeptide of the 
invention. Kits comprising an antibody of the invention for any of the purposes described 
herein are also comprehended. In general, a kit of the invention also includes a control 
antigen for which the antibody is immunospecific. The invention further provides a 
hybridoma that produces an antibody according to the invention. Antibodies of the 
invention are useful for detection and/or purification of the polypeptides of the invention. 

Monoclonal antibodies binding to the protein of the invention may be useful 
diagnostic agents for the immunodetection of the protein. Neutralizing monoclonal 
antibodies binding to the protein may also be useful therapeutics for both conditions 
associated with the protein and also in the treatment of some forms of cancer where 
abnormal expression of the protein is involved. In the case of cancerous cells or leukemic 
cells, neutralizing monoclonal antibodies against the protein may be useful in detecting and 
preventing the metastatic spread of the cancerous cells, which may be mediated by the 
protein. 
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The labeled antibodies of the present invention can be used for in vitro, in vivo, and 
in situ assays to identify cells or tissues in which a fragment of the polypeptide of interest is 
expressed. The antibodies may also be used directly in therapies or other diagnostics. The 
present invention further provides the above-described antibodies immobilized on a solid 
support. Examples of such solid supports include plastics such as polycarbonate, complex 
carbohydrates such as agarose and Sepharose®, acrylic resins and such as polyacrylamide 
and latex beads. Techniques for coupling antibodies to such solid supports are well known 
in the art (Weir, D.M. et al., "Handbook of Experimental Immunology" 4th Ed., Blackwell 
Scientific Publications, Oxford, England, Chapter 10 (1986); Jacoby, W.D. et al., Meth. 
Enzym. 34 Academic Press, N.Y. (1974)). The immobilized antibodies of the present 
invention can be used for in vitro, in vivo, and in situ assays as well as for immuno-affinity 
purification of the proteins of the present invention. 

Various procedures known within the art may be used for the production of 
polyclonal or monoclonal antibodies directed against a protein of the invention, or against 
derivatives, fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: 
A Laboratory Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY, incorporated herein by reference). Some of these antibodies are 
discussed below. 

4.13.1 POLYCLONAL ANTIBODIES 

For the production of polyclonal antibodies, various suitable host animals (e.g., 
rabbit, goat, mouse or other mammal) may be immunized by one or more injections with the 
native protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 
recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated 
to a second protein known to be immunogenic in the mammal being immunized. Examples 
of such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, 
serum albumin, bovine thyreoglobulin, and soybean trypsin inhibitor. The preparation can 
further include an adjuvant. Various adjuvants used to increase the immunological response 
include, but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., 
aluminum hydroxide), surface-active substances (e.g., lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, dinitrophenol, etc.), adjuvants usable in humans such as 
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Bacille Calmette-Guerin and Corynebacterium parvum, or similar immuno stimulatory 
agents. Additional examples of adjuvants that can be employed include MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 

The polyclonal antibody molecules directed against the immunogenic protein can be 
5 isolated from the mammal (e.g., from the blood) and further purified by well known 

techniques, such as affinity chromatography using protein A or protein G, which provide 
primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific 
antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be 
immobilized on a column to purify the immune specific antibody by immunoaffinity 
10 chromatography. Purification of immunoglobulins is discussed, for example, by D. 

Wilkinson (The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 
(April 17, 2000), pp. 25-28). 



4.13.2 MONOCLONAL ANTIBODIES 

15 The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as 

used herein, refers to a population of antibody molecules that contain only one molecular 
species of antibody molecule consisting of a unique light chain gene product and a unique 
heavy chain gene product. In particular, the complementarity determining regions (CDRs) 
of the monoclonal antibody are identical in all the molecules of the population. MAbs thus 

20 contain an antigen-binding site capable of immunoreacting with a particular epitope of the 
antigen characterized by a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature, 256, 495 (1975). In a hybridoma method, a 
mouse, hamster, or other appropriate host animal, is typically immunized with an 

25 immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies 
that will specifically bind to the immunizing agent. Alternatively, the lymphocytes can be 
immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof 
or a fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells 
30 of human origin are desired, or spleen cells or lymph node cells are used if non-human 

mammalian sources are desired. The lymphocytes are then fused with an immortalized cell 
line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell 
(Goding, Monoclonal Antibodies: Principles and Practice, Academic Press, (1986) pp. 59- 



i 
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103). Immortalized cell lines are usually transformed mammalian cells, particularly 
myeloma cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell 
lines are employed. The hybridoma cells can be cultured in a suitable culture medium that 
preferably contains one or more substances that inhibit the growth or survival of the unfused, 
5 immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas 
typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which 
substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high 

1 0 level expression of antibody by the selected antibody-producing cells, and are sensitive to a 
medium such as HAT medium. More preferred immortalized cell lines are murine myeloma 
lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, 
San Diego, California and the American Type Culture Collection, Manassas, Virginia. 
Human myeloma and mouse-human heteromyeloma cell lines also have been described for 

15 the production of human monoclonal antibodies (Kozbor, J. Immunol., 133:3001 (1984); 
Brodeur et al., Monoclonal Antibody Production Techniques and Applications, Marcel 
Dekker, Inc., New York, (1987) pp. 51-63). 

The culture medium in which the hybridoma cells are cultured can then be assayed 
for the presence of monoclonal antibodies directed against the antigen. Preferably, the 

20 binding specificity of monoclonal antibodies produced by the hybridoma cells is determined 
by immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in 
the art. The binding affinity of the monoclonal antibody can, for example, be determined by 
the Scatchard analysis of Munson and Pollard, Anal. Biochem., 107, 220 (1980). Preferably, 

25 antibodies having a high degree of specificity and a high binding affinity for the target 
antigen are isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by 
limiting dilution procedures and grown by standard methods. Suitable culture media for this 
purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 

30 medium. Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from 
the culture medium or ascites fluid by conventional immunoglobulin purification procedures 
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such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel 
electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 
those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of 
5 the invention can be readily isolated and sequenced using conventional procedures (e.g., by 
using oligonucleotide probes that are capable of binding specifically to genes encoding the 
heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as 
a preferred source of such DNA. Once isolated, the DNA can be placed into expression 
vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster 

10 ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, 
to obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA 
also can be modified, for example, by substituting the coding sequence for human heavy and 
light chain constant domains in place of the homologous murine sequences (U.S. Patent No. 
4,816,567; Morrison, Nature 368, 812-13 (1994)) or by covalently joining to the 

15 immunoglobulin coding sequence all or part of the coding sequence for a non- 

immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can be substituted 
for the constant domains of an antibody of the invention, or can be substituted for the 
variable domains of one antigen-combining site of an antibody of the invention to create a 
chimeric bivalent antibody. 

20 

4.13.3 HUMANIZED ANTIBODIES 

The antibodies directed against the protein antigens of the invention can further 
comprise humanized antibodies or human antibodies. These antibodies are suitable for 
administration to humans without engendering an immune response by the human against 

25 the administered immunoglobulin. Humanized forms of antibodies are chimeric 

immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', 
F(ab') 2 or other antigen-binding subsequences of antibodies) that are principally comprised 
of the sequence of a human immunoglobulin, and contain minimal sequence derived from a 
non-human immunoglobulin. Humanization can be performed following the method of 

30 Winter and co-workers (Jones et al., Nature, 321, 522-525 (1986); Riechmann et al., Nature, 
332, 323-327 (1988); Verhoeyen et al., Science, 239, 1534-1536 (1988)), by substituting 
rodent CDRs or CDR sequences for the corresponding sequences of a human antibody. (See 
also U.S. Patent No. 5,225,539). In some instances, Fv framework residues of the human 
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immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
can also comprise residues that are found neither in the recipient antibody nor in the 
imported CDR or framework sequences. In general, the humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 
5 substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the framework regions are those of a human immunoglobulin 

consensus sequence. The humanized antibody optimally also will comprise at least a portion 

i 

of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin 
(Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. Biol., 2, 593-596 
10 (1992)). 

4,13.4 HUMAN ANTIBODIES 

Fully human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from 

15 human genes. Such antibodies are termed "human antibodies", or "fully human antibodies" 
herein. Human monoclonal antibodies can be prepared by the trioma technique; the human 
B-cell hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV 
hybridoma technique to produce human monoclonal antibodies (see Cole, et aL, 1985 In: 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human 

20 monoclonal antibodies may be utilized in the practice of the present invention and may be 
produced by using human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80, 
2026-2030) or by transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et 
al., 1985 In: Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 
In addition, human antibodies can also be produced using additional techniques, 

25 including phage display libraries (Hoogenboom and Winter, J. Mol. Biol., 227, 381 (1991); 
Marks et al., J. Mol. Biol., 222:581 (1991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 

30 humans in all respects, including gene rearrangement, assembly, and antibody repertoire. 
This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 
5,569,825; 5,625,126; 5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 779- 
783 (1992)); Lonberg et al. (Nature 368, 856-859 (1994)); Morrison (Nature 368, 812-13 
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(1994) ); Fishwild et al, (Nature Biotechnology 14, 845-51 (1996)); Neuberger (Nature 
Biotechnology 14, 826 (1996)); and Lonberg and Huszar (Intern. Rev. Immunol. 13, 65-93 

(1995) ). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
that are modified so as to produce fully human antibodies rather than the animal's 
endogenous antibodies in response to challenge by an antigen. (See PCT publication 
WO94/02602). The endogenous genes encoding the heavy and light immunoglobulin chains 
in the nonhuman host have been incapacitated, and active loci encoding human heavy and 
light chain immunoglobulins are inserted into the host's genome. The human genes are 
incorporated, for example, using yeast artificial chromosomes containing the requisite 
human DNA segments. An animal which provides all the desired modifications is then 
obtained as progeny by crossbreeding intermediate transgenic animals containing fewer than 
the full complement of the modifications. The preferred embodiment of such a nonhuman 
animal is a mouse, and is termed the Xenomouse™ as disclosed in PCT publications WO 
96/33735 and WO 96/34096. This animal produces B cells that secrete fully human 
immunoglobulins. The antibodies can be obtained directly from the animal after 
immunization with an immunogen of interest, as, for example, a preparation of a polyclonal 
antibody, or alternatively from immortalized B cells derived from the animal, such as 
hybridomas producing monoclonal antibodies. Additionally, the genes encoding the 
immunoglobulins with human variable regions can be recovered and expressed to obtain the 
antibodies directly, or can be further modified to obtain analogs of antibodies such as, for 

example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, 
lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. 
Patent No. 5,939,598. It can be obtained by a method including deleting the J segment genes 
from at least one endogenous heavy chain locus in an embryonic stem cell to prevent 
rearrangement of the locus and to prevent formation of a transcript of a rearranged 
immunoglobulin heavy chain locus, the deletion being effected by a targeting vector 
containing a gene encoding a selectable marker; and producing from the embryonic stem cell 
a transgenic mouse whose somatic and germ cells contain the gene encoding the selectable 
marker. 

A method for producing an antibody of interest, such as a human antibody, is 
disclosed in U.S. Patent No. 5,916,771. It includes introducing an expression vector that 
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contains a nucleotide sequence encoding a heavy chain into one mammalian host cell in 
culture, introducing an expression vector containing a nucleotide sequence encoding a light 
chain into another mammalian host cell, and fusing the two cells to form a hybrid cell. The 
hybrid cell expresses an antibody containing the heavy chain and the light chain. 
5 In a further improvement on this procedure, a method for identifying a clinically 

relevant epitope on an immunogen, and a correlative method for selecting an antibody that 
binds immunospecifically to the relevant epitope with high affinity, are disclosed in PCT 
publication WO 99/53049. 

10 4.13.5 FAB FRAGMENTS AND SINGLE CHAIN ANTIBODIES 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent 
No. 4,946,778). In addition, methods can be adapted for the construction of F a b expression 
libraries (see e.g., Huse, et al., 1989 Science 246, 1275-1281) to allow rapid and effective 

15 identification of monoclonal F ab fragments with the desired specificity for a protein or 

derivatives, fragments, analogs or homologs thereof. Antibody fragments that contain the 
idiotypes to a protein antigen may be produced by techniques known in the art including, but 
not limited to: (i) an F (ab .) 2 fragment produced by pepsin digestion of an antibody molecule; 
(ii) an F a b fragment generated by reducing the disulfide bridges of an F (ab -)2 fragment; (iii) an 

20 F a b fragment generated by the treatment of the antibody molecule with papain and a reducing 
agent and (iv) F v fragments. 

4.13.6 BISPECIFIC ANTIBODIES 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies 
25 that have binding specificities for at least two different antigens. In the present case, one of 
the binding specificities is for an antigenic protein of the invention. The second binding 
target is any other antigen, and advantageously is a cell-surface protein or receptor or 
receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
30 recombinant production of bispecific antibodies is based on the co-expression of two 

immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature, 305, 537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) 
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produce a potential mixture of ten different antibody molecules, of which only one has the 
correct bispecific structure. The purification of the correct molecule is usually accomplished 
by affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, 
published 13 May 1993 5 and in Traunecker et ah, 1991 EMBOJ., 10, 3655-3659. 
5 Antibody variable domains with the desired binding specificities (antibody-antigen 

combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part 
of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant 
region (CHI) containing the site necessary for light-chain binding present in at least one of 

10 the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the 
immunoglobulin light chain, are inserted into separate expression vectors, and are co- 
transfected into a suitable host organism. For further details of generating bispecific 
antibodies see, for example, Suresh et al., Methods in Enzymology, 121, 210 (1986). 

According to another approach described in WO 96/2701 1, the interface between a 

15 pair of antibody molecules can be engineered to maximize the percentage of heterodimers 
that are recovered from recombinant cell culture. The preferred interface comprises at least 
a part of the CH3 region of an antibody constant domain. In this method, one or more small 
amino acid side chains from the interface of the first antibody molecule are replaced with 
larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or 

20 similar size to the large side chain(s) are created on the interface of the second antibody 
molecule by replacing large amino acid side chains with smaller ones (e.g. alanine or 
threonine). This provides a mechanism for increasing the yield of the heterodimer over other 
unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full-length antibodies or antibody fragments 

25 (e.g. F(ab') 2 bispecific antibodies). Techniques for generating bispecific antibodies from 

antibody fragments have been described in the literature. For example, bispecific antibodies 
can be prepared using chemical linkage. Brennan et al., Science 229, 81 (1985) describe a 
procedure wherein intact antibodies are proteolytically cleaved to generate F(ab')2 
fragments. These fragments are reduced in the presence of the dithiol complexing agent 

30 sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide formation. 
The Fab' fragments generated are then converted to thionitrobenzoate (TNB) derivatives. 
One of the Fab' -TNB derivatives is then reconverted to the Fab 5 -thiol by reduction with 
mercaptoethylamine and is mixed with an equimolar amount of the other Fab' -TNB 
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derivative to form the bispecific antibody. The bispecific antibodies produced can be used 
as agents for the selective immobilization of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med., 175, 217-225 (1992) 
5 describe the production of a fully humanized bispecific antibody F(ab') 2 molecule. Each 
Fab 5 fragment was separately secreted from E. coli and subjected to directed chemical 
coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was 
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as 
trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor targets. 

10 Various techniques for making and isolating bispecific antibody fragments directly 

from recombinant cell culture have also been described. For example, bispecific antibodies 
have been produced using leucine zippers. Kostelny et ah, J. Immunol. 148(5), 1547-1553 
(1992). The leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' 
portions of two different antibodies by gene fusion. The antibody homodimers were reduced 

15 at the hinge region to form monomers and then re-oxidized to form the antibody 

heterodimers. This method can also be utilized for the production of antibody homodimers. 
The "diabody" technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90, 
6444-6448 (1 993) has provided an alternative mechanism for making bispecific antibody 
fragments. The fragments comprise a heavy-chain variable domain (V H ) connected to a 

20 light-chain variable domain (V L ) by a linker which is too short to allow pairing between the 
two domains on the same chain. Accordingly, the Vh and Vl domains of one fragment are 
forced to pair with the complementary V L and V H domains of another fragment, thereby 
forming two antigen-binding sites. Another strategy for making bispecific antibody 
fragments by the use of single-chain Fv (sFv) dimers has also been reported. See, Gruber et 

25 al., J. Immunol. 152, 5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 
antibodies can be prepared. Tutt et al., J. Immunol. 147, 60 (1991). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of 
which originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm 

30 of an immunoglobulin molecule can be combined with an arm which binds to a triggering 
molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), 
or Fc receptors for IgG (FC7R), such as FcyRI (CD64), Fc 7 RII (CD32) and FC7RIII (CD 16) 
so as to focus cellular defense mechanisms to the cell expressing the particular antigen. 
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Bispecific antibodies can also be used to direct cytotoxic agents to cells which express a 
particular antigen. These antibodies possess an antigen-binding arm and an arm which binds 
a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPT A, DOTA, or TETA. 
Another bispecific antibody of interest binds the protein antigen described herein and further 
5 binds tissue factor (TF). 

4.13.7 HETEROCONJUGATE ANTIBODIES 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such 

10 antibodies have, for example, been proposed to target immune system cells to unwanted cells 
(U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 
92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro using 
known methods in synthetic protein chemistry, including those involving crosslinking 
agents. For example, immunotoxins can be constructed using a disulfide exchange reaction 

15 or by forming a thioether bond. Examples of suitable reagents for this purpose include 
iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. 
Patent No. 4,676,980. 

4.13.8 EFFECTOR FUNCTION ENGINEERING 

20 It can be desirable to modify the antibody of the invention with respect to effector 

function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing 
interchain disulfide bond formation in this region. The homodimeric antibody thus 
generated can have improved internalization capability and/or increased complement- 

25 mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et 
al., J. Exp Med., 176, 1191-1195 (1992) and Shopes, J. Immunol., 148, 2918-2922 (1992). 
Homodimeric antibodies with enhanced anti-tumor activity can also be prepared using 
heterobiflinctional cross-linkers as described in Wolff et al. Cancer Research, 53, 2560- 
2565 (1993). Alternatively, an antibody can be engineered that has dual Fc regions and can 

30 thereby have enhanced complement lysis and ADCC capabilities. See Stevenson et al., 
Anti-Cancer Drug Design, 3, 219-230 (1989). 



4.13.9 IMMUNOCONJUGATES 
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The invention also pertains to immunoconjugates comprising an antibody conjugated 
to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active 
toxin of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive 
isotope (i.e., a radioconjugate). 
5 Chemotherapeutic agents useful in the generation of such immunoconjugates have 

been described above. Enzymatically active toxins and fragments thereof that can be used 
include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A 
chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, 
alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins 

10 (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria 
officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the 
tricothecenes. A variety of radionuclides are available for the production of radioconjugated 
antibodies. Examples include 212 Bi, 131 I ? 131 In, 90 Y, and 186 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of 

1 5 Afunctional protein-coupling agents such as N-succinimidyl-3 -(2-pyridyldithiol) propionate 
(SPDP), iminothiolane (IT), biftmctional derivatives of imidoesters (such as dimethyl 
adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis- 
diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates 

20 (such as tolyene 2,6-diisocyanate), and bis-active fluorine compounds (such as 1,5-difluoro- 
2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in 
Vitetta et al., Science, 238: 1098 (1987). Carbon- 14-labeled l-isothiocyanatobenzyl-3- 
methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for 
conjugation of radionucleotide to the antibody. See W094/1 1026. 

25 In another embodiment, the antibody can be conjugated to a "receptor" (such 

streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
administered to the patient, followed by removal of unbound conjugate from the circulation 
using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn 
conjugated to a cytotoxic agent. 



4.14 COMPUTER READABLE SEQUENCES 
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In one application of this embodiment, a nucleotide sequence of the present invention 
can be recorded on computer readable media. As used herein, "computer readable media 11 
refers to any medium which can be read and accessed directly by a computer. Such media 
include, but are not limited to: magnetic storage media, such as floppy discs, hard disc 
5 storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical 
storage media such as RAM and ROM; and hybrids of these categories such as 
magnetic/optical storage media. A skilled artisan can readily appreciate how any of the 
presently known computer readable mediums can be used to create a manufacture 
comprising computer readable medium having recorded thereon a nucleotide sequence of the 

10 present invention. As used herein, "recorded" refers to a process for storing information on 
computer readable medium. A skilled artisan can readily adopt any of the presently known 
methods for recording information on computer readable medium to generate manufactures 
comprising the nucleotide sequence information of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating a 

1 5 computer readable medium having recorded thereon a nucleotide sequence of the present 
invention. The choice of the data storage structure will generally be based on the means 
chosen to access the stored information. In addition, a variety of data processor programs 
and formats can be used to store the nucleotide sequence information of the present 
invention on computer readable medium. The sequence information can be represented in a 

20 word processing text file, formatted in commercially-available software such as WordPerfect 
and Microsoft Word, or represented in the form of an ASCII file, stored in a database 
application, such as DB2, Sybase, Oracle, or the like. A skilled artisan can readily adapt any 
number of data processor structuring formats {e.g. text file or database) in order to obtain 
computer readable medium having recorded thereon the nucleotide sequence information of 

25 the present invention. 

By providing any of the nucleotide sequences SEQ ID NO: 1-91 1, or 1823-2478 or a 
representative fragment thereof; or a nucleotide sequence at least 95% identical to any of the 
nucleotide sequences of SEQ ID NO: 1-91 1, or 1 823-2478 in computer readable form, a 
skilled artisan can routinely access the sequence information for a variety of purposes. 

30 Computer software is publicly available which allows a skilled artisan to access sequence 
information provided in a computer readable medium. The examples which follow 
demonstrate how software which implements the BLAST (Altschul et al., J. Mol. Biol. 
215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. Chem. 17:203-207 (1993)) search 
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algorithms on a Sybase system is used to identify open reading frames (ORFs) within a 
nucleic acid sequence. Such ORFs may be protein-encoding fragments and may be useful in 
producing commercially important proteins such as enzymes used in fermentation reactions 
and in the production of commercially useful metabolites. 
5 As used herein, f, a computer-based system" refers to the hardware means, software 

means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware means of the computer-based systems of the 
present invention comprises a central processing unit (CPU), input means, output means, and 
data storage means. A skilled artisan can readily appreciate that any one of the currently 

10 available computer-based systems are suitable for use in the present invention. As stated 
above, the computer-based systems of the present invention comprise a data storage means 
having stored therein a nucleotide sequence of the present invention and the necessary 
hardware means and software means for supporting and implementing a search means. As 
used herein, "data storage means" refers to memory which can store nucleotide sequence 

1 5 information of the present invention, or a memory access means which can access 

manufactures having recorded thereon the nucleotide sequence information of the present 
invention. 

As used herein, "search means" refers to one or more programs which are 
implemented on the computer-based system to compare a target sequence or target structural 

20 motif with the sequence information stored within the data storage means. Search means are 
used to identify fragments or regions of a known sequence which match a particular target 
sequence or target motif. A variety of known algorithms are disclosed publicly and a variety 
of commercially available software for conducting search means are and can be used in the 
computer-based systems of the present invention. Examples of such software includes, but 

25 is not limited to, Smith-Waterman, MacPattern (EMBL), BLASTN and BLASTA 

(NPOLYPEPTIDEI A) . A skilled artisan can readily recognize that any one of the available 
algorithms or implementing software packages for conducting homology searches can be 
adapted for use in the present computer-based systems. As used herein, a "target sequence" 
can be any nucleic acid or amino acid sequence of six or more nucleotides or two or more 

30 amino acids. A skilled artisan can readily recognize that the longer a target sequence is, the 
less likely a target sequence will be present as a random occurrence in the database. The 
most preferred sequence length of a target sequence is from about 10 to 300 amino acids, 
more preferably from about 30 to 100 nucleotide residues. However, it is well recognized 
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that searches for commercially important fragments, such as sequence fragments involved in 
gene expression and protein processing, may be of shorter length. 

As used herein, M a target structural motif," or "target motif," refers to any rationally 
selected sequence or combination of sequences in which the sequence(s) are chosen based on 
a three-dimensional configuration which is formed upon the folding of the target motif. 
There are a variety of target motifs known in the art. Protein target motifs include, but are 
not limited to 5 enzyme active sites and signal sequences. Nucleic acid target motifs include, 
but are not limited to, promoter sequences, hairpin structures and inducible expression 
elements (protein binding sequences). 

4.15 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be used 
to control gene expression through triple helix formation or antisense DNA or RNA, both of 
which methods are based on the binding of a polynucleotide sequence to DNA or RNA. 
Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and 
are designed to be complementary to a region of the gene involved in transcription (triple 
helix-see Lee et al., Nucl. Acids Res. 6, 3073 (1979); Cooney et al., Science 15241, 456 
(1988); and Dervan et al., Science 251, 1360 (1991)) or to the mRNA itself (antisense- 
Olmno, J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of 
Gene Expression, CRC Press, Boca Raton, FL (1988)). Triple helix-formation optimally 
results in a shut-off of RNA transcription from DNA, while antisense RNA hybridization 
blocks translation of an mRNA molecule into polypeptide. Both techniques have been 
demonstrated to be effective in model systems. Information contained in the sequences of 
the present invention is necessary for the design of an antisense or triple helix 
oligonucleotide. 

4.16 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression 
of one of the ORFs of the present invention, or homolog thereof, in a test sample, using a 
nucleic acid probe or antibodies of the present invention, optionally conjugated or otherwise 
associated with a suitable label. 

In general, methods for detecting a polynucleotide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the 
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polynucleotide for a period sufficient to form the complex, and detecting the complex, so 
that if a complex is detected, a polynucleotide of the invention is detected in the sample. 
Such methods can also comprise contacting a sample under stringent hybridization 
conditions with nucleic acid primers that anneal to a polynucleotide of the invention under 
5 such conditions, and amplifying annealed polynucleotides, so that if a polynucleotide is 
amplified, a polynucleotide of the invention is detected in the sample. 

In general, methods for detecting a polypeptide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the 
polypeptide for a period sufficient to form the complex, and detecting the complex, so that if 

10 a complex is detected, a polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of the 
antibodies or one or more of the nucleic acid probes of the present invention and assaying 
for binding of the nucleic acid probes or antibodies to components within the test sample. 
Conditions for incubating a nucleic acid probe or antibody with a test sample vary. 

15 Incubation conditions depend on the format employed in the assay, the detection methods 
employed, and the type and nature of the nucleic acid probe or antibody used in the assay. 
One skilled in the art will recognize that any one of the commonly available hybridization, 
amplification or immunological assay formats can readily be adapted to employ the nucleic 
acid probes or antibodies of the present invention. Examples of such assays can be found in 

20 Chard, T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science 
Publishers, Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in 
Immunocytochemistry, Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol 3 
(1985); Tijssen, P., Practice and Theory of immunoassays: Laboratory Techniques in 
Biochemistry and Molecular Biology, Elsevier Science Publishers, Amsterdam, The 

25 Netherlands (1985). The test samples of the present invention include cells, protein or 
membrane extracts of cells, or biological fluids such as sputum, blood, serum, plasma, or 
urine. The test sample used in the above-described method will vary based on the assay 
format, nature of the detection method and the tissues, cells or extracts used as the sample to 
be assayed. Methods for preparing protein extracts or membrane extracts of cells are well 

30 known in the art and can be readily be adapted in order to obtain a sample which is 
compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 
necessary reagents to carry out the assays of the present invention. Specifically, the 
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invention provides a compartment kit to receive, in close confinement, one or more 
containers which comprises: (a) a first container comprising one of the probes or antibodies 
of the present invention; and (b) one or more other containers comprising one or more of the 
following: wash reagents, reagents capable of detecting presence of a bound probe or 
antibody. 

In detail, a compartment kit includes any kit in which reagents are contained in 
separate containers. Such containers include small glass containers, plastic containers or 
strips of plastic or paper. Such containers allows one to efficiently transfer reagents from 
one compartment to another compartment such that the samples and reagents are not 
cross-contaminated, and the agents or solutions of each container can be added in a 
quantitative fashion from one compartment to another. Such containers will include a 
container which will accept the test sample, a container which contains the antibodies used 
in the assay, containers which contain wash reagents (such as phosphate buffered saline, 
Tris-buffers, etc.), and containers which contain the reagents used to detect the bound 
antibody or probe. Types of detection reagents include labeled nucleic acid probes, labeled 
secondary antibodies, or in the alternative, if the primary antibody is labeled, the enzymatic, 
or antibody binding reagents which are capable of reacting with the labeled antibody. One 
skilled in the art will readily recognize that the disclosed probes and antibodies of the present 
invention can be readily incorporated into one of the established kit formats which are well 
known in the art. 

4.17 MEDICAL IMAGING 

The novel polypeptides and binding partners of the invention are useful in medical 
imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the 
invention is involved in the immune response, for imaging sites of inflammation or 
infection). See, e.g., Kunkel et al., U.S. Pat. NO. 5,413,778. Such methods involve 
chemical attachment of a labeling or imaging agent, administration of the labeled 
polypeptide to a subject in a pharmaceutically acceptable carrier, and imaging the labeled 
polypeptide in vivo at the target site. 



4.18 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present 
invention further provides methods of obtaining and identifying agents which bind to a 
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polypeptide encoded by an ORF corresponding to any of the nucleotide sequences set forth 
in SEQ ID NO: 1-91 1, or 1823-2478, or bind to a specific domain of the polypeptide 
encoded by the nucleic acid. In detail, said method comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by an ORF of the 
5 present invention, or nucleic acid of the invention; and 

(b) determining whether the agent binds to said protein or said nucleic acid. 
In general, therefore, such methods for identifying compounds that bind to a 

polynucleotide of the invention can comprise contacting a compound with a polynucleotide 
of the invention for a time sufficient to form a polynucleotide/compound complex, and 

10 detecting the complex, so that if a polynucleotide/compound complex is detected, a 
compound that binds to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind to 
a polypeptide of the invention can comprise contacting a compound with a polypeptide of 
the invention for a time sufficient to form a polypeptide/compound complex, and detecting 

1 5 the complex, so that if a polypeptide/compound complex is detected, a compound that binds 
to a polynucleotide of the invention is identified. 

Methods for identifying compounds that bind to a polypeptide of the invention can 
also comprise contacting a compound with a polypeptide of the invention in a cell for a time 
sufficient to form a polypeptide/compound complex, wherein the complex drives expression 

20 of a receptor gene sequence in the cell, and detecting the complex by detecting reporter gene 
sequence expression, so that if a polypeptide/compound complex is detected, a compound 
that binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate the 
activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to 

25 activity observed in the absence of the compound). Alternatively, compounds identified via 
such methods can include compounds which modulate the expression of a polynucleotide of 
the invention (that is, increase or decrease expression relative to expression levels observed 
in the absence of the compound). Compounds, such as compounds identified via the 
methods of the invention, can be tested using standard assays well known to those of skill in 

30 the art for their ability to modulate activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be 
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selected and screened at random or rationally selected or designed using protein modeling 
techniques. 

For random screening, agents such as peptides, carbohydrates, pharmaceutical agents 
and the like are selected at random and are assayed for their ability to bind to the protein 
5 encoded by the ORF of the present invention. Alternatively, agents may be rationally 

selected or designed. As used herein, an agent is said to be "rationally selected or designed" 
when the agent is chosen based on the configuration of the particular protein. For example, 
one skilled in the art can readily adapt currently available procedures to generate peptides, 
pharmaceutical agents and the like, capable of binding to a specific peptide sequence, in 

10 order to generate rationally designed antipeptide peptides, for example see Hurby et al., 
Application of Synthetic Peptides: Antisense Peptides," In Synthetic Peptides, A User's 
Guide, W.H. Freeman, NY (1992), pp. 289-307, and Kaspczak et al., Biochemistry 
28:9230-8 (1989), or pharmaceutical agents, or the like. 

In addition to the foregoing, one class of agents of the present invention, as broadly 

1 5 described, can be used to control gene expression through binding to one of the ORFs or 

EMFs of the present invention. As described above, such agents can be randomly screened 
or rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design 
sequence specific or element specific agents, modulating the expression of either a single 
ORF or multiple ORFs which rely on the same EMF for expression control. One class of 

20 DNA binding agents are agents which contain base residues which hybridize or form a triple 
helix formation by binding to DNA or RNA. Such agents can be based on the classic 
phosphodiester, ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric 
derivatives which have base attachment capacity. 

Agents suitable for use in these methods preferably contain 20 to 40 bases and are 

25 designed to be complementary to a region of the gene involved in transcription (triple helix - 
see Lee et al., Nucl. Acids Res. 6, 3073 (1979); Cooney et al., Science 241, 456 (1988); and 
Dervan et al., Science 251, 1360 (1991)) or to the mRNA itself (antisense-Okano, J. 
Neurochem. 56, 560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene 
Expression, CRC Press, Boca Raton, FL (1988)). Triple helix-formation optimally results in 

30 a shut-off of RNA transcription from DNA, while antisense RNA hybridization blocks 
translation of an mRNA molecule into polypeptide. Both techniques have been 
demonstrated to be effective in model systems. Information contained in the sequences of 
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the present invention is necessary for the design of an antisense or triple helix 
oligonucleotide and other DNA binding agents. 

Agents which bind to a protein encoded by one of the ORFs of the present invention 
can be used as a diagnostic agent. Agents which bind to a protein encoded by one of the 
5 ORFs of the present invention can be formulated using known techniques to generate a 
pharmaceutical composition. 

4.19 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic 
1 0 acid hybridization probes capable of hybridizing with naturally occurring nucleotide 

sequences. The hybridization probes of the subject invention may be derived from any of 
the nucleotide sequences SEQ ID NO: 1-911, or 1823-2478. Because the corresponding 
gene is only expressed in a limited number of tissues, a hybridization probe derived from 
any of the nucleotide sequences SEQ ID NO: 1-91 1, or 1823-2478 can be used as an 
15 indicator of the presence of RNA of cell type of such a tissue in a sample. 

Any suitable hybridization technique can be employed, such as, for example, in situ 
hybridization. PGR as described in US Patents Nos. 4,683,195 and 4,965,188 provides 
additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used 
in PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both. 
20 The probe will comprise a discrete nucleotide sequence for the detection of identical 

sequences or a degenerate pool of possible sequences for identification of closely related 
genomic sequences. 

Other means for producing specific hybridization probes for nucleic acids include the 
cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such 

25 vectors are known in the art and are commercially available and may be used to synthesize 
RNA probes in vitro by means of the addition of the appropriate RNA polymerase as T7 or 
SP6 RNA polymerase and the appropriate radioactively labeled nucleotides. The nucleotide 
sequences may be used to construct hybridization probes for mapping their respective 
genomic sequences. The nucleotide sequence provided herein may be mapped to a 

30 chromosome or specific regions of a chromosome using well-known genetic and/or 

chromosomal mapping techniques. These techniques include in situ hybridization, linkage 
analysis against known chromosomal markers, hybridization screening with libraries or 
flow-sorted chromosomal preparations specific to known chromosomes, and the like. The 
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technique of fluorescent in situ hybridization of chromosome spreads has been described, 
among other places, in Verma et al (1988) Human Chromosomes: A Manual of Basic 
Techniques, Pergamon Press, New York NY. 

Fluorescent in situ hybridization of chromosomal preparations and other physical 
5 chromosome mapping techniques may be correlated with additional genetic map data. 
Examples of genetic map data can be found in the 1994 Genome Issue of Science 
(265: 198 If). Correlation between the location of a nucleic acid on a physical chromosomal 
map and a specific disease (or predisposition to a specific disease) may help delimit the 
region of DNA associated with that genetic disease. The nucleotide sequences of the subject 
10 invention may be used to detect differences in gene sequences between normal, carrier or 
affected individuals. 

« 

4,20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared by, for 
example, directly synthesizing the oligonucleotide by chemical means, as is commonly 

1 5 practiced using an automated oligonucleotide synthesizer. 

Support bound oligonucleotides may be prepared by any of the methods known to those 
of skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy 
is to precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can 
be achieved using passive adsorption (Inouye & Hondo, (1990) J. Clin. Microbiol. 28(6), 1469- 

20 72); using UV light (Nagata et al, 1985; Dahlen et al, 1987; Morrissey & Collins, (1989) Mol. 
Cell Probes 3(2) 189-207) or by covalent binding of base modified DNA (Keller et al, 1988; 
1989); all references being specifically incorporated herein. 

Another strategy that may be employed is the use of the strong biotin-streptavidin 
interaction as a linker. For example, Broude et al (1994) Proc. Natl. Acad. Sci. USA 91(8), 

25 3072-6, describe the use of biotinylated probes, although these are duplex probes, that are 
immobilized on streptavidin-coated magnetic beads. Streptavidin-coated beads may be 
purchased from Dynal, Oslo. Of course, this same linking chemistry is applicable to coating 
any surface with streptavidin. Biotinylated probes may be purchased from various sources, 
such as, e.g., Operon Technologies (Alameda, CA). 

30 Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used. 

Nunc Laboratories have developed a method by which DNA can be covalently bound to the 
microwell surface termed Covalink NH. CovaLink NH is a polystyrene surface grafted with 
secondary amino groups (>NH) that serve as bridgeheads for further covalent coupling. 
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CovaLink Modules may be purchased from Nunc Laboratories. DNA molecules may be bound 
to CovaLink exclusively at the 5-end by a phosphoramidate bond, allowing immobilization of 
more than 1 pmol of DNA (Rasmussen et al t (1991) Anal. Biochem. 198(1) 138-42). 

Hie use of CovaLink NH strips for covalent binding of DNA molecules at the 5 ! -end 
5 has been described (Rasmussen et al., (1 99 1). In this technology, a phosphoramidate bond is 
employed (Chu et al., (1983) Nucleic Acids Res. 1 1(8) 6513-29). This is beneficial as 
immobilization using only a single covalent bond is preferred. The phosphoramidate bond joins 
the DNA to the CovaLink NH secondary amino groups that are positioned at the end of spacer 
arms covalently grafted onto the polystyrene surface through a 2 nm long spacer arm. To link 
10 an oligonucleotide to CovaLink NH via an phosphoramidate bond, the oligonucleotide terminus 
must have a 5 ! -end phosphate group. It is, perhaps, even possible for biotin to be covalently 
bound to CovaLink and then streptavidin used to bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/|ul) and 
denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1 M 1- 
1 5 methylimidazole, pH 7.0 (I-Melm.7), is then added to a final concentration of 10 mM 1-Melm 7 . 
A ss DNA solution is then dispensed into CovaLink NH strips (75 jal/well) standing on ice. 

Carbodiimide 0.2 M l-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), 
dissolved in 10 mM 1-Melm7, is made fresh and 25 pi added per well. The strips are incubated 
for 5 hours at 50°C. After incubation the strips are washed using, e.g., Nunc-Etnmuno Wash; 
20 first the wells are washed 3 times, then they are soaked with washing solution for 5 min., and 
finally they are washed 3 times (where in the washing solution is 0.4 N NaOH, 0.25% SDS 
heated to 50°C). 

It is contemplated that a further suitable method for use with the present invention is 
that described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated 

25 herein by reference. This method of preparing an oligonucleotide bound to a support involves 
attaching a nucleoside 3 '-reagent through the phosphate group by a covalent phosphodiester link 
to aliphatic hydroxyl groups carried by the support. The oligonucleotide is then synthesized on 
the supported nucleoside and protecting groups removed from the synthetic oligonucleotide 
chain under standard conditions that do not cleave the oligonucleotide from the support. 

30 Suitable reagents include nucleoside phosphoramidite and nucleoside hydrogen phosphorate. 

An on-chip strategy for the preparation of DNA probe for the preparation of DNA probe 
arrays may be employed. For example, addressable laser-activated photodeprotection may be 
employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described 
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by Fodor et al (1991) Science 251(4995), 767-73, incorporated herein by reference. Probes 
may also be immobilized on nylon supports as described by Van Ness et al (1991) Nucleic 
Acids Res., 19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1988) 
Anal. Biochem. 169(1), 104-8; all references being specifically incorporated herein. 
5 To link an oligonucleotide to a nylon support, as described by Van Ness et al (1991), 

requires activation of the nylon surface via alkylation and selective activation of the 5'-amine of 
oligonucleotides with cyanuric chloride. 

One particular way to prepare support bound oligonucleotides is to utilize the 
light-generated synthesis described by Pease et al, (1994) Proc. Natl. Acad. Sci., USA 91(1 1), 

1 0 5022-6, incorporated herein by reference). These authors used current photolithographic 
techniques to generate arrays of immobilized oligonucleotide probes (DNA chips). These 
methods, in which light is used to direct the synthesis of oligonucleotide probes in high-density, 
miniaturized arrays, utilize photolabile S'-protected N-acyl-deoxynucleoside phosphoramidites, 
surface linker chemistry and versatile combinatorial synthesis strategies. A matrix of 256 

1 5 spatially defined oligonucleotide probes may be generated in this manner. 

4.21 PREPARATION OF NUCLEIC ACID FRAGMENTS 

The nucleic acids may be obtained from any appropriate source, such as cDNAs, 
genomic DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC 
inserts, and RNA, including mRNA without any amplification steps. For example, Sambrook 
20 et al (1989) describes three protocols for the isolation of high molecular weight DNA from 
mammalian cells (p. 9.14-9.23). 

DNA fragments may be prepared as clones in M13, plasmid or lambda vectors and/or 
prepared directly from genomic DNA or cDNA by PGR or other amplification methods. 
Samples may be prepared or dispensed in multiwell plates. About 100-1000 ng of DNA 
25 samples may be prepared in 2-500 ml of final volume. 

The nucleic acids would then be fragmented by any of the methods known to those of 
skill in the art including, for example, using restriction enzymes as described at 9.24-9.28 of 
Sambrook et al (1989), shearing by ultrasound and NaOH treatment. 

Low pressure shearing is also appropriate, as described by Schriefer et al (1990) 
30 Nucleic Acids Res. 1 8(24), 7455-6, incorporated herein by reference). In this method, DNA 
samples are passed through a small French pressure cell at a variety of low to intermediate 
pressures. A lever device allows controlled application of low to intermediate pressures to the 
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cell. The results of these studies indicate that low-pressure shearing is a useful alternative to 
sonic and enzymatic DNA fragmentation methods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using the 
two base recognition endonuclease, CWJI, described by Fitzgerald et al (1992) Nucleic Acids 
5 Res. 20(14) 3753-62. These authors described an approach for the rapid fragmentation and 
fractionation of DNA into particular sizes that they contemplated to be suitable for shotgun 

cloning and sequencing. 

The restriction endonuclease CviJl normally cleaves the recognition sequence PuGCPy 
between the G and C to leave blunt ends. Atypical reaction conditions, which alter the 

1 0 specificity of this enzyme (Cvi JI**) 5 yield a quasi-random distribution of DNA fragments form 
the small molecule pUC19 (2688 base pairs). Fitzgerald et al (1992) quantitatively evaluated 
the randomness of this fragmentation strategy, using a Cv/JI** digest of pUC19 that was size 
fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z 
minus Ml 3 cloning vector. Sequence analysis of 76 clones showed that CWJI** restricts 

1 5 pyGCPy and PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated 
at a rate consistent with random fragmentation. 

As reported in the literature, advantages of this approach compared to sonication and 
agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 |tig instead of 
2-5 |Ltg); and fewer steps are involved (no preligation, end repair, chemical extraction, or 

20 agarose gel electrophoresis and elution are needed). 

Irrespective of the manner in which the nucleic acid fragments are obtained or prepared, 
it is important to denature the DNA to give single stranded pieces available for hybridization. 
This is achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is 
then cooled quickly to 2°C to prevent renaturation of the DNA fragments before they are 

25 contacted with the chip. Phosphate groups must also be removed from genomic DNA by 
methods known in the art. 

4.22 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon 
membrane. Spotting may be performed by using arrays of metal pins (the positions of which 
30 correspond to an array of wells in a microtiter plate) to repeated by transfer of about 20 nl of a 
DNA solution to a nylon membrane. By offset printing, a density of dots higher than the density 
of the wells is achieved. One to 25 dots may be accommodated in 1 mm , depending on the 
type of label used. By avoiding spotting in some preselected number of rows and columns, 
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separate subsets (subarrays) may be formed. Samples in one subarray may be the same genomic 
segment of DNA (or the same gene) from different individuals, or may be different, overlapped 
genomic clones. Each of the subarrays may represent replica spotting of the same samples. In 
one example, a selected gene segment may be amplified from 64 patients. For each patient, the 
5 amplified gene segment may be in one 96-well plate (all 96 wells containing the same sample). 
A plate for each of the 64 patients is prepared. By using a 96-pin device, all samples may be 
spotted on one 8 x 12 cm membrane. Subarrays may contain 64 samples, one from each patient. 
Where the 96 subarrays are identical, the dot span may be 1 mm 2 and there may be a 1 mm 
space between subarrays. 

1 0 Another approach is to use membranes or plates (available from NUNC, Naperville, 

Illinois) which may be partitioned by physical spacers e.g. a plastic grid molded over the 
membrane, the grid being similar to the sort of membrane applied to the bottom of multiwell 
plates, or hydrophobic strips. A fixed physical spacer is not preferred for imaging by exposure 
to flat phosphor-storage screens or x-ray films. 

1 5 The present invention is illustrated in the following examples. Upon consideration of 

the present disclosure, one of skill in the art will appreciate that many other embodiments and 
variations may be made in the scope of the present invention. Accordingly, it is intended that 
the broader aspects of the present invention not be limited to the disclosure of the following 
examples. The present invention is not to be limited in scope by the exemplified embodiments 

20 which are intended as illustrations of single aspects of the invention, and compositions and 
methods which are functionally equivalent are within the scope of the invention. Indeed, 
numerous modifications and variations in the practice of the invention are expected to occur to 
those skilled in the art upon consideration of the present preferred embodiments. Consequently, 
the only limitations which should be placed upon the scope of the invention are those which 

25 appear in the appended claims. 

All references cited within the body of the instant specification are hereby incorporated 
by reference in their entirety. 

5.0 EXAMPLES 

5.1 EXAMPLE 1 

30 Novel Nucleic Acid Sequences Obtained From Various Libraries 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared from 
various human tissues and in some cases isolated from a genomic library derived from human 
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chromosome using standard PCR, SBH sequence signature analysis and Sanger sequencing 
techniques. The inserts of the library were amplified with PCR using primers specific for the 
vector sequences which flank the inserts. Clones from cDNA libraries were spotted on nylon 
membrane filters and screened with oligonucleotide probes (e.g., 7-mers) to obtain signature 
5 sequences. The clones were clustered into groups of similar or identical sequences. 
Representative clones were selected for sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a typical 
Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye 
terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied 
10 Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. 

5.2 EXAMPLE 2 
Assemblage of Novel Nucleic Acids 

The contigs or nucleic acids of the present invention, designated as SEQ ID NO: 1 823- 
2478 were assembled using an EST sequence as a seed. Then a recursive algorithm was used to 

1 5 extend the seed EST into an extended assemblage, by pulling additional sequences from 
different databases (i.e., Hyseq's database containing EST sequences, dbEST, gb pri, and 
UniGene, and exons from public domain genomic sequences predicated by GenScan) that 
belong to this assemblage. The algorithm terminated when there were no additional sequences 
from the above databases that would extend the assemblage. Further, inclusion of component 

20 sequences into the assemblage was based on a BLASTN hit to the extending assemblage with 
BLAST score greater than 300 and percent identity greater than 95%. 

5.3 EXAMPLE 3 
Novel Nucleic Acids 

The novel nucleic acids of the present invention were assembled from sequences that 
25 were obtained from a cDNA library by methods described in Example 1 above, and in some 
cases sequences obtained from one or more public databases. The nucleic acids were 
assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend the 
seed EST into an extended assemblage, by pulling additional sequences from different 
databases (Hyseq's database containing EST sequences, dbEST, gb pri, and UniGene) that 
30 belong to this assemblage. The algorithm terminated when there was no additional sequences 
from the above databases that would extend the assemblage. Inclusion of component sequences 
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into the assemblage was based on a BLASTN hit to the extending assemblage with BLAST 
score greater than 300 and percent identity greater than 95%. 

Using PHRAP (Univ. of Washington) or CAP4 (Paracel), a full-length gene cDNA 
sequence and its corresponding protein sequence were generated from the assemblage. Any 
5 frame shifts and incorrect stop codons were corrected by hand editing. During editing, the 
sequences were checked using FASTY and/or BLAST against Genebank (i.e., dbEST, gb pri, 
UniGene, and Genpept) and the Geneseq (Derwent). Other computer programs which may 
have been used in the editing process were phredPhrap and Consed (University of Washington) 
and ed-ready, ed-ext and cg-zip-2 (Hyseq, Inc.). The full-length nucleotide and amino acid 
10 sequences, including splice variants resulting from these procedures are shown in the Sequence 
Listing as SEQ ID NO: 1-1822. 

Table 1 shows the various tissue sources of SEQ ID NO: 1-911. 

The homologs for polypeptides SEQ ID NO: 912-1822, that correspond to nucleotide 
sequences SEQ ID NO: 1-911 were obtained by a BLASTP version 2.0al 19MP-WashU 
15 searches against Genpept and Geneseq (Derwent) using BLAST algorithm. The results 

showing homologues for SEQ ID NO: 912-1822 from Genpept 127-129 are shown in Table 
2A. The results showing homologues for SEQ ID NO: 912-1822 from Genpept 131 are 
shown in Table 2B. 

Using eMatrix software package (Stanford University, Stanford, CA) (Wu et al., J. 

20 Comp. Biol., Vol. 6, 219-235 (1999), http://motif.stanford.edu/ematrix-search/ herein 
incorporated by reference), all the polypeptide sequences were examined to determine 
whether they had identifiable signature regions. Scoring matrices of the eMatrix software 
package are derived from the BLOCKS, PRINTS, PFAM, PRODOM, and DOMO 
databases. Table 3 shows the accession number of the homologous eMatrix signature found 

25 in the indicated polypeptide sequence, its description, and the results obtained which include 
accession number subtype; raw score; p-value; and the position of signature in amino acid 
sequence. The results showing homologous signatures for SEQ ID NO: 912-1822 from 
eMatrix version 1 .0 are shown in Table 3 A. The results showing homologous signatures for 
SEQ ID NO: 912-1822 from eMatrix version 2.0 are shown in Table 3B. 

30 Using the Pfam software program (Sonnhammer et al., Nucleic Acids Res., Vol. 

26(1) pp. 320-322 (1998) herein incorporated by reference) all the polypeptide sequences 
were examined for domains with homology to certain peptide domains. Table 4A shows the 
name of the Pfam model found, the description, the e-value and the Pfam score for the 
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identified model within the sequence using Pfam versions 7.0 and 7.2. Table 4B shows the 
name of the Pfam model found, the description, the e-value and the Pfam score for the 
identified model within the sequence using Pfam version 7.5. Further description of the 
Pfam models can be found at http://pfam.wustl.edu/ . 
5 The GeneAtlas™ software package (Molecular Simulations Inc. (MSI), San Diego, 

CA) was used to predict the three-dimensional structure models for the polypeptides 
encoded by SEQ ID NO: 1-91 1 (i.e. SEQ ID NO: 912-1822). Models were generated by (1) 
PSI-BLAST which is a multiple alignment sequence profile-based searching developed by 
Altschul et al, (Nucl. Acids. Res. 25, 3389-3408 (1997)), (2) High Throughput Modeling 

10 (HTM) (Molecular Simulations Inc. (MSI) San Diego, CA,) which is an automated sequence 
and structure searching procedure rhtt p://www.msi.com/) , and (3) SeqFold™ which is a fold 
recognition method described by Fischer and Eisenberg (J. Mol. Biol. 209, 779-791 (1998)). 
This analysis was carried out, in part, by comparing the polypeptides of the invention with 
the known NMR (nuclear magnetic resonance) and x-ray crystal three-dimensional structures 

1 5 as templates. Table 5 shows: "PDB ID" , the Protein DataBase (PDB) identifier given to 
template structure; "Chain ID", identifier of the subcomponent of the PDB template 
structure; "Compound Information", information of the PDB template structure and/or its 
subcomponents; "PDB Function Annotation" gives function of the PDB template as 
annotated by the PDB files rhttp :/www.rcsb.org/PDB/) ; start and end amino acid position of 

20 the protein sequence aligned; PSI-BLAST score, the verify score, the SeqFold score, and the 
Potential(s) of Mean Force (PMF). The verify score is produced by GeneAtlas™ software 
(MSI), is based on Dr. Eisenberg' s Profile-3D threading program developed in Dr. David 
Eisenberg's laboratory (US patent no. 5,436,850 and Luthy, Bowie, and Eisenberg, Nature, 
356:83-85 (1992)) and a publication by R. Sanchez and A. Sali, Proc. Natl. Acad. Sci. USA, 

25 95: 13 597-12502. The verify score produced by GeneAtlas normalizes the verify score for 
proteins with different lengths so that a unified cutoff can be used to select good models as 
follows: Verify score (normalized) = (raw score - 1/2 high score)/(l/2 high score) 

The PFM score, produced by GeneAtlas™ software (MSI), is a composite scoring 
function that depends in part on the compactness of the model, sequence identity in the 

30 alignment used to build the model, pairwise and surface mean force potentials (MFP). As 
given in table 5, a verify score between 0 to 1.0, with 1 being the best, represents a good 
model. Similarly, a PMF score between 0 to 1.0, with 1 being the best, represents a good 
model. A SeqFold™ score of more than 50 is considered significant. A good model may 
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also be determined by one of skill in the art based all the information in Table 5 taken in 
totality. 

Table 6 shows the position of the signal peptide in each of the polypeptides and the 
maximum score and mean score associated with that signal peptide using Neural Network 
5 SignalP VI .1 program (from Center for Biological Sequence Analysis, The Technical 
University of Denmark). The process for identifying prokaryotic and eukaryotic signal 
peptides and their cleavage sites are also disclosed by Henrik Nielson, Jacob Engelbrecht, 
Soren Brunak, and Gunnar von Heijne in the publication " Identification of prokaryotic and 
eukaryotic signal peptides and prediction of their cleavage sites" Protein Engineering, Vol. 
10 10, no. 1, pp. 1-6 (1997), incorporated herein by reference. A maximum S score and a mean 
S score, as described in the Nielson et al reference, was obtained for the polypeptide 
sequences. 

Table 7 correlates nucleotide sequences of the invention to a specific chromosomal 
location when assignable. 
15 Table 8 shows the number of transmembrane regions, their location(s), and TMPred 

score obtained, for each of the SEQ ID NO: 912-1822 that had a TMPred score of 500 or 
greater, using the TMpred program 

( http : //www, ch. embnet . org/soft ware/TMPRED form .html) . 

Table 9 is a correlation table of the novel polynucleotide sequences SEQ ID NO: 1- 

20 911, their corresponding polypeptide sequences SEQ ID NO : 9 1 2- 1 822, their corresponding 
priority contig nucleotide sequences SEQ ID NO: 1823-2478, their corresponding priority 
contig polypeptide sequences SEQ ID NO: 2479-3 134, and the US serial number of the 
priority application (all of which are herein incorporated in their entirety), in which the 
contig sequence was filed. 

25 Table 10 is a correlation table of the novel polynucleotide sequences SEQ ID NO: 1- 

911, the novel polypeptide sequences SEQ ID NO: 912-1822, and the US application serial 
number and corresponding SEQ ID NO in which the sequence was previously filed. 
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Tissue Origin 


Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 


^Mixture of 16 
tissues - mRNA 


Various Vendors 


CGdOll 


58 74 147 267 277 432 489 496 618 


^Mixture of 1 6 
tissues - mRNA 


Various Vendors 


CGd012 


3-4 6 9 19-20 41 56-58 62 71-72 74 78 
81 91-92 114 118-119 128 133 141 
147-149 153 155 165-167 176-177 185 
187 195 215 218 220 228-229 237 245 
249 257 259 267 271 277-280 302 3 10 
313 326 353 355 366 368-369 380 385 
391 394 398 405-406 409-410 417 
432 439-440 442 449 452 460 462 464 
471-472 475 480 489 496 498 505 524- 
528 535 546 555 581 585 594 610 618 
636 804 811 816 845 871 


^Mixture of 16 
tissues - mRNA 


Various Vendors 


CGd013 


55 71 78 92 165 228 271 276 279-280 
310 353 355 422 424 496 526 531 606 
618 908 


* Mixture of 16 
tissues - mRNA 


Various Vendors 


CGd015 


10 97 100 194-195 246 258-260 266 
270 282 311 355-357 398 408 423 428- 

A mm* ^** A mm* *W** J MmT mm* mm* mmm M A mm 

430 439 495 526 544 805 


* Mixture of 16 
tissues - mRNA 


Various Vendors 


CGd016 


10 14 20 51 71 75 83 116 139 149 168 
174-175 194-195 209 230 233 238-239 
248 257 266-267 279-280 289 31 1-312 
355 403 420-421 493 503 513 515 532 
543 689 707 909 


* Mixture of 16 
tissues - mRNA 


Various Vendors 


CGdOlO 


176 185 352 380 386 403 422 425-426 
517 537 606 


adrenal gland 


Clontech 


ADR002 


7 21 36 40 43 64 69 88 101 120 130 
147 157 162-163 169 184 187 195 203 
218 225 230 236 275 283 289 298-299 

-IK -m mWBamm mm mmm mmm ***** my mm. mm* mm* J^H* mmm* A m**m mm* A * mm ** mmmf A) mm* mm*. 

317 355 381-382 392 425 427 430 
446 449 460 471 477 503 510 524 531 

1— m-~ Mm A A*** mm A Jt mw im-mt **m m*~ r*\ m*\ /f~ mmm mm - *** A /*\ J?" A** (~\ f*\ /*\ 

536 540 544 578 622 627 648 668 729 
788-789 


adult bladder 


Invitrogen 


BLD001 


19 33 54 128 195 220 312 355 395 421 
423 446 464 503 543 618 829 


adult brain 


BioChain 


ABR012 


184 246 355 364 


adult brain 


BioChain 


ABR013 


102 226 355 423 432 


adult brain 


GIBCO 


AB3001 


3 87 97 100 236 269 304-306 355 377 
427 497 536 540 


adult brain 


GIBCO 


ABD003 


16 20-21 27 40 59 62 87 94 100 105 
117 120 128-129 134 140 157 184 191- 
192 195 201 234 238-240 245-246 250 
256 260 267-268 270 272 295 304-306 
316 335 351-353 355 362 369 377 384 

. m -mm m. — -* mmm, m -* mmm. m. mmm* mm* A mmm. mt-m. A A M~\ A XPrnf ■ A mm*> mmm 

399 412 415 419 423 430 440 457-460 
464 470 477 496 498 503 519 536 539- 
540 543 555 589 637 


adult brain 


Invitrogen 


ABR014 


*A m-m. tW\ -4 *—\ f-m Arm* ***** JT" *r*\ A mf Mmm A £*-* <•"> mt f~\ "t /"V t 91 £** A t~\ *fl 

120 199 226 246 248 312 319 355 401 
445 477 503 611 


adult brain 


Invitrogen 


ABR015 


20 27 42 49 100 135 157 186 195 199 
226 246 256 430 432 446 472 484 543 
868 


adult brain 


Invitrogen 


ABR016 


16 100 120 195 226 267 355 459 496 
526 


adult brain 


Clontech 


ABR001 


21 27 40 59 87 101 1 12 128 226 268 



WO 03/054152 PCT/US02/39555 



Table 1 
116 



Tissue Origin 


Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 








272 355 364 382 387 428-429 464 503 
517 543 619-620 644 838 847 852 


adult brain 


Clontech 


ABR006 


17 23 30 33 41 43 59 64 88 94 98 120- 
121 124 131 133 154 179 183-184 194 
199 207 214 220 226 231 238-240 250 
256 257 266 278 281 304 312 351 353 
355 365 376 382 392 401 412 415 419 
427 430 433 445 460 464 479 516-517 
519 535-537 540 542-543 546 548 555 
582 728 747-748 754 816 


adult brain 


Clontech 


ABR008 


3 17 23 26 29 31 36 40-4144 49 51 57 
59 62 78 83 87-88 90 100-102 109 115 
121 124 127-128 130 131 133-136 145- 
146 149 154 158 161-163 167 169-171 
182 184 186 195-196 202 207 211 214 
224 227 231 244-246 248 250-25 1 256 
261-262 267 276 281 285-286 296-297 
312 319 337 351 353 355 364-365 368 
372 376 380 383 385 390-391 394 401 
403 412 423 427 431 433 435-436 437 
444.445 447-448 452 457-458 460-461 
465-466 479-481 489 503 506 508 513 
517 526 535 545 577-578 585 592 598 
611 621 624-626 648 674-675 722 816 
833 837 839 847-848 852 857 865-866 
884 


adult brain 


Clontech 


ABR011 


26 285-286 431 


adult brain 


Invitrogen 


ABT004 


19-21 30 35 44 57 59 64 87 128 131 
140 144 157 161 187 198 202 226 230 
250 268 272 293 295 351 355 365 372 
387 395 415 424 427 428 442-443 446 
452 455 457-458 478 498 513 516 524 
540 543 816 831 


adult cervix 


BioChain 


CVX001 


10 22-23 36 41 43 62-63 77 100-101 
105-106 109 120 130 134 137 141 154 
160 162-163 178-180 183-184 187 192 
195 198 215 217 226 240 246 249 260 
262 267 272 285 297 304 310-311 316 
334-336 353 355 373-374 377 379-380 
385 395 400 408-409 412 417 425 
431 452 464 468 476-477 482 496-497 
503 508 512 516 540 546 578 592 626 
645 660-662 676 680 888 


adult colon 


Invitrogen 


CLN001 


19 21 57 109 128 130 131 148 168 230 
240 301 364 412 424 440 452 464 469 
543 664 


adult heart 


GIBCO 


AHR001 


3 19-20 22 26 41 45 59 62 67 94 97 99- 
101 106 109 120 124 126 128 134 138 
157 171 183 186-190 195 197-198 202 
205 21 1 215 226-227 238-240 248 249 
261-262 267-268 281 285 311 316 335 
353 373-374 379-380 399 412 419 423 
425 430 433 440 464 477 480 496 498 
503 506 543 546 553 566 570 580 


adult kidney 


GIBCO 


AKD001 


3 17 19-20 25 32 35-37 41 49 57 59 62 
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Tissue Origin 


Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 








64 87 94 98-102 105 109 120 128 134- 
137 145 153-154 157-158 160 176 179- 
180 182 184 186 192 195 198 215 226- 
227 234 238-240 246 248 256 257 262 
267 272 278 283 285 297 304 310-311 
316 326 353 355 362 364 372 375 377 
379 381-382 391 395 412 415 421-422 
423-424 430-432 434 437 440 444 452 
459 464 470 477 496 503 506 510 519 
527-528 540 546 551 566 578 585 606 
837 


adult kidney 


Invitrogen 


AKT002 


1 17 20 26 59 62 73 87 94 100-101 120 
133 135 154 160 177 187 195 208 226 
228 238-239 248 249 257 260 262-264 
267-268 276 293 299 353 355 359 362 
380 412 419 421 425 427 431 434 436 
439 452 462 464 498 503 508 515 519 
532 540 543 546 586 590 


adult liver 


Invitrogen 


ALV002 


1 3-4 14-16 19-20 32-33 40 48 54 63 
77 87 94 97 101 128-129 135-136 138- 
139 149 157 180 1 82 1 87 193 195-196 
203 226-227 246 248 257 262 277-278 
305-306 316 347 355-357 362 364 383 
391 412 423 427 430 446 477 496 503 
510 525 540 553 629 636 852 


adult liver 


Clontech 


ALV003 


14 24 34 40 94 160 195 227 257 278 
355-357 362 369 424 436 496 527-528 
750 901 


adult lung 


GIBCO 


ALG001 


20 36 40 100 105 120 128 130 154 157 
195 198 263-264 267 273 336 373-374 
420 437 446 464 477 503 543 


adult ovary 


Invitrogen 


AOV001 


1 13 15 17 19-22 36 41 49 51 57 59 62 
68 70-71 77 87-88 97 100-101 105-106 
109 112 116 120 125 128 130 134 144 
157-160 169 176-177 179 181-182 184 
187 191-193 195 198 203 211 213 215 
226 230 236-237 240 246 248 249 
261-264 267-268 271-272 279-280 
288-289 295 298-299 304 310 320 335 
355 362 364-365 376-377 380 383-386 
389 391 394-395 399 408 412 415 417 
422 424 434 436 440 446 452 456 459- 
460 477 479 496-497 503 506 510 518- 
519 535-536 540 543-544 546 555 591 
612-614 794 823 827 829-830 833-834 
860 


adult placenta 


Clontech 


APL001 


16 195 256 285 399 421 424 446 464 
477 792 


adult spleen 


Clontech 


SPLcOl 


17 22 36 46 88 91 93 97 103 1 12 135- 
136 142 147 162-163 187 195 256 263- 
264 268 281 303-306 311-312 353 376 
427 433 436 437 452 477 480 506 515 
518 535 537 548 


adult spleen 


GIBCO 


ASP001 


3 10 46 77 81 94 97 100 105 120 129 
134-136 160 166 186-187 195 226 240 
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Tissue Origin 


Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 








246 248 256 257 267 303-304 311 316 
355 377 382 389 391 423 427 430 432 
436 452 477 536 540 543 582 586 


adult testis 


GIBCO 


ATS001 


20-21 100-101 105-106 109 128 154 
157 160 169 191-193 195 198 213 248 
256 267 269 295 316 336 355 380 388 
415 452 460 477 496 508 518 540-541 


bone marrow 


GF 


BMD002 


10 17 20 25 36 41 43 45-46 58-59 70- 
71 75 81 86 88 100 106 109 115 120 
124 130 134 136 138 147 154-155 157 
162-163 166-167 182-184 187-190 193 
195-196 198 202 216-217 220 226 231 
233 240 248 256 258-259 267-268 272 
279-280 338-339 359 364 373-374 
376-377 380 382 389 391 398 403 412 
415 417 423 427 432 436 437 444 446 
452 460 464 477 480 490 493 497 503 
505 513 517-518 526-528 532 537 540 
543 546 548 551 581-582 597 613 626 
694 697 704 723 729 741-742 744 751 
754-755 768 775 815 822 855 859 874 
885 891 


bone marrow 


Clontech 


BMD001 


10 16 20 22 25 36 41 49 59 62 71 81 97 
99-100 120 128 130 134 136 143 157 
179 184 187-190 195 198 207 211 215- 
217 236 247 256 258-260 263-265 267 
269 279-280 282 285 298 304 312 353 
364 373-375 377-378 380 389 398 419 
421-422 423 425 430 432 434 452 
477 480 497 503 506 518-519 536-537 
544 546 551 556 566 569 571-572 602 
604 615 728 815 818 


bone marrow 


Clontech 


BMD004 


259 398 


bone marrow 


Clontech 


BMD007 


36 259 398 


bone marrow 


BMD008 


259 


398 


bone marrow 


null 


STM001 


91 136 166 195 256 362 377 412 423- 
425 477 503 537 540 


CGSP009 




255 




cultured 
preadipocytes 


Stratagene 


ADP001 


9 17 76 100 120 134 136 179 195 226 
246 248 263-264 267 275 281 295 323 
345-346 377 391 423-425 427 430-431 
464 477 503 506 531 543 546 548 592 
701 876 


endothelial cells 


Stratagene 


EDT001 


3-4 10 20 22-23 35-36 41 47 51 57 59 
62 64 87 97 99-102 105-106 108-110 
120 128 134-135 144 153 157 167 177 
179 182-183 186 191 195 204 214-215 
226 240 246 248 251 256 262 267-268 
270 272-273 275 282 285 295 304 353 
355 362 364-365 373-374 376-377 379 
382 385-386 388 391 394 403 412 419 
423-425 427 430-432 440 444-445 477 
497 503 506 508 546 548 566 608 


esophagus 


BioChain 


ESO002 


532 607 


fetal brain 


Invitrogen 


FBT002 


3 20 42 57 64 100 130 134-135 154 
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Tissue Origin 


Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 








157 162-163 169 214 226 250 262 
265 267 272 283 294 336 351 355 365 
407 412 423 431 440 444-445 460 470 
477 498-499 526 544 578 610 


fetal brain 


GIBCO 


HFB001 


4 20 22 26-31 36 42 49 59 64 77 85 87 
97 100-102 105 108-109 120 128 134 
157 179 184 186 191-192 195 198-199 
211 226 238-239 246 248 256 261-262 
267-268 285 295 300 310 335 353 355 
365 376-377 379-380 390 415 419 421 
423-424 430 446 448 460 477-478 
489 496 503 506 513 516 536 542 546 
595 793 817 


fetal brain 


Clontech 


FBRs03 


49 


fetal brain 


Clontech 


FBR001 


30 87 100 182 199 268 415 435 516 
578 


fetal brain 


Clontech 


FBR004 


36 214 224 229 376 394 445 460 5 13 
516 578 611 787 816 861 897 


fetal brain 


Clontech 


FBR006 


3 12 15 20-21 23 25 33-34 36 44 49 58- 
59 62 78 83 88 91 99-100 103 106 124 
128 130 131 134-136 149 154 161 169 
182 184 186-187 193 227 231 236 242 
245 256 261 281 285 292 301 311-312 
320 353 355 360-361 364-365 376 383 
385 401-403 412 422 423-424 427 433 
435 441 444 445 452 457-458 460 465 
471 478-480 486 496 502-503 506 513 
515 517 526 535 537 546 548 566 756- 
757 759-760 784-785 858-859 886 


fetal heart 


Invitrogen 


FHR001 


3 7 12 16 19-20 26 31 84 94 101-102 
106 109 124 135-136 138 142 158 183- 
184 193 195 227 229 236 241 249 265 
267 270 273 277 285 307 310 353 355 
376 380 382 385 387 394 396 408 412 
419 425-426 430-432 444-445 452 
464 477 498 503 506-509 513 526 543 
546 548 592 598 648 761-763 765-766 
776-778 859-860 893 


fetal kidney 


Invitrogen 


FKD007 


26 111 162-163 


fetal kidney 


Clontech 


FKD001 


46 186-187 195 267 335 421 452 


fetal kidney 


Clontech 


FKD002 


12 44 49 71 93 97 100 102 106 128 130 
136 154 162-163 179 183 186 195 227 
267 276 279-280 285 310 316 376 384 
387 389 464 477 480 496 498-499 510 
535 554 582 729 768 894 897 


fetal liver 


Clontech 


FLV002 


20 34 40 68 78 85 93 125 182 194-195 
207 257 259 266 278 297 356-357 383 
398 457-458 477 496 548 767 


fetal liver 


Clontech 


FLV004 


5 12 14 20 24 32-33 40 49 94 100 115 
128 130 135 157 159-160 169 176 179- 
180 195-196 203 224 226-227 229 246 
249 253 257 259 267 278 295 305-306 
355-357 385 398 424 427 437 459 464 
477 496 498 537 546 581 603 641 743 
768 
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Tissue Origin 


Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 


fetal liver 


Invitrogen 


FLV001 


20 32-33 36 85 94 128 144 149 154 
157 171 180 182 195 248 278 292 312 
316 355 362 423 430-431 436 441 472 
477 498 540 543 555 


fetal liver- 
spleen 


Columbia University 


FLS001 


2-3 10 12 14-17 19-24 26-27 32-36 39 
41 43 49 51 57 59 62 80 90-94 96-97 
99-100 105 107 109 120 128-130 134 
136 149 153 157-158 162-163 167 176- 
177 179-187 191 194-195 198 211 215 
226-228 230 234 237 240 243 245-246 
248 254-259 262 266-268 270 277- 
278 282-283 285 295 297 310-311 316 
320-326 330-331 335-336 338 353 
355-357 362 370-374 376 380 382 384 
391 394 398 412-413 415 419-421 423 
430 439-440 446 452 456 459-460 464 
472 477 479-480 489 496-499 503 506 
510 515 525-526 534-535 538 540 543- 
544 546 548 555 557-560 562-566 568 
573-574 576 578 591 593 600-601 732- 
735 809-812 814 820 827 


fetal liver- 
spleen 


Columbia University 


FLS002 


2-3 6 10 14-15 17-20 22-24 26 33 37 
41 48-49 56-57 62 69 77 81 85-86 90 
92 94 99-100 104 107-109 114 120 129 
134-135 144 149 157 168-169 174-175 
179-181 185-187 194 211 215 224 227 
230-231 233 237 240 245-246 257-259 
262 266-268 270 272 275 278 298 300 
304-306 311 313 316-317 320 326-336 
352-354 362 364-365 376-377 382-384 
398 411-412 415 419 421 425 430 433 
436 439 441 446 452 456 459-460 464 
477 479 491-492 496 499 503 505-506 
508 510 515 519 526 535 539 544 546 
566 575 578 591 593 601 654-657 659 
685 687-691 705 707 709 774 779-780 
809 812 820 833 837 854-856 872 896 


fetal liver- 
spleen 


Columbia University 


FLS003 


14 17 20 26 34 63 86 99 107 167 179 
187 195 21 1 220 227 231 248 257 275 
278 320-321 328 335 355-357 365 384 
394 411-412 416 423 425-427 430-431 
440 464 486 497 506 510 525 546 558 
566 648 732 735 738-740 746 779-781 
787 809 812 814 837 


fetal lung 


Clontech 


FLG001 


22 49 160 176 195 237 259 267 353 
398 421 430 436 456 815 


fetal lung 


Clontech 


FLG004 


12 41 506 


fetal lung 


Invitrogen 


FLG003 


15 64 115 128 135 142 151 154 186- 
187 193 195 220 236 253 276 355 421 
446 464 506 519 527-528 537 578 626 


fetal muscle 


Invitrogen 


FMS001 


20 94 100 128 138 171 184 195 226 
246 259 267 311 398 412 443 478 498 
531 585 610 


fetal muscle 


Invitrogen 


FMS002 


12 21 38 57 71 86 100-101 104 109 
116 129-130 138 179 195-196 204 253 
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Tissue Origin 


Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 








285 312-313 336 376 384-385 391 403 
423 427 430 446 452 473 498 507 517 
535 546 598 643 892 


fetal skin 


Invitrogen 


FSK001 


2 6 12 30-31 56 59 62-63 74 88 97 100 
109 115-118 128 134-135 138 141 151- 
152 154 160 168 178 181-182 185 187 
193 195 229 236 248 249 252 267 272- 
273 276 282 301 313-316 320 362 364- 
365 385 391 404-407 410 415 421 423 
427 430 446 452 460 471 473-474 477 
486-489 499 503 512 526-528 537 540 
543 548 551-552 594 598 635 681-682 
813 829 870-871 888 


fetal skin 


Invitrogen 


FSK002 


9 12 17 20 33 46 58 74 76-78 83 103 
118 124 128-129 135 142 162-163 167- 
169 177 193 195 203 229 231 244 249 
254 256 276 3 1 1 3 13-3 14 342 346 363 
372 376 382 391 393-394 410 421 423 
427 433 464 473 477-478 480-481 486- 
487 497-499 511-513 515 518 535 537- 
538 543 589 592 626 701 731 768-769 
771 781 895 898-899 


fetal spleen 


BioChain 


FSP001 


46 256 


induced neuron- 
cells 


Stratagene 


NTD001 


15 20 49 87 109 1 19-120 157 179 195 
208 226 234 238-239 267 282 340-344 
364 494-495 504 506 516 546 597 698- 
699 887 


infant brain 


Columbia University 


IB2003 


21-22 26 31 48 77 82 98-99 103 128 
131 134-135 154 179 198 21 1 214 230 
234 295 312 339 348-349 351 353 355 
362 365 391 415 423 431 445 452 460 
470 478 497 513 515-516 535 537 540 
543 566 578 639-640 726 816 833 849 


infant brain 


Columbia University 


IBS001 


21 59 77 98 131 154 224 244 371 513 
540 543 578 639 812 


infant brain 


Columbia University 


IBM002 


19 26 77 248 516 


infant brain 


Columbia University j 


IB2002 


19 21-22 26-27 31 36 57 62 77 93 97- 
98 120 128-129 131 134 140 149 154 
169 179 182 187 208 214 226 230 234 
237 256 267-268 273 281 284 295 336 
351 353 355 365 371 387 415 421 429 
431 435 445-446 452 460-461 478-479 
496 503 508 513 515 536-537 543 546 
555 578 605 613 639-640 816 826 849 
853 


leukocyte 


GIBCO 


LUC001 


3 20 23 35-36 39 59 62 64 70-71 87-88 
94 100-101 105-106 120 128-130 134- 
137 139 148 154-155 160 165 176 179 
184 186-190 192-193 195-196 198 211 
216-217 225-226 236 240 246 251 256 
257 260 262-264 267-268 270 279-280 
285 298 304 31 1 353 355 362 375 377 
380 412 415 417 423-426 432 437 
440 444 452 456 460 477 479 497 503 
505-506 508 519 540 543 546 565 582- 



t 
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Tissue Origin 


Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 








585 587-588 596 646 821 


leukocyte 


Clontech 


LUC003 


41 47 59 71 100 144 186-187 195 240 
263-264 267 279-280 298 385 432 437 
446 497 503 


lung tumor 


Invitrogen 

• 


LGT002 


20 35-37 41 43-44 47-49 54-55 59 62 
64 71 77 93-94 100 102 106 120 134 
136 139 154 157 176 186 194-195 198 
21 1-212 215 218 222 226 234 237-239 
246 249 256 257 260 262-264 267 276 
279-280 289 305-306 311 336 355 362 
364 380 382 385 388 394 396 417 419 
421 425 427 432-433 438 444 446 449 
477 483-485 496-497 503 506 508 527- 
528 540 543 589 606 611 616-617 651 
679 799 828 835 867-869 


lung, fibroblast 


Stratagene 


LFB001 


3-4 16 22 35-36 41 94 100 105 1 16 120 
128 134-135 177 191 195 235 256 267 
353 355 364 377 379 425 459 464 477 
497 503 703 813 829 


lymph node 


Clontech 


ALN001 


10 53 57 81 136 182 195 198 209 246 
263 267 353 355 389 432 503 506 


lymphocyte 


null 


DGD001 


267 31 1 373-374 412 430-432 460 464 
480 503 536 


lymphocytes 


ATCC 


LPC001 


35-36 41 57 77 99-100 105 115 120 
128 139 148 155 171 176 179 182 187- 
190 192 196 226 232-233 236-237 260 
263-264 267-268 270 379 417 427 450 
452 497-498 514 518 540 547-548 566 
583 587 619 693 851 


macrophage 


Invitrogen 


HMP001 


87 128 177 187 196-197 257 277 355 
362 385 412 415 420 423 446 464 497 
537 543 551 782-783 795 


mammary gland 


Invitrogen 


MMG001 


8 16 20 36 44 56-57 59-60 64 74 77 90 
94 96-97 100 109 120 128 130 134-136 
154 157 160 162-163 168 179 182 187 
193 195 198 21 1 219-220 226 240 248 
249 251 257 262 267-268 272-273 276 
281 287 300-301 316 355 362 364 372 
383 388 393-394 417 421 423 427 430- 
432 440 446 452 460 477-478 497 503 
506 516 527-528 537 540 543 555 585 
597 680 842-843 


melanoma 
from-cell-line- 
ATCC-#CRL 
Clontech 


MEL004 


3 20 26 41 
59 100 
120 130 
160 195 
223 237 
246 267 
269 275 
305-306 
376 409 
412 419 
423 437 
446 459 
464 477 





I 
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Tissue Origin 


Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID 1VOS: 






486 647 
669 862 




neuronal cells 


Stratagene 


NTU001 


21 49 51 57 59 83 96 120 134 179 19~> 
230-231 256 262 267 289 351 355 383 
412 423 427 497 503 527-528 727 875 


pituitary gland 


Clontech 


PIT004 


16 19 68 87 100 112 128 192 195 246 
262 268 275 355 372 394 41o 42 / 430- 
431 477 503 578 


placenta 


Invitrogen 


APL002 


16 36 57 77 141 187 230 248 262 295 
355 423 430-431 446 496 527-528 555 


placenta 


Clontech 


PLA003 


20 33 70 88 90 109 1 15 120 124 129 
141 181 193 237 245 249 285 297 316 
355 382 391 403 411 421 446 464 477 
486 497 513 546//Z-7/J 


prostate 


Clontech 


PRT001 


10 20 36 49 85 94 100 135 142 188- 
191 195 198 237 267 282 285 316 320 
353 379 394 421 423 425 437 4o0 5Uo 

con £Zi 1 <cd o 


rectum 


Invitrogen 


REC001 


35 77 120 128 136 153 159 168 182 
256 268 272 301 423 430 494 499 503 
505 537 543-544 552 578 664 


retmoic acid- 

induced- 

neuronal-cells 


Stratagene 


NTR001 


120 130 179 183 193 25o 3dj 4oU 4/o 
503 890 


saliva gland 


Clontech 


SALs03 


475 


salivary gland 


Clontech 


SAL001 


19 38 66 134-136 139 191 195 206 240 
246 256 311 355 362 J /o 3o2-3o3 39 j 

42 1 432 4jD 4oU 404 4 / j 4 / / 

497-498 503 527-528 666 790 826 


skeletal muscle 


Clontech 


SKM001 


17 41 100 195 197-198 211 215 236 
238-239 298 302 369 415 443 452 
475 496 503 


skin fibroblast 


ATCC 


SFB001 


49 464 


skin fibroblast 


ATCC 


SFB002 


49 246 267 477 


skin fibroblast 


ATCC 


SFB003 


49 


small intestine 


Clontech 


SIN001 


3 6 10 46 57 59 62 77 90 94 97 100 102 
106 109 112 115 120 123 128 142 162- 
163 168 177 179 183 186 188-190 195 
224 226 240 246 248 262 267-268 275 
289 302 311 316 324 353 355 364 372- 
376 380 382-384 394 408 412 423 425 
430 432 439 442 444 452 459 464 477- 
478 499 503 532 536 540 547-548 626- 
627 633 648 729 736-737 745-746 823 
828 844 


spinal cord 


Clontech 


SPC001 


3 17 27 34 41 57 64 87 100-101 128 
157 183 187 192 195 199 204 226 229 

OiA OAO. OA^ OAR ">£n 9^1 0^-9fiA 9^7 

304 308-309 311 355 369 382 384 388 
412 415 419 428-429 434 446 457-460 
478 496 503 519 648 652 


stomach 


Clontech 


STO001 


36 57 104 115 120 134-135 168 195 
267 311 330-331 364 414 423 425 452 
456 460 540 886 


thalamus 


Clontech 


THA002 


19 87 113 121 160 182 195 202 248 
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Tissue Origin 


Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 








249 262 272 283 289 355 365 424 427 
451 457-458 460 477 480 503 535-536 
543 634 841 


thymus 


Clontech 


THMc02 


3 12 20 36 58-60 70-72 90 93 97 101- 
102 109 114-115 130 135 147-148 160 
162-163 167 169 171 179 183-184 187 
195-196 217 237 249 259 276 279-280 
303 307 312 320 355 376 383-384 398 
402 419 422 432-433 436 437 444 464 
471 479-480 506 508 513 515 535 537 
550 589 641-642 671-672 787 830 858- 
859 864 


thymus 


Clontech 


THM001 


20 53 59 71 100-101 105-106 110 114 
128 130 134 154 157 179 188-191 195 
215 224 230 240 248 259 267 279 335 
380 382 385 391 398 408 412 427 432 
437 452 459 476 497 543 546 61 1 833 
900 


thyroid gland 


Clontech 


THR001 


3 16 19-20 23 33 35 41 43-44 48-49 57 
59 87 94 100-101 120 133-135 147 149 
157 162-163 179 187 191 193 195 198 
206 209 215 217 229 246-247 256 261- 
262 267 272 276 283 287 297 304-306 
309 312 316 335 353 355 362 364 371- 
374 380 385 394 399 412 419 421 431 
442 444 452-453 460 462 475 477 479 
496-497 503 506 519-520 536 543 546 
548 596 649 653 704 731 812 845 851 


trachea 


Clontech 


TRC001 


100-101 120 130 184 195 267 300 355 
380 396-397 399 415 421 423 464 475 
503 527-528 546 709 725 


umbilical cord 


BioChain 


FUC001 


7 26 36 40 43 46 50-51 57 68 71 77-78 
96 100 102 109 115 120 128 142 158- 
159 172-173 179 184 186 191 195 198 
215 226 228 230 236-237 240 246 250 
267 273 275 279-280 283 287 300 304 
335 351 355 364-365 381 388 394 399 
408 425 430 440 452 459 464 475 
477 503 536-537 540 543 546 576 589 
630 635 669 682 684 904 


uterus 


Clontech 


UTR001 


16 22 64 77 87 100 128 134 195 226 
249 335 366 412 425 427 452 454 464 
546 55 1 


young liver 


GIBCO 


ALV001 


5 14 20 26 32-33 57 71 87 94 97 100 
102 105 120 128 134 154 157 179 192 
194-195 227 257 266 274 278 326 355- 
357 437 446 498 503 516 532 846 



*The 16 tissue/mRNAs and their vendor sources are as follows: 1) Normal adult brain mRNA (Invitrogen), 2) 
Normal adult kidney mRNA (Invitrogen), 3) Normal fetal brain mRNA (Invitrogen), 4) Normal adult liver 
mRNA (Invitrogen), 5) Normal fetal kidney mRNA (Invitrogen), 6) Normal fetal liver mRNA (Invitrogen), 7) 
normal fetal skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontech), 9) Human bone marrow 
mRNA (Clontech), 10) Human leukemia lymphoblastic mRNA (Clontech), 1 1) Human thymus mRNA 
(Clontech), 12) human lymph node mRNA (Clontech), 13) human so\spinal cord mRNA (Clontech), 14) 
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human thyroid mRNA (Clontech), 15) human esophagus mRNA (BioChain), 16) human 

conceptional umbilical cord mRNA (BioChain). 
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SEQ ID 


Hit ID 


Species 


Description 


S score 


% Identity 


912 


gil2803215 


Homo sapiens 


glutathione S-transferase theta 2, 
clone MGC:744 IMAGE:3164017, 
mRNA, complete cds. 


1263 


100 


912 


gi601918 


Homo sapiens 


glutathione S-transferase theta 2 
(GSTT2) mRNA, complete cds. 


1263 


100 


912 


gi9937244 


Homo sapiens 


glutathione S-transferase theta 2 
(GSTT2) and glutathione S- 
transferase theta 1 (GSTT1) genes, 
complete cds. 


1259 


99 


913 


gil3872813 


Homo sapiens 


partial mRNA for fibulin-6 (FIBL- 
6 gene). 


4548 


93 


913 


gil4575679 


Homo sapiens 


hemicentin mRNA, complete cds. 


4542 


93 


913 


gil4041957 


Homo sapiens 


cDNA FLJ14438 fis, clone 
HEMBB 10003 17, wealdy similar 
to FIBULIN-1, ISOFORM D 
PRECURSOR. 


2795 


85 


914 


gi 13543645 


Homo sapiens 


D-dopachrome tautomerase, clone 
MGC: 14637 IMAGE :40 82777, 
mRNA, complete cds. 


450 


94 


914 


gil5930140 


Homo sapiens 


Similar to D-dopachrome 
tautomerase, clone MGC:9158 
IMAGE:3902943, mRNA, 
complete cds. 


450 


94 


914 


gi2352915 


Homo sapiens 


D-dopachrome tautomerase (DDT) 
gene, exon 3 and complete cds. 


450 


94 


915 


gil3543645 


Homo sapiens 


D-dopachrome tautomerase, clone 
MGC: 14637 IMAGE :4082777, 
mRNA, complete cds. 


537 


86 


915 


gil5930140 


Homo sapiens 


Similar to D-dopachrome 
tautomerase, clone MGC:9158 
IMAGE:3902943, mRNA, 
complete cds. 


537 


86 


915 


gi2352915 


Homo sapiens 


D-dopachrome tautomerase (DDT) 
gene, exon 3 and complete cds. 


537 


86 


916 


gi4 190954 


Homo sapiens 


gene for hepatocyte growth factor 
activator, complete cds. 


2089 


100 


916 


gi219681 


Homo sapiens 


Human mRNA for hepatocyte 
growth factor (HGF) activator 
precursor, complete cds. 


2089 


100 


916 


AAR89197 


Homo sapiens 

> 


Human hepatocellular growth 
factor single chain precursor 
protein. 


2089 


100 


917 


gi5441937 


T T * 

Homo sapiens 


BAG clone CTB-15P3 from 7q22- 
q31.2, complete sequence. 


8964 


100 


917 


AAY15457 


Homo sapiens 


Human laminin beta 4 protein. 


6164 


96 


917 


AAY15459 


Homo sapiens 


SEQ ID 5 of W09919347. 


6046 


97 


918 


gi 10998440 


Mus musculus 


EGF-related protein SCUBE1 


2116 


52 


918 


gi8052320 


Mus musculus 


Cegpl protein 


1124 


79 


918 


AAY07735 


Homo sapiens 


Human breast-specific BS200 
protein. 


1125 


79 


919 


gil3543621 


Homo sapiens 


parathyroid hormone-like 
hormone, clone MGC: 1461 1 
IMAGE:4050706, mRNA, 
complete cds. 


905 


100 


919 


gil90712 


Homo sapiens 


Human parathyroid hormone-like 
protein (PLP) gene, exon 4, clones 
lambda-P LP g( 1,3,7-2). 


905 


100 
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919 


gil90713 


Homo sapiens 


Human parathyroid hormone-like 
protein (PLP) gene, exon 5, clones 
lambda~PLPg( 1,3,7-2). 


905 


100 

- 


920 


gi3449294 


Rattus 
norvegicus 


MEGF6 


2229 


78 


920 


AAY72715 


Homo sapiens 


HFICU08 clone human attractin- 
like protein. 


554 


43 


/-v /"V /A 

920 


AAG75479 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:6243. 


518 


43 


921 


gi 12966 10 


Homo sapiens 


H. sapiens mRNA for chemokine 
CC-2 and CC-1. 


346 


100 


921 


gi 1004267 


Homo sapiens 


H.sapiens gene for chemokine 
HCC-1. 


346 


100 


921 


gil004269 


Homo sapiens 


H.sapiens mRNA for chemolcine 
HCC-1. 


346 


100 


922 


gi35330 


Homo sapiens 


H.sapiens mRNA for 
procarboxypeptidase Al . 


1126 


61 


922 


AAR97618 


Homo sapiens 


Human carboxypeptidase Al. 


1126 


61 


922 


AAY28915 


Homo sapiens 


Human regulatory protein HRGP- 
1. 


1126 


61 


923 


gi790817 


Homo sapiens 


Human microfibril-associated 
glycoprotein 4 (MFAP4) mRNA, 
3 end of cds. 


1198 


99 


923 


AAY42563 


Homo sapiens 


Human microfibril-associated 
glycoprotein 4 splice variant 
(MAG4V). 


1197 


100 


923 


AAY85177 


Homo sapiens 


Microfibril associated glycoprotein 
4 splice variant protem sequence. 


1197 


100 


924 


AAY06940 


Homo sapiens 


Human secretory protein ZSIG-11. 


1648 


100 


924 


AAY66650 


Homo sapiens 


Membrane-bound protein PR0536. 


1648 


100 


924 


AAY50944 


Homo sapiens 


Human adult heart cDNA clone 
vf 1 1 derived protein. 


1648 


100 


925 


gil3528981 


Homo sapiens 


apolipoprotein A-II, clone 
MGC: 12334 IMAGE :3 934476, 
mRNA, complete cds. 


438 


90 


1 925 


gi2S748 


Homo sapiens 


Human mRNA for apolipoprotein 

A TT 

All precursor. 


438 


90 


925 


gi296633 


Homo sapiens 


Human DNA for apolipoprotein A- 
II. 


438 


90 


926 


AAY76156 


Homo sapiens 


Human secreted protein encoded 
by gene 33. 


419 


90 


926 


AAE04850 


Homo sapiens 


Human SGP014 phosphatase 
polypeptide related exon 4. 


63 


33 


926 


gi599904 


Albinaria turrita 


ATPase subunit 8 


40 


39 


927 


gi 13097252 


Homo sapiens 


Similar to FK506 binding protein 
2 (13 kDa), clone MGC:5177 
IMAGE:3445148, mRNA, 
complete cds. 


640 


91 


1 Q97 


gloo/J/U 


Homo sapiens 


Human rapamycin- and FK506- 
binding protein, complete cds. 


640 


91 


927 


AAQ31004_ 
aal 


Homo sapiens 


hRFKBP cDNA 


636 


90 


928 


AAB53360 


Homo sapiens 


Human colon cancer antigen 
protein sequence SEQ ID NO:900. 


230 


95 


928 


AAG73789 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:4553. 


230 


95 



it 
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928 


AAY86330 


Homo sapiens 


Human secreted protein 
HKGC027, SEQ ID NO: 245. 


60 


66 


929 


gil91613 


Mus musculus 


acetylcholine receptor 


55 


43 


929 


AAY07766 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 23. 


55 


40 


930 


gil5157181 


Agrobacterium 
tumefaciens 


AGR_C_3718p 


1148 


50 


930 


gi 14022240 


Mesorhizobium 
loti 


probable D- lactate dehydrogenase 


1106 


47 

1 


yjyj 


gli DU /j loU 


Sinorhizobium 
meiiioti 


PUTATIVE D-LACTATE 

T"\T"?T T\7""P\T1 A/"T?\T A OP 

DEHYDROGENASE 
(CYTOCHROME) PROTEIN 


1097 


1 48 1 


y j i 


J\J\Vv 0*f O 


Homo sapiens 


Human epidermoid carcinoma cell 
line KB clone HP 10179 protein. 


231 


100 


SO J. 


AAW /ol40 


Homo sapiens 


Human secreted protein encoded 
by gene 21 clone HWTAZ75. 


231 


100 


1 

S yjj i 


1 gl/ 1UO / / o 


Homo sapiens 


HoPC194 


1 151 


75 


932 


AAY24793 


Homo sapiens 


Human secreted protein yc2 1 . 


1107 


99 


932 


gi3858883 


Acanthamoeba 
castellann 


myosin I heavy chain kinase 


143 


30 


932 


gil3751817 


Leishmania 
major 


C2 domain protein 


126 


32 




gil5028816 


T T ' 

Homo sapiens 


mRNA for beta- 1,3- 
galactosyltransferase b3Gal-T8. 


172 


28 


y55 


gi8927l64 


Mus musculus 


corel UDP-galactose:N- 
acetylgalactosamine-alpha-R beta 
1 , 3-galactosyl transferase 


173 


28 


yoo 


giis9271o6 


Rattus 
norvegicus 


corel UDP-galactose:N- 
acetylgalactosamine-alpha-R beta 
1,3-galactosyltransferase 


173 


26 


934 


AAY73383 


Homo sapiens 


HTRM clone 2280456 protein 
sequence. j 


1571 


100 




gll 5929 192 


TT. • 

Homo sapiens 


clone MGQ9522 
IMAGE: 3909690, mRNA, 
complete cds. | 


1487 


99 




AALr/oZoz 


TT „ 

Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:6046. 


619 


99 1 


935 


gi 15705411 


Homo sapiens 


peptidoglycan recognition protein 
L precursor (PGLYRP) mRNA, 
complete cds. 


3041 


99 

j 


935 


AAY72664 


Homo sapiens 


Murine peptidoglycan recognition 
protein-related liver protein. ' 


1484 


76 


935 


gi665 123 9 


Mus musculus 


TAGL-alpha 


1471 


76 


936 


AAB24450 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 14 SEQ ID 
NO:75. 


361 


97 


936 


AAB49409 


Homo sapiens 


hCRFl-TMS construct peptide. 


39 


69 \ 


936 


gil2619689 


Conus 
ventricosus 


conotoxin scaffold VI/VII \ 
precursor | 


58 


21 


1 9^7 




Homo sapiens 


tyrosine 3- 1 
monooxygenase/tryptophan 5- 
monooxygenase activation protein, 
eta polypeptide, clone MGC:675 
IMAGE:3543571 3 mRNA, 
complete cds. 


1094 


94 


937 


gil711232 


Homo sapiens 


Human DNA for 14-3-3 protein eta 
chain, exon2 and complete cds. 


1094 


94 
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937 


gi531590 


Homo sapiens 


H. sapiens gene for 14-3-3 protein. 


1094 


94 


938 


gil2804681 


Homo sapiens 


SI 00 calcium-binding protein, 
beta (neural), clone MGC: 1323 
IMAGE:3543825, mRNA, 
complete cds. 


479 


100 


938 


gi337730 


Homo sapiens 


Human SI 00 protein beta-subunit 
gene, exon 3. 


479 


100 


938 


gil61 18441 


Oryctolagus 
cuniculus 


S-100 calcium-binding protein beta 
subunit 


479 


100 


939 


AAW75082 


Homo sapiens 


Human secreted protein encoded 
by gene 26 clone HTLEV12. 


392 


92 


939 


AAG00461 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4542. 


72 


50 


939 


AAY86322 


Homo sapiens 


Human secreted protein 
HTLFG05, SEQ ID NO:237. 


72 


50 


941 


gi9280025 


Macaca 
fascicularis 


Nogo receptor 


804 


49 


941 


gil5O8O0O5 


Homo sapiens 


nogo receptor, clone MGC: 19831 
IMAGE:4040540, mRNA, 
complete cds. 


792 


49 


941 


gil 2407653 


Homo sapiens 


Nogo receptor mRNA, complete 
cds. 


792 


49 


942 


AAB25674 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 10 SEQ ID 
NO:63. 


768 


98 


942 


gi!4603247 


Homo sapiens 


Similar to RIKEN cDNA 
5730409G15 gene, clone 
MGC: 19636 IMAGE: 2 8223 23, 
mRN A, complete cds. 


238 


90 


942 


AAB36613 


Homo sapiens 


Human FLEXHT-35 protein 
sequence SEQ ID NO:35. 


238 


90 


943 


AAY85678 


Homo sapiens 


Human kidney disease associated 
protein SEQ ID 10. 


751 


98 


943 


gi3127193 


Rattus 
norvegicus 


kidney-specific protein j 


686 


75 


943 


gi50 19275 


Bos taurus 


xenobiotic/medium-chain fatty 
acid: CoA ligase form XL-III I 


474 


54 


944 


gil 197499 


Homo sapiens 


H.sapiens gene for CI inhibitor ! 
exon 2 (and joined CDS). 1 


2527 


100 


944 


gi29535 


Homo sapiens 


Human gene for Cl-inhibitor. 


2527 


100 


944 


gil 5029894 


Homo sapiens 


serine (or cysteine) proteinase | 
inhibitor, clade G (CI inhibitor), 
member 1, clone MGC: 17091 ( 
IMAGE:4150091,mRNA, 
complete cds. | 


2524 


99 


945 


gil5157854 


Agrobacterium 
turnefaciens 


AGR_C_4799p 


218 


38 


945 


gil5076099 


Sinorhizobium 
meliloti j 


HYPOTHETICAL PROTEIN 


233 


36 




gr995l I2l 


Pseudomonas 
aeruginosa 


ribosomal protein LI 1 T 
methyltransferase I 


116 


38 


946 


AAB03948 


Homo sapiens 


Human mesenchymal stem cell 
polypeptide. | 


462 


98 


946 


AAB64909 


Homo sapiens 


Human secreted protein sequence { 
encoded by gene 28 SEQ ID 
NO:87. 


166 


63 


946 


gil53l983 


Homo sapiens 


H.sapiens mRNA for CO- | 


77 


31 



WO 03/054152 PCT/US02/39555 

Table 2A 
130 





wit m 

XJ.Il XJL/ 
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chemokine, eotaxin variant (clone 

5^ 






947 


gil3958036 


1 Homo sapiens 


FYVE-finger protein EIP1 mRNA, 
complete cds. 


2917 


97 


947 


AAY29861 


1 Homo sapiens 


Human secreted protein clone 
cb98 4. 


2910 


96 


947 


cn1 1 558044 
gll IJJOU'tt' 


j ivius museums 


FYVE-finger containing protein 


2752 


91 


948 


AAW75000 


| Homo sapiens 


Human secreted protein encoded 
by gene 146 clone HSNAK17. 


49 


29 


948 


gi 10 176829 


f Arabidopsis 
thaliana 


gene_id:MBBl8.l6~ 


79 


32 




gn jz /y/oo 


, Homo sapiens 


clone MGC: 10946 
IMAGE: 3 63 1700, mRNA, 
complete cds. 


466 


100 


949 


AAG81435 


Homo sapiens 


Human AFP protein sequence SEQ 
iu NQ:388. 


466 


100 


949 




riomo sapiens 


Human gene 7 encoded secreted 
protein HIN I JJl^Zl, brsl^ ID 
NO:58, 


466 


100 


950 


A AY16787 




riuman secretea protein (^clone 
dy41 2). 


557 


ioo s 


950 




I Que c r»**/"vfo 

oiib ceroid 


lectin- HKe oxiaizea LUL receptor- 
1 


r-\ o /- 
235 


29 


950 


AAW52837 


Homo sapiens 


Human C-type lectin MCTL. 


206 


34 




A AY41 790 


riomo sapiens 


Human FK0792 protein sequence. 


1140 


87 


951 


AAB44276 


Homo sapiens 


Human PRQ792 (UNQ431) 
protein sequence SEQ ID NO:231. 


1140 


87 


951 


AAB24055 


Homo sapiens 

■ 


Human PR0792 protein sequence 
SEQ ID NO:31. 


1140 


87 


Q59 


rr«7£7n7zl£ I 


riomo sapiens 


T TTATl 1 1 i 

UDF-glucose:glycoprotein 
glucosyltransferase 1 precursor, 
mRNA, complete cds. 


7968 


98 




gll JZ /DOZl 


synthetic 
construct 


Rat RUGT 


7313 


91 




gl /o / / 1 /o 


Kattus 

• 

norvegicus 


UDP-glucose 

glycoprotein:glucosyltransferase 
precursor 


7313 


91 






womo sapiens 


Amino acid sequence of human 
intracellular signalling molecule 

TXTTl? AT) 


409 


51 i 


953 


AAG00975 


Homo sapiens 


Human secreted protein, SEQ ID 


212 


46 


953 


gi2623224 


Mus mus cuius 


FK-506 binding protein homolog 


118 


30 


954 


gll / / OUOi 


o accnaromyces 

» ♦ 

Lcrevibide 


transcription/repair tactor TFIIH 
suuunit iidj 


83 


31 


954 


gi927727 


Saccharomyces 
cerevisiae 


Tfb3p: TFIIH subunit Tfb3; 

Y JJK40U W 


83 


31 


954 


gi3372804 


Rattus 
norveeicus 


focal adhesion kinase-related 

nrntpin 

JJl ULulJl 


65 


25 


955 


AAG81399 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:316. 


1910 


100 


955 


AAB61421 


Homo sapiens 


Human TANGO 300 protein. 


1904 


99 


955 


AAB23618 


Homo sapiens 


Human secreted protein SEQ ID 
NO: 36. 


1902 


99 


956 


gi296532 


Homo sapiens 


H.sapiens mRNA for I beta 1-6 N- 
acetylglucosaminyltransferase. 


1043 


66 
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956 


gil315909 


Homo sapiens 


Human beta- 1 , 6-N- 
acetylglucosaminyltransferase 
(IGnT) gene, exon 3, complete cds. 


1043 


66 


956 


gi307298 


Homo sapiens 


Human I beta 1-6 N- 
acetylglucosaminyltransferase 
mRNA, complete cds. 


1043 


66 


957 


gi20651 65 


Homo sapiens 


H. sapiens mRNA for extracellular 
matrix protein collagen type XIV, 
N-terminus. 


776 


93 


957 


AAG00322 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4403. 


482 


98 


957 


gi28S875 


Gallus gallus 


collagen XIV 


467 


68 


958 


gil5530201 


Homo sapiens 


Similar to matrix 
metalloproteinase 1 (interstitial 
collagenase), clone MGC: 10479 
IMAGE:3834572, mRNA, 
complete cds. 


2381 


93 


958 


gi30126 


Homo sapiens 


H.sapiens mRNA for type I 
interstitial collagenase. 


2381 


93 


958 


gil688258 


Homo sapiens 


Human collagenase and 
stromelysin genes, complete cds, 
and metalloelastase gene, partial 
cds. 


2381 


93 


959 


AAY25868 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 57. 


/-v ^ 

993 


99 


959 


gi4008584 


Rattus 
norvegicus 


huntingtin 


61 


66 


959 


gi6006615 


Saimiri sciureus 


Alanine Glyoxylate 
Aminotransferase 


71 


31 


960 


AAB58288 


Homo sapiens 


Lung cancer associated 
polypeptide sequence SEQ ID 626. 


474 


85 


960 


gil3378304 


Homo sapiens 


dynein-associated protein HKM23 
(km23) mRNA, complete cds. 


473 


98 


960 


gil2803327 


Homo sapiens 


HSPC 162 protein, clone 
MGC:773 IMAGE: 3347555, 
mRNA, complete cds. 


473 


98 


961 


gi930078 


Homo sapiens 


Human Koxl6 mRNA for zinc 
finger protem, partial. 


56 


34 


961 


AAE01364 


Homo sapiens 


Human gene 13 encoded secreted 
protein HDPIW06, SEQ ID 
NO:86. 


54 


43 


961 


AAE01398 


Homo sapiens 


Human gene 13 encoded secreted 
protein HDPIW06, SEQ ID 
NO: 120. 


54 


43 


962 


gi9956936 


Mus musculus 


Su(var)3-9 homolog Suv39h2 


1313 


84 


962 


gi 10440094 


Homo sapiens ! 


cDNA: FLJ23414 fis, clone 
HEP20704. 


1236 


100 


962 


gi 13623277 


Homo sapiens 


suppressor of variegation 3-9 
(Drosophila) homolog 1, clone 
MGC:10376 IMAGE: 3 945632, 
mRNA, complete cds. 


821 


55 


963 


gi 12654023 


Homo sapiens 


NADH dehydrogenase 
(ubiquinone) 1 alpha subcomplex, 
5 (13kD, B13), clone MGC:5042 
IMAGE:3453236, mRNA, 
complete cds. 


432 


90 


963 


gi 1373 173 


Homo sapiens 


NADH: ubiquinone oxidoreductase 


432 


90 
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subunit B13 (B13) mRNA, 
complete cds. 






963 


gil698653 


Homo sapiens 


Human NADH:ubiquinone 
oxidoreductase subunit B 13 
mRNA, complete cds. 


432 


90 


964 


gi37347 


Homo sapiens 


Human mRNA for T-cell 
rearranging gamma gene (TRG) 
V(g)8-J(g)2-C(g)2. 


1645 


94 


964 


gi339407 


Homo sapiens 


Human T-cell receptor Ti 

1 1 * T"» VTA 

rearranged gamma-cham mRNA 
V-J-C region, complete cds. 


1617 


93 


964 


gi37018 


Homo sapiens 


Human mRNA for T-cell receptor 
gamma-cham. 


1439 


87 


965 


AAE06606 


Homo sapiens 


Human protein having 
hydrophobic domain, HP 10794. 


566 


yo 


965 


gi6996628 


Tnticum 
aestivum 


1 i -i « i „ „ 

phenylalanine ammonia lyase 


72 


46 


965 


gil2667442 


Rattus 
norvegicus 


synaptotagmm VIIT1 


75 


J4 


966 


gil 3654639 


Bos taurus 


D-glucuronyl C5 epimerase 


Mod 


an 
9/ 


966 


gi 13442978 


Mus musculus 


D-glucuronyl C5~epimerase 


3142 


95 


r\ s~ J* 

966 


gil 1935177 


Mus musculus 


heparin/heparan sulfate: glucuronic 
acid C5 epimerase 


3137 


95 


967 


AAG71875 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1556. 


709 


86 


967 


AAG71443 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1 124. 


692 


79 


967 


AAG71816 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1497. 


684 


82 


968 


gil 2655 133 


Homo sapiens 


Similar to CGI-63 protein, clone 
MGC:708 IMAGE: 3 139846, 
mRNA, complete cds. 


1918 


100 


968 


AAB73679 


Homo sapiens 


Human oxidoreductase protein 
ORP-12. 


1918 


100 


968 


gi4929595 


Homo sapiens 


CGI-63 protein mRNA, complete 
cds. 


1874 


98 


969 


gi6062874 


Homo sapiens 


candidate tumor suppressor protein 
DICE1 mRNA, complete cds. 


1313 


81 


969 


AAY 15344 


Homo sapiens 


Tumour suppressor protein del-27. 


1313 


81 


969 


AAY28995 


Homo sapiens 


Tumour suppressor Del-27 protein 
sequence. 


1313 


81 


970 


gi387011 


Homo sapiens 


Human pyruvate dehydrogenase 
El -alpha subunit mRNA, cds. 


2187 


99 


970 


gil2803199 


Homo sapiens 


pyruvate dehydrogenase 
(hpoamide) alpha 1, clone 
MGC:8609 IMAGE:2961286, 
mRNA, complete cds. 


2049 


100 


970 


gi35379 


Homo sapiens 


Human mRNA for bram pyruvate 
dehydrogenase tbC 1.2.4.1). 


2049 


100 


971 


gi2275569 


Homo sapiens 


T cell receptor beta locus, 
TCRBV6S4A1 to TCRBV8S1 
region. 


660 


99 


971 


gi2104755 


Homo sapiens 


T cell receptor V-beta 23 
(TCRBV) gene, partial cds. 


660 


99 


971 


gi495404 


Pan troglodytes 


T cell receptor beta chain 


657 


88 


972 


AAG89141 


Homo sapiens 


Human secreted protein, SEQ ID 


1272 


100 
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NO: 261. 






972 


gil2053273 


Homo sapiens 


mRNA; cDNA DKFZp434K1172 
(from clone DKFZp434Kl 172); 
complete cds. 


1271 


99 


972 


gi292836 


Homo sapiens 


Human trichohyalm (TRHY) gene, 
complete cds. 


146 


29 


973 


gi 12654633 


Homo sapiens 


protein inhibitor of activated 
STAT3, clone MGQ1417 
IMAGE:3528679, mRNA, 
complete cds. 


2708 


96 


973 


gi4996563 


Homo sapiens 


PIAS3 mRNA for protein inhibitor 
of activatied STAT3, complete cds. 


2708 


96 


973 


AAE02937 


Homo sapiens 


Human TFRP protein. 


2708 


96 


974 


gil 5099957 


Homo sapiens 


diacylglycerol acyltransferase 2- 
like protein mRNA, complete cds. 


856 


79 


974 


gil 5099955 


Mus musculus 


diacylglycerol acyltransferase 2- 
like protein 


676 


59 


974 


AAY94889 


Homo sapiens 


Human protein clone HP02485. 


578 


56 


975 


gi7022243 


Homo sapiens 


cDNA FU10300 fis, clone 
NT2RM2000030. 


1432 


100 


975 


AAB92669 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11033. 


1432 


100 


975 


gi3 834427 


Drosophila 
melanogaster 


cytoplasmic dynein intermediate 
chain isoform DIClc 


134 


At 

24 


976 


gil2803965 


Homo sapiens 


clone MGC:4294 
IMAGE:3636069, mRNA, 
complete cds. 


519 


100 


976 


gil5919713 


Homo sapiens 


isolate NSB 1 1-3-K3-A1 0 
immunoglobulin kappa chain 
variable region gene, partial cds. 


62 


32 


976 


gil 59 19629 


Homo sapiens 


isolate NSB1 1-4-K3-B6 
immunoglobulin kappa chain 
variable region gene, partial cds. 


60 


33 


977 


gil 209685 


Homo sapiens 


Human salivary peroxidase 
mRNA, complete cds. 


3408 


99 


977 


gil 63307 


Bos taurus 


lactoperoxidase 


2906 


77 


977 


gil 1990122 


Camelus 
dromedarius 


peroxidase 


2882 


84 


978 


AAY01603 


Homo sapiens 


Amino acid sequence of the human 
defensin (Def-X) protein. 


501 


97 


978 


gi29735 


Homo sapiens 


H.sapiens mRNA for corticostatin 
HP-4 precursor. 


214 


46 


978 


gi665927 


Homo sapiens 


Human corticostatin/defensm HP-4 
precursor gene, complete cds. 


214 


46 


979 


AAG81415 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:348. 


848 


100 


979 


gil 1559416 


Oryctolagus 
cuniculus 


NADPH-dependent retinol 
dehydrogenase/reductase 


75 


31 


980 


AAR94422 


TT * 

Homo sapiens 


Bactericidal/permeability 
increasing peptide (BPI.245). 




DO 


980 


AAR76424 


Homo sapiens 


Bacterial permeability-increasing 
peptide BPI.245. 


63 


56 


980 


AAW63485 


Homo sapiens 


Human BPI protein derived 
peptide XMP.245. 


63 


56 


981 


gi4309953 


Homo sapiens 


BAC clone RP1 1-121 A8 from 
7pl4-pl3, complete sequence. 


951 


95 
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981 


gi296680 


Homo sapiens 


Human germline TCR-gamma 
gene variable region V-gamma 5. 


570 


85 


981 


gi4309952 


Homo sapiens 


BAG clone RP1 1-121A8 from 
7pl4-pl3, complete sequence. 


544 


68 


982 


gi 1223888 


synthetic 
construct 


T cell receptor alpha chain 


1070 


79 


982 


gi338766 


Homo sapiens 


Human T-cell receptor rearranged 
alpha-chain V-region (V-D-J) 
mRNA, complete cds. 


994 


73 


982 


gi3089419 


Homo sapiens 


SSC1 1 rearranged T cell receptor 
alpha chain (TCRAV17) gene, 
complete cds. 


976 


71 


983 


gil4249942 


Homo sapiens 


Similar to RIKEN cDNA 
0610008P16 gene, clone 
MGC: 15937 IMAGE: 3 53 7224, 
mRNA, complete cds. 


309 


46 


983 


AAB73512 


Homo sapiens 


Human transferase HTFS-19, SEQ 
ID NO: 19. 


309 


46 


983 


gi2828262 


Bos taurus 


aralkyl acyl-CoA:amino acid N- 
acyltransferase 


285 


42 


984 


AAG71251 


Homo sapiens 


Human gene 9-encoded secreted 
protein HMSDL37, SEQ ID 
NO:99. 


311 


90 


984 


AAG71286 


Homo sapiens 


Human gene 9-encoded secreted 
protein HMSDL37, SEQ ID 
NO:135. 


311 


90 


984 


gil3096922 


Mus musculus 


Similar to nadrm 


76 


A A 

44 


985 


gi45 19541 


Mus musculus i 


thrombospondin type 1 domain 


1299 


88 


985 


gil3625176 


Homo sapiens 


clone 1 thrombospondin mRNA, 
complete cds. 


646 


A f 

46 


985 


AAW85607 


Homo sapiens 


Secreted protein clone da228 6. 


646 


46 


986 


AAG81417 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:352. 


r\ c\ 

287 


t r\r\ 

100 


986 


gi 17655 8 


Aotus trivirgatus 


mvolucrm (large allele) 


53 


40 


986 


AAG77243 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO: 8009. 


36 


70 


987 


gi2 104856 


Rattus 
norvegicus 


platelet glycoprotein V 


551 


36 


987 


gi6449037 


Mus musculus 


platelet glycoprotein V 


550 


36 


987 


gi3 12502 


Homo sapiens 


H. sapiens GPV gene encoding 
platelet glycoprotein V precursor. 


486 


34 


988 


gil2S03851 


Homo sapiens 


nucleotide binding protein 2 
(E.coli MinD like), clone 
MGC:3473 IMAGE-.3633393, 
mRNA, complete cds. 


1385 


100 


988 


gil4124958 


Homo sapiens 


nucleotide binding protein 2 
(E.coli MinD like), clone 
MGC: 15834 IMAGE:3507768, 
mRNA, complete cds. 


1385 


100 


988 


AAB59022 


Homo sapiens 


Breast and ovarian cancer 
associated antigen protein 
sequence SEQ ID 730. 


1 « or 

1385 


100 


989 


gi9864185 


Drosophila 
melanogaster 


Crossveinless 2 


1031 


35 


989 


gi7768636 


Xenopus laevis 


Kielin 


9ll 


31 


989 


gi9S87910 


Oryctolagus 
cuniculus 


zonadhesin precursor 


540 


31 
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990 


gil3161011 


Homo sapiens 


sclerostin gene, complete cds. 


417 


92 


990 


gil3161020 


Homo sapiens 


sclerostin mRNA, complete cds. 


417 


92 


990 


gil3236418 


Homo sapiens 


SOST (SOST) mRNA, complete 
cds. 


417 


92 


991 


gi335148 


Tacaribe virus 


P-ll protein 


69 


39 


991 


gi5596437 


Petunia x 
hybrida 


Stigl 


58 


30 


992 


gi4490538 


Homo sapiens 


Human U266 rearranged DNA for 
lambda-immunoglobulin light 
chain. 


933 


81 


992 


gi33746 


Homo sapiens 


Human rearranged 
immunoglobulin lambda light 
chain mRNA. 


920 


80 


992 


AAB36212 


Homo sapiens 


Human immune system associated 
protein HISAP-10. 


913 


81 


993 


AAG03466 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7547. 


104 


31 


993 


gi259589 


Cercopithecine 
herpesvirus 1 


glycoprotein J; gJ 

i 


88 


42 


993 


gi35 14059 


Cercopithecine 
herpesvirus 1 


glycoprotein gJ 


87 


42 


994 


gi203246 


Rattus 
norvegicus 


cell adhesion-like molecule 


1739 


97 


994 


gi5 14374 


Homo sapiens 


Human (clone pHOM) opioid- 
binding cell adhesion molecule 
mRNA, complete cds. 


1666 


94 


994 


gi586 


Bos taurus 


put. pre-OPCAM (AA 1 - 345) 


1643 


93 


995 


AAB88408 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC0164. 


1358 


92 


995 


AAY13392 


Homo sapiens 


Amino acid sequence of protein 
PR0328. 


1355 


100 


995 


AAB01373 


Homo sapiens 


Neuron-associated protein. 


1355 


100 


996 


AAB85144 


Homo sapiens 1 


Human NKCR polypeptide (clone 
ID HMSOM53). 


982 


89 


996 


gi31332 


Homo sapiens 


Human mRNA for high affinity Fc 
receptor (FcRI). 


364 


42 


996 


AAY96226 


Homo sapiens 


Human high affinity Fc receptor, 
FcgammaRI. 


364 


42 


997 


gi 14165486 


Homo sapiens 


clone MGC: 15041 
IMAGE:3831657 5 mRNA, 
complete cds. 


70 


37 


997 


gi9909361 


Homo sapiens 


sphingosine kinase (SPHK) 
mRNA, complete cds. 


70 


37 


997 


gi8 132868 


Homo sapiens 


sphingosine kinase- 1 mRNA, 
complete cds. 


70 


37 


998 


AAB56802 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO:1380. 


417 


100 


998 


gi2983321 


Aquifex aeohcus 


cytochrome c-type biogenesis 
protein 


61 


36 


998 


gi6941880 


Escherichia coli 


flagellin 


63 


35 


999 


AAB08732 


Homo sapiens 


Amino acid sequence of a human 
OLD- 3 5 polypeptide. 


1375 


87 


999 


gil4042110 


Homo sapiens 


cDNA FLJ14531 fis, clone 
NT2RM2000371, weakly similar 
to POLYRIBONUCLEOTIDE 


1265 


88 
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NUCLEOTIDYLTRANSFERASE 
(EC 2.7.7.8). 






999 


AAB92684 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11065. 


1265 


88 


1000 


gi 199582 


Mus musculus 


yi A V * 111" 

B(2)-microglobulm 


616 


95 


1000 


gi50105 


Mus musculus 


beta2-microglobulin precursor (aa 
-20 to 99) 


614 


95 


1000 


gi 199576 


Mus musculus 


B(2)~microglobulin 


609 


94 


1001 


gi7023273 


Homo sapiens 


cDNA FLJ 10922 fis, clone 
OVARC1 000420. 


672 


99 


1001 


AAB93335 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12441. 


672 


99 


1001 


AAM06337 


Homo sapiens 


Human foetal protein, SEQ ID NO: 
68. 


328 


75 


1002 


gil 5559608 


Homo sapiens 


Similar to zinc finger protein 16 
(KOX 9), clone MGC:20886 
IMAGE:4549240, mRNA, 
complete cds. 


2079 


100 


1002 


gi55471 


Mus musculus 


Zfp-29 


725 


59 


1002 


gi6409345 


Homo sapiens 


zinc finger protein ZNF180 
(ZNF180) mRNA, complete cds. 


730 


54 


1003 


gi488555 


Homo sapiens 


Human zinc finger protein ZNF135 
mRNA, complete cds. 


1664 


60 


1003 


gil769491 


Homo sapiens 


Human kruppel-related zinc finger 
protein (ZNF184) mRNA, partial 
cds. 


1633 


49 


1003 


gil 0436789 


Homo sapiens 


cDNA FLJ14345 fis, clone 
THYRO1001 1 89, weakly similar 
to ZINC FINGER PROTEIN 91. 


1619 


57 


1005 


AAB23641 


Homo sapiens 


Human secreted protein SEQ ID 
NO: 97. 


609 


100 


1005 


AAE01594 


Homo sapiens 


Human gene 12 encoded secreted 
protein HHSGW69, SEQ ID 
NO: 144. 


93 


35 


1005 


gi5668598 


Homo sapiens 


Wiskott-Aldrich syndrome protein 
interacting protein (WASPIP) 
mRNA, partial cds. 


92 


35 


1006 


gil 1493473 


Homo sapiens 


PR02225 


163 


67 


1006 


gi9654986 


Vibrio cholerae 


autoinducer-2 production protein 


64 


33 


1006 


AAG02328 


Homo sapiens 


Human secreted protein, SEQ ID I 
NO: 6409. 


58 


45 


1008 


AAM00955 


Homo sapiens 


Human bone marrow protein, SEQ 
ID NO: 431. 


704 


100 


1008 


gi4902661 


Homo sapiens 


Novel human gene mapping to 
chomosome 22. 


635 


100 


1008 


gil2052896 


Homo sapiens 


mRNA; cDNADKFZp564F1978 
(from clone DKFZp564F1978); 
complete cds. 


635 


100 


1009 


AAE01420 


Homo sapiens 


Human secreted protein fragment, 
SEQ ID NO: 144. 


612 


100 


1009 


gi4028545 


Homo sapiens 


LIM domain only 7 (LM07) gene, 
exon T and alternative splice 
products, partial cds. 


65 


44 


1009 


gil 620061 


Paramecium 
bursaria 
Chlorella virus 1 


a389R 


66 


46 
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1010 


gi9408382 


Raja eglanteria 


eos 


79 


26 


1010 


gi2062744 


Xenopus 
laevis/gilli 


Ikaros homolog 


63 


27 


1010 


gill99604 


Homo sapiens 


Human zinc finger protein C2H2- 
25 mRNA, complete cds. 


97 


27 


1011 


gil3097207 


Homo sapiens 


ribosomal protein, large, PI, clone 
MGC:5215 IMAGE: 2900846, 
mRNA, complete cds. 


332 


100 


1011 


gi 14043204 


Homo sapiens 


ribosomal protein, large, PI, clone 
MGC: 15616 IMAGE: 3 343 021, 
mRNA, complete cds. 


332 


100 


1011 


gi 190234 


Homo sapiens 


Human acidic ribosomal 
phosphoprotein P 1 mRNA, 
complete cds. 


332 


100 


1012 


AAW88457 


Homo sapiens 


Human lysophospholipase IHLP. 


1113 


93 


1012 


AAY64648 


Homo sapiens 


Human lysophospholipase 
homology protein. 


825 


100 


1012 


gil0303289 


Neurospora 
crassa 


related to lysophospholipase 


201 


34 


1013 


g i 140434 17 


Homo sapiens 


clone IMAGE:3953868, mRNA, 
partial cds. 


919 


98 


1013 


gil5080096 


Homo sapiens 


clone MGC:20451 
IMAGE:3830864, mRNA, 
complete cds. 


919 


98 


1013 


AAG81374 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO-.266. 


919 


98 


1014 


gil3543427 


Homo sapiens 


Similar to RIKEN cDNA 
5730469M10 gene, clone 
MGC:4248 IMAGE:3010078, 
mRNA, complete cds. 


191 


36 


1014 


AAW67858 


Homo sapiens 


Human secreted protein encoded 
by gene 52 clone HAUCC47. 


191 


36 


1014 


AAY66746 


Homo sapiens 


Membrane-bound protein 
PROH98. 


191 


36 


1015 


AAB56791 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO:1369. 


496 


98 


1015 


gi2984030 


Aquifex aeolicus 


phosphofructokinase 


70 


25 1 


1015 


gil388150 


Haemophilus 
ducreyi 


ribosomal protein L31 


51 


45 


1016 


gi29509 


Homo sapiens 


Human BTG1 mRNA. 


655 


100 


1016 


gi50188 


Mus musculus 


btgl 


655 


100 


1016 


gi293306 


Mus musculus 


B-cell translocation gene-1 protein 


655 


100 


1017 


gi 13623633 


Homo sapiens 


clone MGC: 13 105 
IMAGE:3957973, mRNA, 
, complete cds. 


2984 


100 


1017 


AAB81188 


Homo sapiens 


Human zinc finger protein 52 
(ZFP-52). 


2389 


98 


1017 


AAB95368 


Homo sapiens 


Human protein sequence SEQ ID 
NO:17684. 


1712 


»| /-V /"V 

100 


1018 


gil2002127 


Homo sapiens 


CCK1 protein (CCK1) mRNA, 
complete cds. 


55 


45 


1018 


gi9392591 


Homo sapiens 


CC chemoldne CCL28 (SCYA28) 
mRNA, complete cds. 


55 


45 


1018 


gil0312152 


Homo sapiens 


mucosae-associated epithelial 
chemokine mRNA, complete cds. 


55 


45 
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1019 


AAB90816 


Homo sapiens 


Human shear stress-response 
protein SEQ ID NO: 140. 


365 


100 


1019 


gil0039341 


Dugesia 
dorotocephala 


Vasa-related protein P1VAS 1 


67 


45 


1019 


gil5158371 


Agrobacterium 
tumefaciens 


AGR_L_42GMp 


64 


30 


1020 


gil377897 


Homo sapiens 


heart protein (FHL-2) mRNA, 
complete cds. 


1631 


100 


1020 


gi 1160932 


Homo sapiens 


(clone 35.3) DRAL mRNA, 
complete cds. 


1625 


99 


1020 


gi7209525 


Homo sapiens 


gene for DRAL/Shm3/FHL2, exon 
5 and complete cds. 


1625 


99 


1021 


gi7770259 


Homo sapiens 


PR02975 


794 


100 


1021 


gil 142588 


Trypanosoma 
brucei 


CR3 


55 


40 


1021 


gil237130 


Escherichia coli 


O antigen polymerase 


89 


28 


1022 


AAG81348 


Homo sapiens 


Human AFP protein sequence SEQ 
IDNO:214. 


530 


63 


1022 


gi4206763 


Arabidopsis 
thaliana 


cell wall-plasma membrane linker 
protein nomolog 


119 


30 


1022 


gi2623666 


Canis familiaris 


dentatorubro-pallidoluysian 
atrophy protein 


101 


30 


1023 


AAG01390 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5471. 


297 


100 


1023 


AAB57280 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO:1858. 


66 


34 


1023 


gi456681 


Pseudorabies 
virus 


helicase 


80 


26 


1024 


gi 10 17722 


Homo sapiens 


Human repressor transcriptional 

s> r f—y-v x n O t" X "VTA t a 

factor (ZNF85) mRNA, complete 
cds. 


1344 


58 


1024 


gi 143485 91 


Homo sapiens 


KRAB zinc finger protem (KR19) 
mRNA, complete cds. 


1342 


58 


1024 


gi4454678 


Homo sapiens 


zinc finger protem 4 


1321 


59 


1025 


gil65680 


Oryctolagus 
cumcums 


recombination activating protein 


r^. f if f 

2666 


91 


1025 


gi2576246 


Mus musculus 


RAG-2 protein 


2594 


88 


1025 


gil5809414 


Mormoops 
blamvilln 


recombination activator protein 2 


2387 


91 


1026 


AAG75278 


Homo sapiens 


Human colon cancer antigen 
protem SEQ IDNO:6042. 


1061 


95 


1026 


gi211896 


Gallus gallus 


h-caldesmon 


153 


29 


1026 


gi600255 


Gallus gallus 


caldesmon 


153 


29 


1027 


AAY87341 


Homo sapiens 


Human signal peptide containing 
protein HSPP-118 SEQ ID 
NO:118. 


699 


99 


1027 


AAY12929 


Homo sapiens 


Amino acid sequence of a human 
secretea peptiae. 


66 


51 


1027 


gi930084 


Homo sapiens 


Human Kox21 mRNA for zinc 
finger protein, partial. 


41 


31 


1028 


gil73324 


Candida glabrata 


metallothionein II 


37 


71 


1028 


gil 73 326 


Candida glabrata 


metallothionein (MTII) 


37 


71 


1028 


gil73328 


Candida glabrata 


metallothionein II 


37 


71 


1029 


AAE02058 


Homo sapiens 


Human four disulfide core domain 
(FDCD)-containing protein. 


598 


4 4 
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1029 


gil2655452 


Homo sapiens 


mRNA for keratin associated 
protein 4.7 (KRTAP4.7 gene). 


557 


46 


1029 


gil2655456 


Homo sapiens 


partial mRNA for keratin 
associated protein 4.9 (KRTAP4.9 
gene). 


540 


42 


1030 


gil0639287 


Thermoplasma 
acidophilum 


amino acid transporter related 
protein 


66 


29 


1030 


gil4325303 


Thermoplasma 
volcanium 


amino acid transporter 


64 


28 


1030 


gil5341701 


Homo sapiens 


clone MGC: 19805 
IMAGE.3939974, mRNA, 
complete cds. 


60 


34 


1031 


gil2653801 


Homo sapiens 


peptidylprolyl isomerase A 
(cyclophilin A), clone MGC:2351 
IMAGE:3349335, mRNA, 
complete cds. 


820 


93 


1031 


gil2804335 


Homo sapiens 


clone IMAGE:2823490, mRNA, 
partial cds. 


820 


93 


1031 


gil3529080 


Homo sapiens 


peptidylprolyl isomerase A 
(cyclophilin A), clone MGC: 12404 
IMAGE:3935025, mRNA, 
complete cds. 


820 


93 


1032 


gi207621 


Rattus 
norvegicus 


uromodulin 


98 


36 


1032 


gi912817 


Rattus sp. 


Tamm-Horsfall protein; THP 


98 


36 


1032 


gi602768 


Mytilus 

galloprovincialis 


adhesive plaque matrix protein 
precursor 


86 


29 


1033 


AAG03055 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7136. 


269 


100 


1033 


AAY73471 


Homo sapiens 


Human secreted protein clone 

yd 153 1 protein sequence SEQ ID 

NO: 164. 


62 


33 , 


1034 


AAB38043 


Homo sapiens 


Fragment of human secreted 
protein encoded by gene 10 clone 
HWHGP71. 


124 


38 


1034 


gi5305335 


Mycobacterium 
tuberculosis 


proline-rich mucin homolog 


114 


36 


1034 


gi22599 


Arabidopsis 
thaliana 


APG 


112 


37 


1035 


gi6467206 


Homo sapiens 


GIOT-4 mRNA for gonadotropin 
inducible transcription repressor-4, 
complete cds. 


2004 


60 


1035 


AAY58627 


Homo sapiens 


Protein regulating gene expression 
PRGE-20. 


1732 


56 


1035 


gi3953593 


Mus musculus 


Zinc finger protein si 1-6 


1720 


54 


1036 


AAB95007 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16685. 


518 


86 


1036 


gi45906 


Proteus vulgaris 


hlyC protein (AA 1-54) 


72 


45 


1036 


AAB56607 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO: 11 85. 


74 


56 


1037 


AAY27616 


Homo sapiens 


Human secreted protein encoded 
by gene No. 50. 


562 


99 


1037 


gil2957417 


Casuarius 
bennetti 


ATPase subunit 8 


62 


35 


1037 


gi3320O9 


Murine leukemia 
virus 


pl5-gag protein 


57 


27 
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1038 


gi 15822827 


Homo sapiens 


mRNA for pendnn-like protein 1, 
complete cds. 


1 1 C A 

1154 


39 


1038 


gi 13344999 


XT * 

Homo sapiens 


solute carrier family 26 member 6 
(SLC26A6) mRNA, complete cds. 


1 1 r\f\ 


•2*7 


1038 


AAY71067 


T T * 

Homo sapiens 


Human membrane transport 
protein, MTRP-12. 


izy / 


J / 


1039 


AAY57945 


T T * 

Homo sapiens 


Human transmembrane protein 
HTMPN-69. 






1039 


AAY76141 


T T * 

Homo sapiens 


Human secreted protein encoded 
by gene 18. 






1039 


AAB24037 


x t • 

Homo sapiens 


Human PRO 1555 protein sequence 
SEQ ID NO:49. 


jD6 


/IT 

4/ 


1040 


AAY59672 


XT * 

Homo sapiens 


Secreted protein 1 08-006- 5-0-L6- 
FL. 


555 


CI 


1040 


gil0435214 


XT * 

Homo sapiens 


cDNA FLJ 13263 lis, clone 
OVARC 1000924. 


54y 


DO 


1040 


AAB94543 


XT * 

Homo sapiens 


Human protein sequence SEQ ID 
NO: 15290. 


549 


o2 


1041 


AAY92710 


Homo sapiens 


Human membrane-associated 
protein Zsig24. 


/(J4 


y i 


1041 


AAY87250 


Homo sapiens 


Human signal peptide containing 
protein HSPP-27 SEQ ID NO:27. 


566 




1041 


AAG00627 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4708. 


260 


100 


1042 


gil4572521 


Homo sapiens 


NEPH1 (NEPH1) mRNA, 
complete cds. 


1512 


51 


1042 


AAB37996 


Homo sapiens 


Human secreted protein encoded 
by gene 13 clone HIBEU15. 


1164 


92 


1042 


gi 10434261 


Homo sapiens 


cDNA FLJ 12646 fis, clone 
NT2RM4001987, weakly similar 

. \Tmm A T /"IT -1 T T A r\T1DC!TA\T 

to NEURAL CELL ADHESION 
MOLECULE 1, LARGE, 
ISUrUKM rKiiCUKISUK. 


1035 


41 


1043 


'f^f\ arts' 

gi29806 


XT " „ 

Homo sapiens 


Human mKJNA tor cuoy, an JLY- 
6-like protein regulating 
complement rnemorane attacic. 


/ 1U 


1 fiA 
If U 


1043 


gi825637 


Homo sapiens 


H.sapiens gene for CD59 protein, 
exon 2. 


710 


100 


1043 


gi29815 


Homo sapiens 


Human mRNA tor GD5V antigen. 


^71 n 

/IV 


i fin 


1044 


gi6841140 


Homo sapiens 


HSPC100 mRNA, partial cds. 


498 


100 


1044 


gi2828808 


Bacillus subtilis 


glucose transporter 


111 


25 


1044 


gi9106658 


X T 1 11 

Xylella 
fastidiosa 9a5c 


I / i j j » 

glucose/galactose transporter 


1 /(A 

140 


23 


1045 


AAB56632 


XT * 

Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO:1210. 


3377 


99 


1045 


gil3097708 


T "I • 

Homo sapiens 


ribopnorm 11, clone MGC:iol / 
IMAGE:3546673, mRNA, 
compieic cas. 




1 AA 


1045 


gi5834424 


Homo sapiens 


RIBIIR gene (partial), exon 1 and 
joined CDS. 


3152 


100 


1046 


gil3182757 


Homo sapiens 


HTPAP mRNA, complete cds. 


598 


100 


1046 


AAG89279 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 399. 


598 


100 


1046 


AAB70690 


Homo sapiens 


Human hDPP protein sequence 
SEQ ID NO:7. 


598 


100 
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1047 


gi2276448 


Homo sapiens 


Human MHC class I HLA-A 
(HLA-A-0302-new allele) mRNA, 
complete cds. 


1794 




1047 


gi6815812 


Homo sapiens 


MHC class I antigen heavy chain 

/ttt a A \ TlXT A TTT A A ikmAO 

(HLA-A) mRNA, HLA-A* 03 02 
allele, complete cds. 


1794 


93 


1047 


gi 1245460 


Homo sapiens 


Human MHC class I HLA-A allele 
(HLA-A) mRNA, complete cds. 


1 'HOC 


yZ 


1048 


AAB95392 


Homo sapiens 


tt i * __ ______ f1T7_*\ T "1 \ 

Human protein sequence SEQ ID 
NO: 17743. 


DO / 


/O 


1048 


AAB29645 


Homo sapiens 


Human membrane-associated 
protein HUMAP-2. 




/U 


1048 


AAB95049 


Homo sapiens 


TT i * _,__. ____._--..___-_. -~ OT_7^\ T 1 

Human protein sequence bh.v_) 1JJ 
NO: 16845. 


jyo 


7fi 
/ 0 


1049 


gi 140 17773 


Mus museums 


Cgl 0671 -like 


131/ 




1049 


gil4017764 


Mus musculus 


CG10671-like 


1517 


96 


1049 


gil6198091 


_____ m | M 

Drosophila 
melanogaster 


LD30661p 


1 OA 




1050 


AAG81431 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:380. 


503 


97 


1050 


gi6707026 


Monodelphis 
domestica 


■ 1 "f t * 1 " 1 _L ___.1___. _________ 1 — *-_.____■_ «. 

immunoglobulin light chain kappa 


1 AO 


zo 


1050 


gi6653413 


Oryctolagus 
cumculus 


immunoglobulin light chain VJ 
kappa region 


102 


27 


1051 


gil2836893 


Gallus gallus 


IPR328-like protein 


158 


29 


1051 


gi3093433 


Homo sapiens 


Chromosome 16 BAC clone 
CIT987SK-625P11, complete 
sequence. 


151 


29 


1051 


gi4558766 


Homo sapiens 


neuronal voltage gated calcium 
channel gamma- 3 subunit mRNA, 
complete cds. 


151 


29 


1052 


gi4337100 


Homo sapiens 


MSH55 gene, partial cds; and 
CLIC1, DDAH, G6b, G6c, G5b, 
G6d, G6e, G6f, BAT5, G5b, 
CSK2B, BAT4, G4, Apo M 3 
BAT3, BAT2, AIF-1, 1C7, LST-1, 
LTB, TNF, and LTA genes, 
complete cds. 


400 


100 


1052 


gi5304878 


Homo sapiens 


genes encoding RNCC protein, 
DDAH protein, Ly6-C protein, 

t y w _■ _j 

Ly6-D protein and 
immunoglobulin receptor. 


400 


1 AA 


1052 


AAY27597 


Homo sapiens 


Human secreted protein encoded 
by gene No. 3 1 . 


A A A 

400 


1UU 


1053 


A AB 88325 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC0020. 


912 


99 


1053 


AAB53257 


Homo sapiens 


Human colon cancer antigen 
protein sequence SEQ ID NO: 797. 


859 


99 


1053 


gil3325409 


Homo sapiens 


clone IMAGE:3845253, mRNA, 

_^ ■■4' ■ *■_ 1 _(^ g^~\ #*< 

partial cas. 


774 


100 


1054 


gil234787 


Xenopus laevis 


up-regulated by thyroid hormone 
in tadpoles; expressed specifically 
in the tail and only at 
metamorphosis; membrane bound 
or extracellular protein; C-terrninal 
basic region 


917 


61 


1054 


gil0435980 


Homo sapiens 


cDNA FLJ13840 fis, clone 


812 


62 
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THYRO 1000783, moderately 
similar to Xenopus laevis tail- 
specific thyroid hormone up- 
regulated (gene 5) mRNA. 






1054 


AAB94773 


Homo sapiens 


Human protein sequence SEQ ID 
NO:l5860. 


812 


62 


1055 


gi4099139 


Homo sapiens 


Human P2X4 purinoreceptor gene, 
exons 9, 10, ll and 12 and 
complete cds. 


2014 


100 


1055 


gi4099121 


Homo sapiens 


Human P2X4 purinoreceptor 
mRNA, complete cds. 


2014 


i An 


1055 


AAW47066 


Homo sapiens 


Human bram P2X-1 receptor 
polypeptide. 


2014 


100 


1056 


AAE03560 


Homo sapiens 


■w- -w- -f • ] ' It J 

Human differentially expressed 
kidney cDNA 22360 encoded 
protein. 


1020 


on 

yy 


1056 


gii5637151 


Beta vulgaris 


glycine decarboxylase subunit P 


62 


36 


1056 


gi5 824822 


Caenorhabditis 
elegans 


Y53F4A.2 


62 


25 


1057 


gi972946 


Mus musculus 


ZP1 precursor 


2217 


67 


1057 


gi 1113794 


Mus musculus 


zona pellucida 


2210 


67 


1057 


gi2804566 


Rattus 
norvegicus 


zona pellucida 1 glycoprotein 


2200 


67 


1058 


gi 15779 156 


Homo sapiens 


Similar to RIKEN cDNA 
1810073N04 gene, clone 
MGC: 15523 IMAGE:3028844, 
mRNA, complete cds. 


1858 


100 


1058 


gi 130 97045 


Mus musculus 


Similar to RIKEN cDNA 
1810073N04 gene 


1719 


91 


,1058 


gi603254 


Saccharomyces 
cerevisiae 


Yel064cp 


319 

« 


27 


1059 


AAW03516 


Homo sapiens 


Prostaglandin DP receptor. 


1467 


100 


1059 


gi940379 


Homo sapiens 


Human DP prostanoid receptor 
(PTGDR) gene, 5' region and 
partial cds. 


1467 


100 


1059 


gi4567038 


Rattus 
norvegicus 


prostaglandin D2 receptor 


1127 


77 


1060 


gi2S11122 


Xenopus laevis 


NaDC-2 


1274 


56 


1060 


gil098557 


Homo sapiens 


Human renal sodium/dicarboxylate 
cotransporter (NADC1) mRNA, 
complete cds. 


1618 


55 


1060 


gi3 168585 


Rattus 
norvegicus 


sodium-dependent dicarboxylate 
transporter 


1614 


54 


1061 


gi3036840 


Homo sapiens 


mRNA for cystinosin. 


1686 


88 


1061 


gi3036851 


Homo sapiens 


CTNS gene, exon 3, flanking 
intronic regions and joined CDS. 


1686 


88 


1061 


gi7239176 


Homo sapiens 


vanilloid receptor gene, partial 
sequence; CARKL and CTNS 
genes, complete cds; TIP1 gene, 
partial cds; rzADb ana JrzADa 
genes, complete cds; and 
HUMINAE gene, partial cds. 


1686 


88 


1062 


gi41077 


Escherichia coli 


cal protein precursor (aa 1-51) 


63 


42 


1062 


gi6474978 


Schizosaccharom 
yces pombe 


Amino acid permease 


62 


27 


1062 


AAB40157 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID NO:67. 


60 


27 
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1063 


AAY36071 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 456. 


1252 


92 


1063 


gi 15990604 


Homo sapiens 


RAE-1-like transcript 4 mRNA, 
complete cds. 


1022 


91 


1063 


AAG00501 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4582. 


533 


95 


1064 


gi 142905 60 


Homo sapiens 


Similar to transmembrane 7 
superfamily member 2, clone 
MGC:9286 IMAGE:3874367, 
mRNA, complete cds. 


1548 


98 


1064 


gi 15277509 


Homo sapiens 


Similar to transmembrane 7 
superfamily member 2, clone 
MGQ17157 IMAGE:42 14662, 
mRNA, complete cds. 


1548 


97 


1064 


gi3211722 


Homo sapiens 


lamm B receptor homolog 
TM7SF2 (TM7SF2) mRNA, 
complete cds. 


1132 


77 


1066 


AAE06611 


Homo sapiens 


Human protein having 
hydrophobic domain, HP03696. 


1552 


99 


1066 


gil 3676372 


Homo sapiens 


clone MGC:4595 
IMAGE:3345729, mRNA, 
complete cds. 


469 


50 


1066 


AAY41690 


Homo sapiens 


Human PR0329 protein sequence. 


469 


50 


1067 


AAG72119 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1 800. 


1036 


81 


1067 


gi3769616 


Rattus 
norvegicus 


olfactory receptor 


887 


81 


1067 


gil 2054453 


Homo sapiens 


6M1-18*01 gene for olfactory 
receptor, cell line BM28.7. 


547 


42 


1068 


gi7106778 


Homo sapiens 


HSPC194 


c o r\ 

530 


95 


1068 


AAW64547 


Homo sapiens 


Human stomach cancer cell clone 
HP 10 175 protein. 


530 


95 


1068 


AAY35949 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 198. 


530 


95 


1069 


gi402185 


Homo sapiens 


H.sapiens ALK-2 mRNA. 


15 11 




1069 


gi338219 


Homo sapiens 


Human novel serine kinase 
receptor mRNA, complete cds. 


1572 




1069 


AAR85206 


Homo sapiens 


Human ALK-2. 


1572 


100 


1070 


gi4 128041 


Homo sapiens 


claudin-9 (CLDN9) gene. 


227 


35 


1070 


AAB64401 


Homo sapiens 


Amino acid sequence of human 
intracellular signalling molecule 
INTRA33. 


227 


35 


1070 


gi4325296 


Mus musculus 


claudin-9 


O 1 /i 

214 


34 


1071 


gil405S93 


Homo sapiens 


H.sapiens MICA gene. 


1896 


93 


1071 


AAW60043 


Homo sapiens 


Human MHC class I chain-related 
gene A (MICA) polypetide. 


1896 


93 


1071 


gi508492 


Homo sapiens 


Human MHC class I-related 
protein mRNA, complete cds. 


1838 


90 


1072 


gil 5292437 


Drosopniia 
melanogaster 


LP1U2 /2p 






1072 


gi4877582 


Homo sapiens 


lipoma HMGIC fusion partner 
(LHFP) mRNA, complete cds. 


221 


28 


1072 


AAY87336 


Homo sapiens 


Human signal peptide containing 
protein HSPP-1 13 SEQ ID 
NO:113. 


221 


28 


1073 


AAB58289 


Homo sapiens 


Lung cancer associated 


1338 


100 
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polypeptide sequence SbQ ID 627. 






1073 


AAY29332 


Homo sapiens 


Human secreted protein clone 
pe584_2 protein sequence. 


1338 


100 


1073 


AAB75295 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 
NO:114. 


1247 


100 


1074 


AAB58289 


Homo sapiens 


Lung cancer associated 
polypeptide sequence obQ ID 62/. 


1338 


100 


1074 


AAY29332 


Homo sapiens 


Human secreted protein clone 
pe584_2 protein sequence. 


1338 


100 


1074 


AAB75295 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 
NO:ll4. 


1247 


100 


1075 


AAE04780 


T T * 

Homo sapiens 


Human vesicle trariickmg protem- 
23 (VETRP-23) protein. 


©64 




1075 


AAB28629 


Homo sapiens 


Human Bl lAgl antigen splice 
isoform B11C-8. 


546 


39 


1075 


AAB28630 


Homo sapiens 


Human Bl lAgl antigen splice 
isoform B11C-9-16. 


546 


39 


1076 


gil58 11373 


Mus musculus 


G protein coupled receptor 
affecting testicular descent 


1269 


83 


1076 


gil0441730 


Homo sapiens 


leucine-rich repeat-containing G 
protein-coupled receptor 7 (LGR7) 
mRNA, complete cds. 


1004 


62 


1076 


AAY42170 


Homo sapiens 


Human LGR7 long form protein 
sequence. 


1004 


62 


1077 


gi 13544043 


Homo sapiens 


clone IMAGE:3627317, mRNA, 
partial cds. 


1257 


52 


1077 


gi 14249892 


T T * 

Homo sapiens 


spinster-like protein, clone 
MGC: 15767 IMAGE:3501826, 
mRNA, complete cds. 


1257 


bl 


1077 


gil20039S0 


Homo sapiens 


spinster-like protein mRNA, 
complete cds. 


1257 


52 


1078 


AAB85029 


Homo sapiens 


Protein encoded by BAP2 8 cDNA 
consisting of exons 1 to 45. 


1618 


68 


1078 


AAW54099 


Homo sapiens 


Homo sapiens BAP28 sequence. 


1617 


67 


1078 


gi7022341 


Homo sapiens 


cDNA FLJ10359 fis, clone 
NT2RM2001243. 


1 c o o 

1588 


92 


1079 


gil 349 1841 


Rattus 

• 

norvegicus 


gamma-glutamyltranspeptidase- 
like protein 


209 


34 


1079 


AAG75266 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:6030. 


217 


100 


1079 


gi57806 


Rattus sp. 


gamma-glutamyltranspeptidase 
(AA 1-568) 


186 


33 


1080 


gi5 262646 


Homo sapiens 


mRNA; cDNA DKFZp434I09l 
(from clone DKFZp434I09l); 
partial cds. 


2917 


100 


1080 


gi6807820 


Homo sapiens 


mRNA; cDNA DKFZp434A2372 
(from clone DKFZp434A2372); 
partial cds. 


629 


100 


1080 


gil408182 


Homo sapiens 


Human LGN protein mRNA, 
complete cds. 


282 


31 


1081 


gi4878022 


Homo sapiens 


acyl-coenzyme A: cholesterol 
acyltransferase mRNA, complete 
cds. 


930 


98 


1081 


AAR53079 


Homo sapiens 


Acetyl coenzyme A: cholesterol 
acetyltransferase (AC AT). 


925 


98 
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1081 


AAW38416 


Homo sapiens 


T T 1 

Human acyl-coenzyme 

A 1 t 2. 1 1 j _ _C „ T 

A: cholesterol acyl transferase I. 


925 


98 


1082 


gi458938 


Saccharomyces 
cerevisiae 


Yhrl86cp 


1004 


58 


1082 


'?A11 1 A A 

gi5921144 


Schizosaccharom 
yces pombe 


mipl 


2049 


52 


1082 


gi9366720 


Trypanosoma 
brucei 


possible tlool 1.22 protein. 


277 




1083 


gi402l87 


T T * 

Homo sapiens 


TT " A T Tf O T"» "K T A 

H.sapiens ALK-3 mRNA. 


1664 




1083 


AAR55368 


Homo sapiens 


Human Activin receptor-like 
kinase 3 (nALK-3). 


1664 


99 


1083 


A A n OTA /I '"7 

AAR85207 


Homo sapiens 


x T_ A T TS~ O 

Human ALK-3. 


1 £LCA 

1664 




1084 


gi609354 


Xenopus laevis 


BMP receptor 


1485 


90 


1084 


gi2446992 


Xenopus laevis 


'BMP receptor' 


1483 


on 

89 


1084 


gi355l073 


Damo reno 


type I serin/threonine kinase 
receptor 


1 A C 1 

1451 


87 


1085 


AAW90873 


Homo sapiens 


Human bram-speciFic dysferlm 
protein. 


1340 


53 


1085 


gi3600028 


Homo sapiens 


dysferlin mRNA, complete cds. 


1340 


53 


1085 


AAY82643 


Homo sapiens 


TT J X» 1 ' J * 

Human dysferlm protein sequence 
SEQ ID NO:2. 


1 O A f\ 

1340 


53 


p-t /"l 

1086 


gi3600028 


Homo sapiens 


dysferlin mRNA, complete cds. 


i o f~ s~ 

1866 


49 


1086 


AAY82643 


Homo sapiens 


Human dysferlin protein sequence 
SEQ ID NO:2. 


1866 


49 


1086 


AAW90868 


Homo sapiens 


TT J X* 1 * j • 

Human dysferlin protein. j 


i a f f 

1866 


49 


1087 


AAY92321 


XT * 

Homo sapiens 


Human alpha-2-delta-D calcium 
channel subunit. 


5881 


99 


1087 


AAB 62262 


Homo sapiens 


T T 1 * 1 1 

Human calcium channel ' 
alpha2delta subunit. 


5745 


C\C\ 

99 


1087 


AAY92323 


T T * 

Homo sapiens 


Human alpha-2-delta-D 
polypeptide rrom splice variant 1 . 


4976 


99 


-i Ann 

1088 


gi2 104689 


Mus museums 


alpha glucosidase II, alpha subunit 


1796 


DD 


1088 


gi 1890664 


Sus scrofa 


glucosidase 11 


1792 


55 


1088 


gi7672977 


Homo sapiens 


glucosidase II alpha subunit 
mRNA, complete cds. 


1783 


55 


1089 


AAY01143 


Homo sapiens 


Secreted protein encoded by gene 
9 clone HSIDY06. 


238 


100 


1089 


gi6692409 


Otus longicornis 


cytochrome b 


64 


38 


1089 


gil03121S5 


Otus watsonn 


cytochrome b 


61 


43 


1090 


gi 13477285 


Homo sapiens 


structure specific recognition 
protein 1, clone MGC:1608 
IMAGE:3536048, mRNA, 
complete cds. 


3683 


100 


1090 


gi 184242 


Homo sapiens 


Human high mobility group box 
(SSRP1) mRNA, complete cds. 


3683 


100 


1090 


AAR38744 


Homo sapiens 


Human SSRP. 


3683 


100 


1091 


gil77814 


Homo sapiens 


Human alpha- 1-antitrypsin-related 
protein gene, exons 3, 4 and 5. 


1925 


90 


1091 


AAP50132 


T T * 

Homo sapiens 


Sequence of the predominant form 
of human alpha- l-antitrypsin( AT). 


828 


59 


1091 


gil 5990507 


Homo sapiens 


Similar to serine (or cysteine) 
proteinase inhibitor, clade A 
(alpha- 1 antiproteinase, 
antitrypsin), member 1, clone 
MGQ23330 IMAGE:4644658, 
mRNA, complete cds. 


1409 


66 
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1092 


AAB56819 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO:1397. 


1054 


100 


1092 


gil5981490 


Yersinia pestis 


protease 


137 


28 


1092 


lr\^tr A C\C\ £ 

gi965499D 


Vibrio cnolerae 


protease uegb 


1 




1093 


gil3543976 


Homo sapiens 


clone IMAGE: 3603998, mRNA, 
partial cos. 


1523 


100 


1093 


gil 5930240 


Homo sapiens 


Similar to CAP-binding protein 
complex interacting protein 2, 
clone JViOL- :yyoz 
1MAGE:3878011, mRNA, 
complete cus. 


1523 


100 


1093 


AAY57946 


Homo sapiens 


Human transmembrane protein 

Tj^FA/f DAT HC\ 
rl 1 IVLr IN- / U. 


1128 


100 


1094 


AAY53031 


Homo sapiens 


Human secreted protein clone 
dd426_l protein sequence SEQ ID 


590 


93 


1094 


AAY71062 


Homo sapiens 


Human membrane transport 
protein, MTRP-7. 


158 


26 


1 Art A 

1094 


gll5529155 


Arabiaopsis 
thaliana 


/v i ogjUjyu/ 1 ojzz_io 






1095 


gi4959568 


Homo sapiens 


nuclear pore complex interacting 
protein NPIP (NPIP) mRNA, 
complete cds. 


IDJU 


Q4 


1095 


gi2342743 


TT „ ' „ 

Homo sapiens 


Human Chromosome 10 t>AL 
clone CIT987SK-A-589H1, 
complete sequence. 


lOZ / 




1095 


AAY10915 


Homo sapiens 


Amino acid sequence of a human 
secreieQ peptiae. 


760 


88 


1096 


gi7022118 


Homo sapiens 


cDNA FLJ10213 fis, clone 

WTnA/fR A 1 C\(\fiAl A txr^pVlv cimilar 
txCllVlO/\l VJUO'r Vv CalViy Mlllllal 

to 40 KD PROTEIN. 


1074 


99 


1096 


a a Dm/cnn 

AAByzouy 


Homo sapiens 


xiuman protein sequence oiiv^ lu 
NO: 10874. 


1 074 


99 


1096 


gi45oooo 


r>orna disease 
virus 


pHU 




41 


1097 


gi38432 


Homo sapiens 


H.sapiens gene for mitochondrial 
ATP synthase c subunit (P2 form). 


612 


90 


1097 


gi285910 


Homo sapiens 


rz mivLNA tor Air syninase 
subunit c, complete cds. 




90 


1097 


AAB43694 


Homo sapiens 


riuman cancer associaiea proLcin 
sequence SEQ ID NO:1139. 




90 


1098 


gi897827 


Homo sapiens 


Human iron-responsive eienierii- 
binding protein/iron regulatory 
proiem z ^ii\jj/~ij-r z/ irvrz j mxvi^-rv, 
partial cds. 


'+700 


99 


1098 


gi89758l 


Homo sapiens 


Human iron-reguiatory proiem z, 
(IRP2) mRNA, partial cds. 




99 

yy 


1098 


gi897583 


Rattus 


iron-regulatory protein z 


*r / \J\J 




1099 


gi5732908 


Homo sapiens 


BPAGln3 (BPAG1) mRNA, 
partial cds. 


75 


32 


1099 


AAY87302 


Homo sapiens 


Human signal peptide containing 
protein HSPP-79 SEQ ID NO:79. 


61 


35 


1099 


AAY76213 


Homo sapiens 


Human secreted protein encoded 
by gene 90. 


61 


35 


1100 


gi5732908 


Homo sapiens 


BPAGln3 (BPAG1) mRNA, 


75 


32 
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partial cds. ' 






1100 


AAY87302 


Homo sapiens 


Human signal peptide containing 
protein HSPP-79 SEQ ID NO:79. 


61 


35 


1100 


AAY76213 


Homo sapiens 


_ _ i * -f-| 

Human secreted protein encoded 
by gene 90, 


61 


35 


1101. 


gi5732908 


Homo sapiens 


BPAGln3 (BPAG1) mRNA, 
partial cds. 


75 


32 


1101 


AAY87302 


Homo sapiens 


Human signal peptide containing 
protein HSPP-79 SEQ ID NO:79. 


61 


35 


1101 


AAY76213 


Homo sapiens 


Human secreted protein encoded 
by gene 90. 


61 


35 


1102 


AAY86234 


Homo sapiens 


Human secreted protein 
HNTNC20, SEQ ID NO: 149. 


88 


31 


1102 


gi5430769 


Arabidopsis 
thaliana 


Similar to somatic embryogenesis 
receptor-like kinase 


88 


32 


1102 


AAB24074 


Homo sapiens 


Human PROl 153 protein sequence 
SEQ ID NO:49. 


79 


22 


1103 


gil3447199 


Homo sapiens 


sphingosine-l-phosphate 
phosphatase mRNA, complete cds. 


1931 


98 


1103 


gi9623190 


Mus musculus 


sphingosine- 1 -phosphate 
phosphohydrolase 


1692 


S3 


1103 


gil5778670 


Mus musculus 


sphingosine- 1 -phosphate 
phosphatase 


1692 


83 


1104 


gi 12052824 


Homo sapiens 


mRNA; cDNA DKFZp564H1562 
(from clone DKFZp564H1562); 
complete cds. 


1544 


100 


1104 


gi5326797 


Homo sapiens 


junctional adhesion molecule 
(JAM1) mRNA, complete cds. 


1544 


100 


1104 


gi5731339 


Homo sapiens 


junctional adhesion molecule- 1 
mRNA, complete cds. 


1544 


100 


1105 


gi296636 


Homo sapiens 


Human apoOII gene for 
preproapolipoprotein OIL 


506 


100 


1105 


gi757915 


Homo sapiens 


Human mRNA for lipoprotein 
apoCII. 


506 


100 


1105 


gi 178 836 


Homo sapiens 


APOC2 gene, complete sequence; 
and apolipoprotein C-II (APOC2) 
gene, complete cds. 


506 


100 


1106 


gil3097159 


Homo sapiens 


tumor protein, translationally- 
controlled 1, clone MGC:5308 
IMAGE:2899964, mRNA, 
complete cds. 


794 


97 


1106 


gi7573519 


Homo sapiens 


TPT1 gene for translationally 
controlled tumor protein (TCTP), 
exons 1-6. 


794 


97 


1106 


gi37496 


Homo sapiens 


Human mRNA for translationally 
controlled tumor protein. 


794 


97 


1107 


gi 12082725 


Mus musculus 


B cell phosphoinositide 3-kinase 
adaptor 


3523 


84 


1107 


gil2082723 


Gallus gallus 


B cell phosphoinositide 3-kinase 
adaptor 






1107 


AAB43816 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1261. 


1257 


98 


1108 


gil0177622 


Arabidopsis 
thaliana 


gene_j d: K6M 13.11— 


201 


39 


1108 

., 


gil 0437414 


Homo sapiens 


cDNA: FLJ21330 As, clone 
COL02466. 


165 


34 

■ 
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SEQ ID 


Hit ID 


Species 


Description 


S score 


% Identity 


1108 


gi499199 


Schizosaccharom 
yces pombe 


uvi22 




jj 


1109 


* ^ A *^ S~ A A f 

gi 13436446 


Homo sapiens 


myosin regulatory ngnt cnain, 
cione ivjlvj \~> . HHyjD 
IMAGE:2906108, mRNA, 

UUIIIJJICIC tUa. 


OO 1 




1109 


gi829623 


Homo sapiens 


Human myosin regulatory light 


881 


99 


1109 


gi!5076511 


Homo sapiens 


MLC-2 mRNA for nonmuscle 


881 


99 


1110 


gi5305502 


Mus museums 


phospholemman precursor 


153 


45 


1110 


gil916012 


Rattus 
norvegicus 


pnospnoiemman cmoriue ond-iinei 




53 


1 1 1 A 

mo 


gil9l6010 


Homo sapiens 


Human pnospnoiemman niionue 
cnauiiGi miviN/v, wumpicLc lus. 




47 


mi 


gi 13272522 


Homo sapiens 


transcription factor NYD-splO 
miviNA, complete t/Ub. 


1344 


90 


mi 


gil4278918 


Homo sapiens 


mRNA for transcription factor 
jvr^v'H'j complete cuo. 


1166 


82 


mi 


gi583352 


synthetic 
construct 


does not include the start ot stop 
codon 


162 


29 


lll2 


AAB47296 


Homo sapiens 


PRO4401 polypeptide. 


1062 


58 


ll 12 

* 


AAY22496 


Homo sapiens 


Human secretea protein sequence 
clone cn621 8. 


1UOZ 


JO 


1112 


gi 1404244 1 


Homo sapiens 


cJJJNA rLJ 14/24 IIS, Clone 
JN 1 ZxCr^^UUl / 10. 


Ann 


4^ 


1113 


gi 1534 1863 


Homo sapiens 


Similar to RIKEN cDNA 
zyuuujZHZi gene, cione 
MGC:21625 IMAGE :42 14683, 
mRNA, complete cds. 


758 


98 


1113 


AAY33297 


Homo sapiens 


Human niemDrane bpanrung 
protein MSP-4. 


/ *J o 


98 


1113 


AAB61 149 


Homo sapiens 


Untvi/in "\T/^~\"\ T "1 Q tomtom 

jriuman inu v i o proiem. 


7S8 

/ o 




1114 


gill 125139 


Homo sapiens 


Novel human gene mapping to 
cnomosome zz. 


476 


89 


1114 


AAY94914 


Homo sapiens 


Human secreted protein clone 
pwjj / o proiem sequence o 11^ 
NO:34. 


476 


89 


1114 


gi602584 


Methanosarcina 
mazei 


cytoenrome o 






1115 


AAG72267 


Homo sapiens 


Human onactory receptor 
polypeptide, SEQ ID NO: 1948. 


1981 


100 


11 15 


AAu /24U / 


Homo sapiens 


Human wiv-iikc poiypepLiae qucxy 
sequence, SEQ ID NO: 2088. 


1 981 


100 


1115 


AAG72270 


Homo sapiens 


Human oiiactory receptor 
polypeptide, SEQ ID NO: 1951. 


Q07 




1116 


gil2002782 


Homo sapiens 


olfactory receptor-like protein 
JCG2 (JCG2) mRNA, partial cds. 


1538 


100 


1116 


gil2002784 


Homo sapiens 


olfactory receptor-like protein 
JCG2 f JCG2^ eene comolete cds 


1538 


100 


1116 


AAE04555 


Homo sapiens 


Human G-protein coupled 
receptor- 11 (GCREC-11) protein. 


1538 


100 


1117 


gi5802817 


Homo sapiens 


endogenous retrovirus HERV- 
K104 long terminal repeat, 
complete sequence; and Gag 
protein (gag) and envelope protein 
(env) genes, complete cds. 


479 


77 
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Hit ID 
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Description 
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% Identity 


1117 


gi 1469243 


Human 
endogenous 
retrovirus K 


pol/env 


4oo 


in 


1117 


gi3150438 


Human 
endogenous 
retrovirus K 


pol-env 


466 


11 


1118 


AAG89341 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 461. 


501 


99 


1118 


gi6651037 


Mus musculus 
domesticus 


similar to RNA binding protein 


411 


96 


1118 


AAG02095 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6176. 


167 


55 


1119 


AAB20155 


TT • 

Homo sapiens 


Secreted protein JSiiCr i . 




j i 


1119 


gi3080663 


Homo sapiens 


PAC clone RP5-1 168D1 1 from 
7p21-p22, complete sequence. 


1408 


47 


1119 


gi2897863 


T T ' 

Homo sapiens 


BAC clone GS1-164B5 from 
7p21-p22, complete sequence. 


134U 




1120 


gi32329 


Homo sapiens 


Human HMG-17 gene for non- 
histone chromosomal protein 
HMG-17. 


429 


94 


1120 


gi306864 


Homo sapiens 


Human non-histone chromosomal 
protein HMG-17 mRNA, complete 
cds. 


429 


94 


1120 


AAB28199 


Homo sapiens 


Human HMG-17 non histone 
chromosomal protein. 


429 


94 


1121 


gi 13905022 


Homo sapiens 


Similar to interferon induced 
transmembrane protein 3 (1-8U), 
clone MGQ5225 
IMAGE:2986145, mRNA, 
complete cds. 


444 


69 


1121 


gil4250038 


Homo sapiens 


Similar to interferon induced 
transmembrane protein 3 (1-8U), 
clone MGC: 14565 

Till /TP? A f\1 F A CO „]"*XT A 

IMAGE:4075453, mRNA, 
complete cds. 


436 


68 


1121 


gi23398 


Homo sapiens 


Human 1-8U gene from interferon- 
mauciDle gene iamiiy. 




of 


1122 


gi70 19933 


Homo sapiens 


cDNA FLJ20071 fis, clone 

Ls\JL,V loo /. 


2163 


100 


1122 


AAB36618 


Homo sapiens 


Human FLEXHT-40 protein 
sequence olic^ 1JJ JNU:4U. 


1051 


100 


1122 


AAW88957 


Homo sapiens 


.Polypeptide fragment encoded Dy 
gene 128. 






1123 


AAB60112 


Homo sapiens 


rluman transport protein i rr i -dz. 


1 1~> 


i nn 


1123 


gil 1558029 


Homo sapiens 


ooct gene tor organic cation 
transporter. 




4-0 


H23 


gi9663117 


Homo sapiens 


mKiNA tor organic cation 
transporter. 


DOZ 


4o 


H24 


AAR28120 


Homo sapiens 


NKG2 transmembrane protein-D. 


IZ J 




1 1 O/l 


gizyoUooo 


riomo sapiens 


in ivvjrz,i_v gene, uauiio ana 

joined mRNA and CDS. 


/ £H 




1124 


gi35063 


Homo sapiens 


Human mRNA for NKG2-D gene. 


724 


94 


1125 


gi7767239 


Homo sapiens 


nectin-like protein 2 (NECL2) 
mRNA, complete cds. 


612 


39 


1125 


gi45 19602 


Homo sapiens 


IGSF4 gene, exon 10 and complete 
cds. 


609 


38 


1125 


AAY45092 


Homo sapiens 


Human lymphoid derived dendritic 


609 


38 
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Table 2A 
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SEQ ID 


Hit ID 


Species 


uescription 


a score 


/o lueuuiy 








cell adhesion molecule. 






1126 


gi7020365 


Homo sapiens 


cDNA FLJ20336 fis, clone 

T_TT7 D 1 1 

rlb.r 1 I /zz. 


4316 


99 


1126 


gilG435830 


Homo sapiens 


cDNA FLJ13727 fis, clone 
PLACE3000103. 


3079 


99 


1126 


AAB94738 


TT * 

Homo sapiens 


tt _ _ , _ • nr?A ta 

Human protein sequence SEQ ID 
N(J:1j //o. 


"5 ATA 

3079 


yy 


1127 


AAB75594 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 3 / bbQ ID 
NO:148. 


678 


99 


1127 


AAB75542 


Homo sapiens 


Human secreted protein sequence 
encoaea by gene J / iu 
NO:96. 




1 Afl 


1 1 at 
1127 


gil 56401 1 


Homo sapiens 


mKJNA tor oxir^o-i, complete cas. 


ZD 1 


*r ^ 


1128 


gi7020372 


Homo sapiens 


cDNA FLJ20340 fis, clone 

JtxfcSr IjLd /4. 


1692 


99 


1 1 oo 

1128 




rro c n 1 or otnr ix 
hollandica 


oytivi 




D I 


1128 


gi324932 


Influenza A virus 


PA polymerase 


67 


38 


1129 


gi7023403 


Homo sapiens 


cDNA FLJ11006 fis, clone 

T>T A AD1AA1A/IC 


499 


59 


1129 


AAB93412 


Homo sapiens 


Human protein sequence SEQ ID 
jNU:12olo. 


499 


59 


1129 


gi 135429 19 


Mus mus cuius 


Similar to mucolipin 1 


432 


61 


1130 


gil 5488920 


Homo sapiens 


r*i • *1 ±. t» tt>"t - jxt r\\T A 

Similar to RIKEN cDNA 
2010107G23 gene, clone 
MOU:y5yo lMACjxi:3oyoojo, 
mRNA, complete cds. 


1 AT 

107 


/to 


1130 


AAW74777 


T T _ • „ 

Homo sapiens 


Human secreted protein encoded 
by gene 48 clone HFCAI74. 




/in 


1130 


gi 13 04441 


Pseudorabies 
virus 


Rsp4U 






1131 


gilOl 19918 


Homo sapiens 


brain otoferlin short isoform 
(OlOr) mRNA, complete cas. 


1315 


49 


1131 


gilOl 19916 


Homo sapiens 


brain otoferlin long isoform 
(OTOF) mRNA, complete cas. 


1315 


49 


1131 


gi4588470 


Homo sapiens 


otoferlin (OTOF) mRNA, 
complete cds. 


2214 


43 


1132 


gil 006665 


Homo sapiens 


H. sapiens mRNA for transcript 
associated with monocyte to 
macrophage differentiation. 


442 


98 


1132 


gllDl5Do9o 


Agrobacterium 
tumefaciens 


AUR_C_ 1 o D 3p 




j i 


1132 


w< 1 XTAAQ OCA 

gllDOzioDO 


Clostridium 
acetobutylicum 


rredictea membrane protein, 
hemolysin III homolog 


1 1 n 


AA 


H33 


AA07 1 803 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1484. 




yz 


1133 


A A /""»T1 OAC 

AA(jr71o05 


T T • „ 

Homo sapiens 


Human olfactory receptor 

nolvnentide SPO TH NO- I486 




yz 


1133 


AAG71807 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1488. 


1178 


70 


1134 


AAY70455 


Homo sapiens 


Human membrane channel protein- 
5 (MECHP-5). 


609 


91 


1134 


AAV83992_ 
aal 


Homo sapiens 


Nucleic acid encoding a protein 
with water channel activity. 


608 


92 


1134 


gi23 17274 


Homo sapiens 


mRNA for aquaporin adipose, 


608 


92 
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SEQ ID 


WW* A TW 

Hit ID 


Species 


Description 


S score 


1 % Identity 








complete cds. 






1135 


gi33 19326 


Homo sapiens 


protein associated with Myc 
mRNA, complete cds. 


111 


33 


1136 


gi2463632 


Homo sapiens 


monocarboxylate transporter 
homologue MCT6 mRNA, 
complete cds. 


2574 


97 


1136 


gil0880482 


Mus musculus 


monocarboxylate transporter 4 


393 


39 


1136 


gi2463634 


Homo sapiens 


monocarboxylate transporter 
(MCT3) mRNA, complete cds. 


394 


40 


1137 


gil 3528675 


Homo sapiens 


ATPase, H+ transporting, 
lysosomal (vacuolar proton pump) 
16kD, clone MGC:3723 
IMAGE:36 18755, mRNA, 
complete cds. 


705 


94 


1137 


gil 3938484 


Homo sapiens 


ATPase, H4- transporting, 
lysosomal (vacuolar proton pump) 
16kD, clone MGC: 16271 
IMAGE:3831016, mRNA, 
complete cds. 


705 


94 


1137 


gil4043553 


Homo sapiens 


ATPase, H+ transporting, 
lysosomal (vacuolar proton pump) 
16kD, clone MGC: 12873 
IMAGE:4127653, mRNA, 
complete cds. 


705 


94 


1138 


gil 5 0803 14 


Homo sapiens 


Similar to RIKEN cDNA 
0610010D20 gene, clone 
MGQ20590 IMAGE :43 10241, 
mRNA, complete cds. 


514 


100 


1138 


gi 105 80053 


Halobacterium 
sp. NRC-1 


dihydrodipicolinate synthase; 
DapA 


379 


33 


1138 


gil 5 90977 


Methanococcus 
jannaschii 


dihydrodipicolinate synthase 
(dapA) 


336 


29 


1139 


AAE06614 


Homo sapiens 


Human protein having 
hydrophobic domain, HP03974. 


1394 j 


100 


1139 


gi520469 


Oryctolagus 
cumculus 


597 aa protein related to 
Na/glucose cotransporters 


1231 


85 


1139 


gi338055 


Homo sapiens 


Human Na+/glucose cotransporter 
1 mRNA, complete cds. 


705 


57 


1140 


gi6708478 


Mus musculus 


fbrmin-like protein 


1571 


66 


1140 


gi4101720 


Mus musculus 


lymphocyte specific formin related 
protein 


1543 


65 


1140 


gil914849 


Mus musculus 


WW domain binding protein 3; 
WBP3 


299 


54 


1142 


gil2052738 


Homo sapiens 


mRNA; cDNA DKFZp564H1322 
(from clone DKFZp564H1322); 
complete cds. 


1755 


96 


1142 


gil 0434632 


Homo sapiens 


cDNA FLJ12886 fis, clone 
NT2RP2004041, weakly similar to 
SYNAPSINS I A AND IB. 


1755 


96 


11/10 




riomo sapiens 


Human protein sequence SEQ ID 
NO:14883. 


1755 


96 


1143 


AAW54370 


Homo sapiens 


G-protein coupled receptor 
HLTEX11. 


1815 


100 


1143 


AAB64854 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 36 SEQ ID 
NO: 140. 


1792 


100 


1143 


AAW70504 


Homo sapiens 


Leukocyte seven times membrane- 


821 1 


46 1 
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Hit ID 


Species 


Description 


S score 


% Identity 








penetrating type receptor protein 
JEG18. 






1144 


gil5278128 


Mus musculus 


chemokine-like factor 2 variant 2 


412 


49 


1144 


AAB51648 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 29 SEQ ID 
NO:88. 


410 


100 


1144 


AAE03929 


Homo sapiens 


Human gene 32 encoded secreted 
protein HTLIF12, SEQ ID NO: 92. 


410 


100 


1146 


gil3477335 


Homo sapiens 


vitamin A responsive; 

1 1 i 1 i 1 1 

cytoskeleton related, clone 
MGC:1917 IMAGE:35 10436, 
mRNA, complete cds. 


777 


95 


1146 


gi3746652 


Homo sapiens 


JWA protein mRNA, complete 
cds. 


111 


95 


1146 


gi6563260 


Homo sapiens 


jmx protein mRNA, complete cds. 


111 


95 


1147 


gi2970431 


Florometra 
serratissima 


"V T A Y^X X 1 1 1 1 * t A 

NADH dehydrogenase subunit 4 


94 


31 


1147 


gil 5042530 


Chilo iridescent 
virus 


A ^ ~W~ 

450L 


70 


24 


1147 


AAY87197 


Homo sapiens 


T X it » * 

Human secreted protein sequence 
SEQ ID NO:236. 


90 


27 


1148 


AAB93562 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12957. 


2402 


100 


1148 


gi7023538 


X X * 

Homo sapiens 


cDNA FLJ11091 fis, clone 
PLACE1005313. 


860 


100 


1148 


AAB93489 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12790. 


C\ S~ aT\ 

860 


100 


1150 


gil 043 8431 


Homo sapiens 


cDNA: FLJ221 55 fis, clone 
HRC00205. 


1995 


100 


1150 


gil 0437336 


Homo sapiens 


cDNA: FLJ21267 fis, clone 
COLO 17 17. 


1776 


99 


1150 


gi7020065 


Homo sapiens 


cDNA FLJ20152 fis, clone 
COL08515. 


705 


100 


1151 


gil 2654 159 


Homo sapiens 


interferon induced transmembrane 
protein 1 (9-27), clone MGC:5195 
IMAGE:3464598, mRNA, 
complete cds. 


569 


93 


1151 


gil 177476 


Homo sapiens 


H.sapiens mRNA for mterferon- 
induced 17kDa membrane protein. 


569 


93 


1151 


gil77802 


Homo sapiens 


XX « . • f •11 

Human interferon-inducible 
protein 9-27 mRNA, complete cds. 


563 


92 


1152 


AAG72230 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1911. 


1615 


100 


1152 


AAG72382 


XT * 

Homo sapiens 


XX J /™v X"% "1 # 1 1 * • 1 

Human OR-hke polypeptide query 
sequence, SEQ ID NO: 2063. 


1615 


100 


1152 


gil5293613 


Homo sapiens 


clone OR5C1 olfactory receptor 
gene, partial cds. 


1097 


100 


1153 


gi784997 


Homo sapiens 


H.sapiens mRNA for tumour 
suppressor protein, HUGL. 


5025 


95 


1153 


gil 944491 


Homo sapiens 


Human LLGL mRNA, complete 
cds. 


4797 


91 


1153 


gi854124 


Homo sapiens 


H.sapiens mRNA for human giant 
larvae homolog. 


2837 


58 


1154 


AAB95830 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18850. 


219 


72 


1154 


gi7959889 


Homo sapiens 


PR02221 


137 


49 
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% Identity 


1154 


gi2072969 


Homo sapiens 


Human LI element LI. 24 p40 
gene, complete cds. 


133 


/I o 

48 


1155 


gil 5277644 


T T — 

Homo sapiens 


amino acid transporter (bLO7Al0) 
gene, exon 1 1 and complete cds. 


24o / 


1 (\C\ 

1UU 


1155 


gi9309293 


Homo sapiens 


hasc-1 mRNA for asc-type amino 
acid transporter 1, complete cds. 


2487 


100 


1155 


gi7415938 


Mus musculus 


ascl 


2329 


91 


1156 


gi6760373 


Homo sapiens 


ODZ3 (ODZ3) mRNA, partial cds. 


2323 


100 


1156 


gi4760780 


Mus musculus 


Ten-m3 


2248 


96 


1156 


gi60 10049 


Gallus gallus 


teneurin-2 protein 


878 


62 


1157 


gil4286298 


Homo sapiens 


clone MGC:3593 
IMAGE:2963628 5 mRNA, 
complete cds. 


630 


99 


1157 


gi4877285 


Homo sapiens 


mRNA for prenylated Rab 
acceptor 1. 


630 


99 


1157 


gi6563192 


Homo sapiens 


prenylated rab acceptor 1 mRNA, 
complete cds. 


630 


99 


1158 


gil 780976 


Human 
endogenous 
retrovirus K 


protease 


915 


58 


1158 


gi5802824 


Homo sapiens 


endogenous retrovirus HERV- 
K109, complete sequence. 


909 


59 


1158 


gi9558703 


Homo sapiens 


tandemly repeated human 
endogenous retrovirus HERV-K 
(HML-2.HOM), complete 
sequence. 


905 


59 


1159 


gil3111941 


Homo sapiens 


vesicle-associated soluble NSF 
attachment protein receptor (v- 
SNARE; homolog of S. cerevisiae 
VTI1), clone MGC:3767 
IMAGE: 295 8320, mRNA, 
complete cds. 


804 


91 


1159 


gi3861488 


Homo sapiens 


vesicle soluble NSF attachment 
protein receptor VTI2 mRNA, 
complete cds. 


804 


91 


1159 


AAY73339 


Homo sapiens 


HTRM clone 2056042 protein 
sequence. 


804 


91 


1160 


gil 922891 


Mus musculus 


alpha 3B chain of laminm-5 


10355 


75 


1160 


gi57775Sl 


Homo sapiens 


H. sapiens mRNA for laminin-5, 
alpha3b chain. 


9398 


99 


1160 


gi551597 


Homo sapiens 


laminin-related protein (LamA3) 
mRNA, complete cds. 


8690 


100 


1161 


AAD05056__ 
aal 


Homo sapiens 


HUMA- Human secreted protein- 
encoding gene 4 cDNA clone 
HKAAV61, SEQ ID NO:14. 


1524 


83 


1161 


AAE01167 


Homo sapiens 


HUMA- Human gene 4 encoded 
secreted protein HKAAV61, SEQ 
ID NO:68. 


1523 


82 


1161 


AAG67516 


1**1* * 

Homo sapiens 


SMIK Amino acid sequence of a 
human secreted polypeptide. 


1523 


82 


1162 


AAM42034 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6965. 


900 


99 


1162 


AAM40248 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 3393. 


821 


100 


1162 


gi491284 


synthetic 
construct 


IFN-pseudo-omega 2 


800 


98 


1163 


AAI70234 


Homo sapiens 


AMGE- Human interleukin-1 


819 


100 
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aal 




receptor antagonist related protein 
cDNA. 






1163 


AAD11148_ 
aal 


Homo sapiens 


IMMV Human FIL- 1 theta full- 
length DNA. 


819 


100 


1163 


AAA89175_ 
aal 


Homo sapiens 


HYSE- Human interleukin- 1 Hy2 
cDNA. 


819 


100 


1164 


gi340215 


Homo sapiens 


Human vascular endothelial 
growth factor gene, exon 8. 


1056 


97 


1164 


gi340301 


Homo sapiens 


Human vascular permeability 
factor mRNA, complete cds. 


1056 


97 


1164 


AAR91077 


Homo sapiens 


PRIZ- Human vascular endothelial 
growth factor-189, VEGF-189. 


1056 


97 


1166 


gil321816 


Gorilla gorilla 


interleukin- 8 receptor type B 


602 


90 


1166 


gi 1109691 


Homo sapiens 


Human interleukin- 8 receptor type 
B (IL8RB) mRNA, complete cds. 


599 


88 


1166 


gil86378 


Homo sapiens 


Human interleuken 8 receptor B 
mRNA, complete cds. 


599 


88 


1167 


gi 1160967 


Homo sapiens 


palmitoyl-protein thioesterase 
gene, complete cds. 


1285 


100 


1167 


gil3 14355 


Homo sapiens 


Human palmitoyl protein 
thioesterase mRNA, complete cds. 


1285 


100 


1167 


gil4250054 


Homo sapiens 


palmitoyl-protein thioesterase 1 
(ceroid-lipofuscinosis, neuronal 1, 
infantile), clone MGC: 14590 
IMAGE:4249991, mRNA, 
complete cds. 


1285 


100 


1168 


gil77814 


Homo sapiens 


Human alpha- 1-antitrypsin-r elated 
protein gene, exons 3, 4 and 5. 


1956 


90 


1168 


AAP50132 


Homo sapiens 


ZYMO- Sequence of the 
predominant form of human alpha- 
1 -antitrypsin(AT) . 


1009 


72 


1168 


gil5990507 


Homo sapiens 


Similar to serine (or cysteine) 
proteinase inhibitor, clade A 
(alpha- 1 antiproteinase, 
antitrypsin), member 1, clone 
MGC:23330 IMAGE:4644658, 
mRNA, complete cds. 


1450 


69 


1169 


AA012931 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 26823. 


346 


100 


1169 


AAO02697 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 16589. 


143 


66 


1169 


AAO0S307 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 22199. 


137 


80 


1170 


AAR15222 


Homo sapiens 


TEXA Chronic myelogenous 
leukaemia-derived myeloid-related 
protein. 


635 


100 


1170 


gil81527 


Homo sapiens 


Human neutrophil peptide 
(defensin) 1 mRNA, complete cds. 


493 


100 


1170 


gil81529 


Homo sapiens 


Human defensin 1 protein mRNA, 
complete cds. 


493 


100 


1171 


gi 100 1697 


Synechocystis 
sp. PCC 6803 


sensory transduction histidine 
Idnase 


67 


37 


1171 


gi2353225 


Acromyrmex 
octospinosus 


cytochrome oxidase I 


54 


40 


1171 


AAG02950 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7031. 


54 


32 


1172 


gi4884282 


Homo sapiens 


mRNA; cDNA DKFZp566K0524 


1998 


99 



I 
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(from clone DKFZp566K0524); 
partial cds. 






1172 


gi2665458 


Mus musculus 


protein-tyrosine-phosphatase 


1363 


63 


1172 


gi452194 


Homo sapiens 


Human mRNA for protein tyrosine 
phosphatase (PTP-BAS, type 3), 
complete cds. 


618 

- 


48 


1173 


AAY48226 


Homo sapiens 


MET A- Human prostate cancer- 
associated protein 12. 


956 


96 


1173 


AAM25683 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO: 1198. 


956 


96 


1173 


AAY99342 


Homo sapiens 


GETH Human PRO 1018 
(UNQ501) amino acid sequence 
SEQ ID NO: 8. 


950 


94 


1174 


gil78018 


Homo sapiens 


Human activation (Act-2) mRNA, 
complete cds. 


125 


84 


1174 


gi34218 


Homo sapiens 


Human LAG-1 mRNA. 


125 


84 


1174 


gi533213 


Homo sapiens 


secreted T cell protein (H400; SIS- 
gamma) mRNA, complete cds. 


125 


84 


1175 


gil78018 


Homo sapiens 


Human activation (Act-2) mRNA, 
complete cds. 


123 


92 


1175 


gi34218 


Homo sapiens 


Human LAG-1 mRNA. 


123 


92 


1175 


gi533213 


Homo sapiens 


secreted T cell protein (H400; SIS- 
gamma) mRNA, complete cds. 


123 


92 


1176 


AAG03315 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7396. 


314 


100 


1176 


gi965069 


Serratia 
marcescens 


phage-holin analog protein 


71 


40 


1176 


gil6415877 


Octopus salutii 


cytochrome oxidase subunit III 


74 


28 


1177 


gi!78836 


Homo sapiens 


APOC2 gene, complete sequence; 
and apolipoprotein C-II (APOC2) 
gene, complete cds. 


453 


89 


1177 


gi296636 


Homo sapiens 


Human apoC-II gene for 
preproapolipoprotein C-II. 


453 


89 


1177 


gi757915 


Homo sapiens 


Human mRNA for lipoprotein 
apoCII. 


453 


89 


1178 


AAO07986 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 21878. 


77 


34 


1179 


AAB60502 


Homo sapiens 


INCY- Human cell cycle and 
proliferation protein CCYPR-50, 
SEQ ID NO:50. 


1205 


100 


1179 


AAB12144 


Homo sapiens 


PROT- Hydrophobic domain 
protein isolated from WERI-RB 
cells. 


1205 


100 


1179 


AAG81333 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO: 184. 


687 


99 


1180 


AAW67880 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 74 clone 
HNTAC73. 


378 


97 


1180 


gi9949887 


Pseudomonas 
aeruginosa 


probable transcriptional regulator 


65 


41 


1180 


gi3 130050 


Schizosaccharom 
yces pombe 


haloacid dehalogenase-like 
hydrolase 


62 


32 


1181 


AAG01183 


Homo sapiens j 


GEST Human secreted protein, 
SEQ ID NO: 5264. 


278 


94 


1181 


AAO00621 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 14513. 


84 


47 
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1 1 81 


A A AH^ lOO 

AAOUzlaa 


Homo sapiens 


HYbii- Human polypeptide oisi^ 
ID NO 16080. 


HQ. 
/o 


Aft 


1182 


ABB 12063 


Homo sapiens 


HYSE- Human secreted protein 
homologue, SEQ ID NO:2433. 


326 


100 


1182 


AAE06730 


TT — • 

Homo sapiens 


SM1K Human CASH /oD protein. 


J.UU 


1 nn 


1182 


AAW75098 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 4/ clone 
HSXBI25. 


143 


81 


1183 


gi 13278924 


Homo sapiens 


neural proliferation, differentiation 
ana control, 1, clone M(jrC43>y / 
IMAGE:3347743, mRNA, 
complete cas. 


748 


98 


1183 


gi85 15886 


Homo sapiens 


NPDCl-like protein (NPDC1) 
mRNA, complete cds. 


748 


98 


1183 


AAB43904 


Homo sapiens 


HUMA- Human cancer associated 
protein sequence oisq iu 
NO: 1349. 


748 


98 


1 1 O A 

1184 


gil3128925 


Homo sapiens 


UL-orZ protein mKJNA, complete 
cds. 




yv 


1 1 O A 

1184 


gu453U663 


Homo sapiens 


mKJNA tor ALL^AiN-aipna, 
complete cds. 




OA 

y\j 


1184 


AAY15238 


Homo sapiens 


IMMV ULBP-2 amino acid 
sequence. 


1025 


90 


1185 


• a 1 £1 A 1 1 A 

gi4164134 


. 

Homo sapiens 


cosmia clone u loyuz rrom 
Xp22. 1-22.2, complete sequence. 


/o 




1185 


AAU22866 


Homo sapiens 


HUMA- Human prostate cancer 
antigen, oeq 1JJ No 3oD. 


60 


35 


1185 


AAM96178 


Homo sapiens 


HUMA- Human reproductive 
system leiatea antigen oxiv^ iu 
NO: 4836. 


60 


35 


1 loo 


gl/ H\)ZZ5 


Homo sapiens 


rKUi / 14 






1186 


ABB15615 


Homo sapiens 


HUMA- Human nervous system 
related polypeptide SEQ ID NO 
4Z/Z. 


63 


48 


1186 


AAO07531 


Homo sapiens 


HYSE- Human polypeptide SEQ 

rr\ \ja ti Ann 


67 


46 


1187 


AAE05962 


Homo sapiens < 


HYSE- Human phospholipase- 
related protein. 


2521 


99 


1187 


gi3811347 


Homo sapiens 


cytosolic phospholipase A2 beta 
(crl^Az oetaj mKJNA, complete 

COS. 


1209 


44 


1187 


gi4886978 


Homo sapiens 


cytosolic phospholipase A2 beta 
mRNA, complete cds. 


1209 


44 


1188 


A A AA1 ni O 

AAO01938 


Homo sapiens 


rli oxi- Human polypeptide biii^ 
ID NO 15830. 


yj 


DO 


1 1 o o 

1188 


gi2992470 


JVlus sp. 


mitocnonariai capsule 
selenoprotein; MCS 


yj 


D 1 


t 1 o o 

1 1 88 


gil4717800 


Mus musculus 


seleno-protein 




1 1 


1189 


gil 86600 


Homo sapiens 


Human inter-alpha-trypsin 

liiniDiior ngnx cruxm ^111^ gciic, 
exons 7-10. 


1461 


93 


1189 


gi24479 


Homo sapiens 


Human mRNA for alpha- 1- 
microglobulin and HI-30. 


1461 


93 


1189 


gi32047 


Homo sapiens 


Human mRNA for protein HC 
(alpha- 1 -microgl obulin) . 


1461 


93 


1190 


gi673422 


Homo sapiens 


H.sapiens mRNA fragment for T- 
cell receptor alpha chain. 


1322 


93 
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1190 


AAU69943 


Homo sapiens 


CORI- Human T cell receptor Va 
chain of clone 4E5 for prostate 
protein P501S, 


1097 


77 


1190 


AAM01298 


Homo sapiens 


CORI- P501S-specific T cell clone 
4E5 Va chain T cell receptor 
amino acid. 


1097 


77 


1191 


gi673422 


Homo sapiens 


H.sapiens mRNA fragment for T- 
cell receptor alpha chain. 


859 


95 


1191 


gi623119 


Macaca mulatta 


T-cell receptor alpha 


605 


86 


1191 


AAU69943 


Homo sapiens 


CORI- Human T cell receptor Va 
chain of clone 4E5 for prostate 
protein P501S. 


594 


65 


1192 


gil3097588 


Homo sapiens 


clone MGC: 10745 
IMAGE:2820343, mRNA, 
complete cds. 


201 


100 


1192 


gi53861 


Mus musculus 


Q300 protein (AA 1-77) 


72 


38 


1192 


AAO02105 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 15997. 


50 


56 


1193 


AAB08894 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 4 SEQ 
ID NO:51. 


208 


57 


1193 


gil5088679 


Homo sapiens 


cysteine and tyrosine-rich protein 1 
(CYYR1) mRNA, complete cds. 


59 


28 


1193 


AAY87233 


Homo sapiens 


INCY- Human signal peptide 
containing protein HSPP-10 SEQ 
ID NO: 10. 


59 


28 


1194 


AAG03963 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 8044. 


417 


80 

• 


1194 


ABB 10168 


Homo sapiens 


HUMA- Human cDNA SEQ ID 
NO: 476. 


289 


100 


1194 


ABB10412 


Homo sapiens 


HUMA- Human cDNA SEQ ID 
NO: 720. 


289 


100 


1195 


gi9758052 


Arabidopsis 
thaliana 




64 


46 


1195 


gi6815537 


Human 

immunodeficienc 
y virus type 1 


gag protein 


47 


62 


1195 


gil4269033 


Sus scrofa 


interferon beta- 1 


42 


47 


1196 


gi7582276 


Homo sapiens 


BM-002 


328 


100 


1196 


AAM78626 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1288. 


328 


100 


1196 


AAM79610 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3256. 


328 


100 


1197 


gill81885 


Mus musculus 


patched 


209 


62 


1197 


AAV21590_ 
aal 


Homo sapiens 


STRD Human patched (ptc) 
protein encoding cDNA. 


215 


63 


1197 


gil335864 


Homo sapiens 


Human patched homolog (PTC) 
mRNA, complete cds. 


214 


63 


1198 


AAB25674 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 10 SEQ 
ID NO:63. 


646 


84 


1198 


gil4603247 


Homo sapiens 


Similar to RIKEN cDNA 
5730409G15 gene, clone 
MGC: 19636 IMAGE:2822323, 
mRNA, complete cds. 


420 


94 


1198 


AAB36613 


Homo sapiens 


INCY- Human FLEXHT-35 
protein sequence SEQ IDNO:35. 


420 


94 
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1200 


AAU12292 


Homo sapiens 


GETH Human PRO6027 
poiypeptioe sequence. 


990 


98 


1200 


AAU27673 


Homo sapiens 


ZYMO Human protein 
AFP235412. 


987 


99 


1200 


gil347712l 


Homo sapiens 


clone IMA CxE: 3636082, mKJNA, 
partial cds. 


on i 

zy i 


y!> 


1201 


AAB43928 


Homo sapiens 


HUMA- Human cancer associatea 
protein sequence SEQ ID 
NO:1373. 


ZlO 


jo 


1201 


gil3325337 


Homo sapiens 


clone MCjC:1(J520 
IMAGE:3938462, mRNA, 
complete cds. 


ziy 


A Q 


1201 


AAB21040 


Homo sapiens 


INCY- Human nucleic acid- 
binding protein, NuABP-44. 


ziy 




1202 


AAB43928 


Homo sapiens 


HUMA- Human cancer associated 
protein sequence SEQ ID 
NO:l373. 


223 


55 


1202 


gil 6550327 


Homo sapiens 


cDNA rLJ3IU(Jj lis, clone 
HLUNG2000068, weakly similar 
to ZINC FINGER PROTEIN 157. 


ZID 


An 
o / 


1202 


gi 16552980 


Homo sapiens 


CJJJNA rLJJZ/oo lis, Clone 

i i rzvu i o / y, weaKiy similar to 

z»ii> r 11N vjrjQiv rss.\J i niiN ij / , 


Z1D 


O/ 


1ZU3 


gi4Jzzyjo 


riomo sapiens 


xix'JS7vji^iv-iiKe Kinase rnjrjv 


1 90 


O J 


1203 


gi4262362 


Caenorhabditis 
elegans 


alternatively spliced 
serine/threonine protein kinase 

A/TTrj 1 ^ 
IVIIO- 1 D 


119 


81 


1203 


AAB50058 


Homo sapiens 


IMMV SS-4694. 


117 


81 


1204 


gil 754714 


Xenopus laevis 


oviductin 


378 


40 


1 OA/1 

1204 


gil5277254 


Buio japomcus 


oviductin 


JJ 1 


oo 


1204 


gi2981641 


Xenopus laevis 


polyprotein 


339 


46 


1205 


gil5214578 


Homo sapiens 


clone JVKjC:9135 
IMAGE:3865141, mRNA, 
complete cds. 


/i ^ 1 


/o 


1205 


AAW67901 


Homo sapiens 


HUMA- Human secreted protein 
encoaea oy gene jo cione 
HODCL36. 


451 


76 


1205 


gi 1946205 


Homo sapiens 


H. sapiens mKJNA tor 
protein. 




13 


1206 


AAG01971 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6052. 


314 


100 


1 O A^ 

1206 


gi420U34U 


.Lycopersicon 
esculentum 


royD protein 






1206 


gi3 183989 


Lycopersicon 
esculentum 


P69E protein 


82 


43 


1207 


gil4043211 


Homo sapiens 


Similar to RIKEN cDNA 
493 1428F04 gene, cione 
IMAGE:3346497, mRNA, partial 
cas. 


878 


83 


1207 


gi97 11595 


Xanthomonas 
oryzae pv. 
oryzae 


HpaA 


71 


24 


1207 


AAO 10768 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 24660. 


72 


34 


1208 


AAY91512 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 62 SEQ 


606 


100 
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ID NO: 185. 






1208 


AAY91653 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 62 SEQ 
ID NO:326. 


606 


100 


1208 


AAY71277 


Homo sapiens 


ZYMO Human Zlipo3 protein. 


606 


100 


1209 


AAY27648 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 82. 


322 


98 


1209 


gi7959897 


Homo sapiens 


PR02379 


72 


39 


1209 


AAO03791 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 17683. 


61 


68 


1210 


gi 1743 1247 


Ralstonia 
solanacearum 


HYPOTHETICAL PROTEIN 


70 


38 


1211 


AAB08765 


Homo sapiens 


INCY- A human leukocyte and 
blood related protein (LBAP). 


339 


62 


1211 


AAB74718 


Homo sapiens 


INCY- Human membrane 
associated protein MEMAP-24. 


314 


66 


1211 


gi25 87024 


Homo sapiens 


mRNA containing human 
endogenous retrovirus H and 
human endogenous retrovirus E 
sequences. 


259 


67 


1212 


gil0440139 


Homo sapiens 


cDNA: FLJ23447 fis, clone 
HSI03346. 


1339 


95 


1212 


AAY13396 


Homo sapiens 


GETH Amino acid sequence of 
protein PR0332. 


872 


48 


1212 


AAB33425 


Homo sapiens 


GETH Human PR0332 protein 
UNQ293 SEQ ID NO:57. 


872 


48 


1213 


AAG66547 


Homo sapiens 


HYSE- Human secreted 

metallocarboxypeptidase-like 

polypeptide. 


1551 


99 


1213 


AAG66565 


Homo sapiens 


HYSE- Human secreted 
metall ocarboxyp epti dase-like 
variant polypeptide. 


1548 


98 


1213 


AAB74682 


Homo sapiens 


INCY- Human protease and 
protease inhibitor PPIM-15. 


1482 


98 


1214 


gil5528833 


Homo sapiens 


Fc receptor-like protein 2 (FCRH2) 
mRNA, complete cds. 


528 


100 


1214 


AAB 85464 


Homo sapiens 


HYSE- Human immunoglobulin 
domain-containing polypeptide. 


528 


100 


1214 


AAB82317 


Homo sapiens 


UYCO Human immunoglobulin 
receptor IRTA4 protein. 


528 


100 


1215 


AAU27663 


Homo sapiens 


ZYMO Human protein 
AFP285042. 


555 


100 


1215 


AAE06599 


Homo sapiens 


SAGA Human protein having 
hydrophobic domain, HP 10782. 


510 


100 


1215 


gil5558917 


Magnaporthe 
grisea 


similarity to Lec35 protein 


169 


30 


1216 


gil0439008 


Homo sapiens 


cDNA: FLJ22573 fis, clone 
HSI02387. 


682 


99 


1216 


AAM87876 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 15469. 


169 


70 


1216 


gi 161 6747 


Tupaia glis 
belangeri 


GnRH preprohormone second 
form 


68 


33 


1217 


gi 10439008 


Homo sapiens 


cDNA: FLJ22573 fis, clone 
HSI02387. 


529 


92 


1217 


AAM87876 


Homo sapiens 


HUMA- Human 


109 


71 
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immune/haematopoietic antigen 
SEQ ID NO: 15469. 






1217 


AAM87620 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO:15213. 


70 


35 


1 O 1 o 
1215 


AAM6095 1 


T T ' 

Homo sapiens 


» 1AT T? TT 1 . * J 

MOLE- Human brain expressed 
single exon probe encoded protein 
isbQ ID NO: 33U30. 


58 


36 


1218 


AAM73644 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 33950. 


58 


36 


inig 


a a nnn i nn 


Homo sapiens 


HYbE- Human polypeptide SbQ 
ID NO 14001. 


58 


45 




giloy5053 / 


T T * 

Hop mosaic 

■ 

vims 


> ■ * -I 1 • _ • J 1*1* 

cysteine-rich nucleic acid binding 
protein 


41 


47 


1219 


AAY19474 


Homo sapiens 


HUMA- Amino acid sequence of a 
numan secreted protein. 


43 


43 


1219 


AAO03071 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 16963. 


45 


31 


1ZZU 


gll 1 /lDoy 


rlasmoaium 
laiciparum 


frameshift 


76 


37 


1220 


gi4512010 


Escherichia coli 


OrfY 


66 


50 


1220 


gi 1870470 


Mus musculus 


anti-DNA immunoglobulin light 
chain IgG 


46 


37 


1221 


gil3274524 


Homo sapiens 


complement-clq tumor necrosis 
factor-related protem (CTRP7) 
mRNA, complete cds. 


1451 


94 


1221 


AAB50371 


Homo sapiens 


ZYMO Human ZACRP7. 


1451 


94 


1221 


gll 32745 18 


Homo sapiens 


complement-clq tumor necrosis 
factor-related protein (CTRP2) 
mRNA, complete cds. 


r*_ *t 

831 


61 


1 on 
1222 


A A aaq onn 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 17791. 


67 


50 


1222 


A A P'71 /t _C <_ 

AA 073403 


Homo sapiens 


HUMA- Human gene 12-encoded 
secreted protein fragment, SEQ ID 
JNO:240. 


75 


31 


1222 


AAM85406 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 12999. 


60 


40 


122.5 


giooZ>U24D 


Homo sapiens 


activated p21cac4zHs kinase 
^AOivi j mKiNA, complete cds. 


5605 


1 i\r\ 

100 


1223 


gi2921447 


Mus musculus 


non-receptor protein tyrosine 
Kinase ack 


4238 


79 


IZZJ 


glZU / ojOo 


_t>os taurus 


uac4Z-associatea tyrosine kinase 
ACK-2 


34y3 


90 


1 01/1 
1224 


/\/VJc5 040^0 


riomo sapiens 


ZYMO Ammo acid sequence or a 
human zkunlO polypeptide. 


35 o 


35 


1224 


glZ 1 1 OZZ 


Gallus gallus 


alpha-3 collagen type VI 


t An 

308 


33 




A AM420SQ 


jn.vjij.iu odpieiio 


T-XV^QTh _ T-Tl1TTIE.ri t/l-Q^f i /-la Cpn 

ii i ox./- nun Lcin poiypepnoe _j_c.i^ 
ID NO 7020. 


z/o 




1225 


AAB66065 


Homo sapiens 


MILL- Human TANGO 294. 


2113 


99 


1225 


AAB66067 


Homo sapiens 


MILL- Human TANGO 294 
mature protein. 


2015 


99 


1225 


gi434306 


Homo sapiens 


H.sapiens mRNA for lysosomal 
acid lipase. 


1290 


60 


1226 


AAM06483 


Homo sapiens 


HYSE- Human foetal protein, SEQ 


282 


98 
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ID NO: 214. 






1226 


gi3694984 


Pimpinella 
brachycarpa 


metallothionein~l-like protein 


57 


30 


1226 


A AT T'l r» J 1 f 

AAU22415 


Homo sapiens 


HUMA- Human cardiovascular 
system antigen polypeptide SEQ 
ID JNo 1 ioy. 


56 


T7 
2/ 


1227 


gl 15 029634 


Homo sapiens 


Similar to tetranectin 
(plasminogen-binding protein), 
clone Mut: UDyz 
IMAGE:4042921, mRNA, 
complete cos. 


/ZD 


1 A A 
1UU 


1227 


gi37409 


Homo sapiens 


H. sapiens mRNA for tetranectin. 


725 


100 


1227 


gi825722 


Homo sapiens 


H.sapiens tetranectin gene, exon 1 . 


725 


100 




giD /yuzu / 


1 aema saginata 


A 1 .rase suDumt o 


IK) 




1228 


gi3 927873 


Cyanidioschyzon 
rnerolae 


NADH-ubiquinone oxidoreductase 
chain 3 


44 


19 


1228 


gi43 78776 


Pedmomonas 
minor 


NADH dehydrogenase subunit 3 


4/ 


3U 


1229 


AAE01790 


Homo sapiens 


HUMA- Human gene 21 encoded 
secreted protein HDPTW65, SEQ 
IDNO:lll. 


142 


59 


1229 


A A too 

AAE01838 


Homo sapiens 


HUMA- Human gene 21 encoded 
secreted protein rlJJrl WoD, oHQ 
ID NO:159. 


140 


57 


1 01 a 




rlomo sapiens 


ri i oii- Human reverse 
transcriptase homologue, SEQ ID 

■\T/^» ♦ 1 Q A Q 

JNU. lo4y. 




DO 


1230 


AAE04775 


Homo sapiens 


INCY- Human vesicle trafficking 
protein- l o v Jb 1 Kr- 1 o ) protein. 


327 


100 


1230 


gil 1120502 


Homo sapiens 


ERGL mRNA, complete cds. 


327 


100 


1230 


AAW88699 


Homo sapiens 


HUMA- Secreted protein encoded 
by gene 166 clone HCEQA68. 


TOO 


i aa 
1UU 


1231 


AAG00381 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4462. 


ZOO 


A 1 


1231 


AAU 1935 / 


Homo sapiens 


PHAA Human G protein-coupled 
receptor nGPCR-2290. 


nc 
1Z3 


<CA 

DU 


1 T3 1 


A A r\f»QT2 C 


rlomo sapiens 


rl Y oii- Human polypeptide ozsq 
ID NO 23130. 


i no 

iuy 


/D 


1251 


A AH ft AiC.CC 0 

AAJVLUoDDc 


rlomo sapiens 


rl Y iSri- Human toetal protein, JsxsQ 
ID NO: 289. 


1 


OQ 


1232 


gi2648055 


Xenopus laevis 


corticotropin releasing factor 
receptor, type I 


48 


29 


1232 


AAU21458 


Homo sapiens 


HUMA- Human novel foetal 
antigen, oriQ lu fsnj l /u/. 


45 


36 


1233 


AAM06562 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID JNVJ: zyj. 


383 


100 


1233 


AAG72602 


Homo sapiens 


YEDA Human OR-like 

polypeptide query sequence, oEQ 
ID NO* 9983 


65 


42 


1233 


gi7547265 


Canis familiaris 


5-Hydroxytryptamine ID receptor 
subtype beta; 5-HT1D beta 


67 


39 


1234 


AAM92283 


Homo sapiens 


HUMA- Human digestive system 
antigen SEQ ID NO: 1632. 


76 


36 


1234 


AAO09955 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 23847. 


70 


35 


1234 


gi8778849 


Arabidopsis 


T7N9. 1 


69 


42 
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thaliana 








1235 


AAM63797 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 35902. 


48 


34 


1235 


AAM76610 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 36916. 


48 


34 


1235 


ABB 12222 


Homo sapiens 


HYSE- Human secreted protein 
homologue, SEQ ID NO:2592. 


52 


42 


1236 


gi 160822 


Sarcocystis 
muris 


31-kDa major surface antigen 


55 


37 


1236 


gi5305067 


Mus musculus 


immunoglobulin light chain 
variable region 


61 


34 


1236 


AAM60441 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 32546. 


61 


40 


1237 


gi4929790 


Homo sapiens 


angiopoietin-related protein 3 
(ANGPTL3) mRNA, complete 
cds. 


1489 


98 


1237 


AAY05395 


Homo sapiens 


GETH Human TIE ligand NL6 
protein sequence. 


1489 


98 


1237 


AAB 12135 


Homo sapiens 


PROT- Hydrophobic domain 
protein from clone HP 10622 
isolated from Liver cells. 


1489 


98 


1238 


AAM06568 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID NO: 299. 


142 


57 


1238 


gil334599 


Magnaporthe 
grisea 


ubiquinol: cytochrome c 
oxireductase 


48 


42 


1238 


gil 3487283 


Mycosphaerella 
fijiensis 


cytochrome b 


46 


42 


1239 


gil 5930235 


Homo sapiens 


clone IMAGE: 3 846772, mRNA, 
partial cds. 


46 


40 


1239 


gil334235 


Rattus rattus 


MIP protein (261 AA; AA 3 - 263) 


65 


45 


1239 


gil 185419 


Mus musculus 


major intrinsic protein 


65 


45 


1240 


AAM60668 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 32773. 


62 


31 


1240 


AAM73340 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 33646. 


62 


31 


1240 


gi975678 


Albinaria 
caerulea 


start codon=CAT, termination 
codon=TAA 


65 


27 


1241 


AAG03454 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7535. 


40 


26 


1241 


gil 245208 


Danio rerio 


Zgl3 


57 


47 


1241 


AAY19486 


Homo sapiens 


HUMA- Amino acid sequence of a 
human secreted protein. 


33 


41 


1242 


gi3 157920 


Arabidopsis 
thaliana 


F12F1.6 


46 


37 


1242 


AAY48414 


Homo sapiens 


META- Human prostate cancer- 
associated protein 111. 


36 


46 


1242 


gi927722 


Saccharomyces 
cerevisiae 


Erdlp; YDR414C; CAI: 0.11 


61 


44 


1243 


gi9657469 


Vibrio cholerae 


soxR protein 


39 


46 


1243 


gi3493510 


Mus musculus 


Ymp 


47 


43 


1243 


gi2358254 


Mus musculus 


HNMP-1 


47 


43 
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1244 


AA012129 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2602 1. 


64 


90 


1244 


gi3 874749 


Caenorhabditis 
elegans 


C34E7.3 


50 


56 


1244 


AA012895 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 26787. 


64 


44 


1245 


AAY99386 


Homo sapiens 


GETH Human PRO 1 305 
(UNQ671) ammo acid sequence 
SEQ ID NO:l53. 


71 


39 


1245 


AAO02040 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 15932. 


63 


33 


1245 


gil5073483 


Smorhizobium 
meliloti 


PUTATIVE SENSORY 
TRANSDUCTION HISTIDINE 
KINASE TRANSMEMBRANE 
rKU 1 blJN 


77 


39 


1246 


AAG75420 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID NO:6184. 


49 


40 


1246 


gi4099021 


Helicobacter 
pylori 


amino acid permease 


47 


39 


1246 


gi23 14328 


Helicobacter 
pylori 26695 


glutamine ABC transporter, 
permease protein (glnP) 


47 


39 


1248 


gi2959352 


Brugia pahangi 


cuticle collagen 2(F) 


68 


37 


1248 


AAO03627 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 17519. 


59 


40 


1248 


gil3959819 


Myxococcus 
xanthus 


USC6-lp 


67 


43 


1249 


gi8249629 


Homo sapiens 


partial IGHV gene for 
immunoglobulin heavy chain 
variable region, clone B31. 


62 


44 


1249 


gi6646882 


Paragonimus 
westermani 


"V T A T\TT J 1 1 T * j -| 

NADH dehydrogenase subunit 1 


63 


A f\ 

40 


1249 


AAR39641 


Homo sapiens 


CIBA Transforming Growth 
Factor-betal(44/45)beta2 hybrid. 


44 


37 


1250 


AAU04613 


Homo sapiens 


UNIW Gonadotropin analogue, 
beta subunit. 


40 


58 


1250 


gi3242155 


Drosophila 
melanogaster 


153C9.b 


60 


43 


1250 


AAM63639 


Homo sapiens 


MOLE- Human brain expressed 

i i_ 11 j * 
single exon probe encoded protein 

SEQ ID NO: 35744. 


59 


42 


1251 


AA011677 


T T 

Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 25569. 


74 


C A 

54 ■ 


1251 


gi77 16782 


Drosophila 
simulans 


hehcase pitchoune 


77 


A A 

44 


1251 


gi3342758 


T"\ 1 * 1 

Drosophila 
melanogaster 


hehcase pitchoune 


77 


A A 

44 


1252 


gi482846 


Torgos 
trachehotus 


cytochrome b 


51 


40 


1252 


gl22 / 3 / 


rioraeum 
vulgare 


— ; ; 

beta-hordothionin 


42 




1252 


AAM79945 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3591. 


45 


40 


1253 


gi424891 


Human 

immunodeficienc 
y virus type 1 


envelope glycoprotein 


37 


33 


1253 


gi9654985 


Vibrio cholerae 


glutamate-cysteine ligase 


62 


28 
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1254 


gi3805960 


Populus 
balsamifera 
subsp. 
trichocarpa 


laccase 


AH 
4 / 


7? 


1254 


gi 150740 16 


Smorhizobium 
meliloti 


PUTAllViS IKJUNoCKiJr 11LJ1N 
KbCjU LA 1 UK rKU 1 lillN 


D / 




1254 


gil2652993 


Homo sapiens 


clone IMAGE:3357862, mRNA, 
partial cds. 


56 


47 


1255 


gil655739 


Peromyscus 
maniculatus 


NADH dehydrogenase subunit 4 


44 


24 


1255 


gil6551105 


Crotalus 
adamanteus 


NADH dehydrogenase subunit 5 


66 


28 


1255 


gil6551107 


Crotalus atrox 


NADH denvdrogenase subunit d 






1256 


AAO06799 


Homo sapiens 


HYSE- Human polypeptide 
ID NO 20691. 




^7 


1256 


AAO00659 


Homo sapiens 


HYSE- Human polypeptide hcKl 
ID NO 14Dj 1. 






1256 


AAB51937 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded oy gene y 

ID JNOloy, 


37 


60 


1257 


gi6449037 


Mus musculus 


platelet glycoprotein V 


738 


38 


1257 


gi2104856 


Rattus 

• 

norvegicus 


platelet glycoprotein v 


/ jJ 




1257 


gi2 104845 


Mus musculus 


platelet glycoprotein V 


722 


37 


1258 


AAO 11326 


Homo sapiens 


til at,- Human polypeptide on^ 
ID NO 25218. 


U 1 


40 

-r\j 


1258 


gi7576909 


Damo reno 


her7-protein 




37 


1258 


AAG81428 


Homo sapiens 


ZYMO Human AFP protein 
sequence ISEQ ID IN u. J /4. 


47 


38 


1259 


gi 18349 


Daucus carota 


glycine rich protein (AA 1 - 96) 


65 


45 


1259 


gi336034 


Vesicular 
stomatitis virus 


M-protein 


HC\ 




1259 


gi335876 


Vesicular 
stomatitis virus 


matrix (M) protein 


H(\ 
/K) 




1260 


AAO09307 


Homo sapiens 


HYSE- Human polypeptide Sbi^J 
ID NO 23199. 


*f 1 


D 1 


1260 


gil5042581 


Echinococcus 
granulosus 


NADH dehydrogenase subunit z 




» 1 


1261 


gil0436454 


Homo sapiens 


cDNA FLJ14082 fis, clone 
HEMBB 1002300. 


983 


99 


1261 


AAB95686 


Homo sapiens 


HEL1- Human protein sequence 
SEQ ID NO: 18490. 




QQ 


1261 


AAY20668 


Homo sapiens 


UYRO- Human neurofilament-M 
wild type protein tragment iu. 


44 


50 


1263 


gi965014 


Mus musculus 


ADAM 4 protein precursor 


1303 


51 


1263 


gil061159 


Macaca 
fascicularis 


testicular Metalloprotease-like, 
Disintegrin-like, Cysteine-rich 
protein IVa 


iz / / 




1 O /TO 

1263 


glJLUol loi 


jviacaca 
fascicularis 


testicular \/Tetallot)rotease-like 
Disintegrin-like, Cysteine-rich 
protein IVb 


1249 


38 


1264 


AAM79049 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1711. 


1895 


98 


1264 


AAM80033 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3679. 


1895 


98 


1264 


AAM53458 


Homo sapiens 


MOLE- Human brain expressed 


1074 


100 



WO 03/054152 PCT/US02/39555 

Table 2A 
165 



SEQ ID 
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bpecies 


Ucacnpiiuii 


C li'iirp 
O atUI v 


% Tdentitv 








single exon probe encoded protein 
SEO ID NO: 25563. 






1265 


AAB44605 


Homo sapiens 


rlUivlA- riuman secrexea protein 
sequence encoded by gene 10 SEQ 
ID NO:70. 




70 


1265 


AAG71789 


Homo sapiens 


YEDA Human olfactory receptor 
polypeptide, SEQ ID NO: 1470. 


42 


63 


1265 


AAG72517 


Homo sapiens 


YEDA Human OR-like 
polypeptide query sequence, SEQ 
ID NO: 2198. 


42 


63 


1266 


gil4714741 


Homo sapiens 


clusterin (complement lysis 
inhibitor, SP-40,40, sulfated 
glycoprotein 2, testosterone- 
repressed prostate message 2, 
apolipoprotein J), clone 

mRNA, complete cds. 


1629 


99 


1266 


gi292843 


tt_ „ • „ 

Homo sapiens 


Human 1 Ki^ivi-z protein gene, 
exons 7,8,9 and complete cds. 




Q9 


1266 


gi30251 


Homo sapiens 


Human £>Jr-4U,4U miviNA ior 

^Atnnlpmpnt.aocnr'i' ci'j'F'rl TYrrvt"pin SIP— 
COnipiCIIlCIlt-aooU^lcltCU. ^sJLUldil 01 

40,40 alpha-1 and beta-1 chain. 




y y 


1267 


gi 11493504 i 


xiomo sapiens 




1 1 92 


98 


1267 


gi412723 


synthetic 
construct 


synthetic antithrombin III 


1192 


98 


1267 


gi583741 


synthetic 
construct 


Antithrombin III 


1192 


98 


1268 


gil 1493504 


Homo sapiens 


PRO0309 


1439 


98 


1268 


gi4 12723 


synthetic 
construct 


syntnetic antitnromcnn 111 


14^Q 




1268 


gi583741 


synthetic 
construct 


Antithrombin III 


1439 


98 


1269 


gi203710 


Rattus 
norvegicus 


cytochrome c oxidase suDunix v ic 






1269 


gil200057 


Homo sapiens 


Human mRNA for cytochrome c 
oxidase suDunix vie. 


229 


61 


1269 


gi 12652867 


Homo sapiens 


cytochrome c oxidase subunit Vic, 
clone MCjC: iz>z\j 
IMAGE:3350637, mRNA, 
complete cos. 


229 


61 


1270 


AAM96033 


Homo sapiens 


HUM A- Human reproductive 
system reiatea antigen oxsvi iu 
NO: 4691. 


465 


98 


1270 


A ATT1 0001 

AAUloool 


Homo sapiens 


ri\jNiJ\r JNOvei prostaie giana 
antigen, Seq ID No 180. 


tUJ 


98 


1270 


gi962223o 


Homo sapiens 


caonenn-iiKe protein v kzu 
mRNA, partial cds. 


979 


100 


1271 


gi552137 


Drosophila 
melanogaster 


tropomyosin isoform 9E 


118 


27 


1Z / 1 




melanogaster 


trntinrnvncjin isoform OA 


118 


27 


1271 


^158696 


Drosophila 
melanogaster 


tropomyosin isoform 9D 


118 


27 


1272 


gi6689249 


Streptococcus 

dysgalactiae 

subsp. 

dysgalactiae 


M-like protein 


122 


24 
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1272 


gi6692690 


Arabidopsis 
thaliana 


F12K1L14 


114 


28 


1272 


gi4778 


Saccharomyces 
cerevisiae 


Usol protein 


122 


29 


1273 


gil 3097573 


Homo sapiens 


Similar to thiosulfate 
sulfurtransferase (rhodanese), 
clone MGC: 10492 
IMAGE:361 1253, mRNA, 
complete cds. 


1469 


94 


1273 


gil6876913 


Homo sapiens 


mercaptopyruvate 
sulfurtransferase, clone 
MGC:24539 IMAGE:4 105509, 
mRNA, complete cds. 


1469 


94 


1273 


gil75 11726 


Homo sapiens 


mercaptopyruvate 
sulfurtransferase, clone 
MGC:31798 IMAGE:4131927, 
mRNA, complete cds. 


1469 


94 


1274 


AAB85039 


Homo sapiens 


CURA- Human SER5 protein 
sequence. 


767 


48 


1274 


gi6 137097 


Homo sapiens, 


serine protease DESC1 (DESC1) 
mRNA, complete cds. 


749 


48 


1274 


AAY99414 


Homo sapiens 


GETH Human PR01461 
(UNQ742) amino acid sequence 
SEQ ID NO:269. 


749 


48 


1275 


gil 25 84839 


Homo sapiens 


HT036-ISO (HT036-ISO) mRNA, 
complete cds. 


997 


94 


1275 


gil25 84841 


Homo sapiens 


HT036 (HT036) mRNA, complete 
cds. 


820 


93 


1275 


gil7427028 


Ralstonia 
solanacearum 


CONSERVED HYPOTHETICAL 
PROTEIN 


502 


42 


1276 


gi3 10691 


Simian virus 40 


small T antigen 


48 


47 


1276 


gi8886685 


Centris inermis 


cytochrome b 


53 


40 


1276 


gi625084 


Oncorhynchus 
tshawytscha 


heat-shock protein 30 


37 


44 


1277 


gi7106820 


Homo sapiens 


HSPC215 


261 


100 


1277 


AAU16225 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 1178. 


261 


100 


1277 


AAG81441 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:400. 


261 


100 


1278 


AAM25840 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO: 1355. 


208 


88 


1278 


AAM74914 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 35220. 


63 


68 


1278 


AAM06639 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID NO: 370. 


36 


70 


1279 


gi7023943 


Homo sapiens 


GC36 mRNA, complete cds. 


360 


35 


1279 


gi2502077 


Homo sapiens 


digestive tract-specific calpain 
(nCL-4) mRNA, complete cds. 


360 


35 


1279 


gi2358262 


Rattus 
norvegicus 


calpain large subunit 


351 


35 


1280 


gi4153951 


Homo sapiens 


H.sapiens gene from PACs 295C6 
and313L4. 


259 


37 


1280 


AAY32437 


Homo sapiens 


TEXA Absorptive hypercalciuria 
associated gene protein prodict. 


259 


37 


1280 


gil5383934 


Homo sapiens 


testicular soluble adenylyl cyclase 
mRNA, complete cds. 


259 


37 
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1281 


AAM8965 1 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 17244. 


66 


31 


1281 


gi408591 


Influenza A virus 


nonstructural protein 


62 


28 


1281 


AAM82524 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 101 17. 


62 


55 


1282 


gi4079820 


Mus musculus 


HERC2 


67 


40 


1282 


gi459017 


Allomyces 
macrogynus 


subunit 6 of the ATPase complex 


71 


44 


1282 


gil236414 


Allomyces 
macrogynus 


H(+)-transporting ATPase, F0 
subunit 6 


71 


44 


1283 


AAM63001 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 35106. 


153 


67 


1283 


AAM75812 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 36118. 


153 


67 


1283 


AAE10197 


Homo sapiens 


HYSE- Human bone marrow 
derived peptide, SEQ ID NO: 41. 


60 


36 


1284 


AAG81367 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:252. 


816 


98 


1284 


gi7582286 


Homo sapiens 


BM-007 


530 


98 


1284 


AAG02907 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6988. 


408 


95 


1285 


AAG81367 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:252. 


906 


98 


1285 


gi7582286 


Homo sapiens 


BM-007 


538 


99 


1285 


AAG02907 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6988. 


-416 


96 


1286 


AAW49716 


Homo sapiens 


PROT- Protein polymer adhesive 
substrate PPAS1-C monomer. 


54 


31 


1286 


AAW49721 


Homo sapiens 


PROT- Protein polymer adhesive 
substrate PPAS1-D monomer. 


54 


31 


1286 


gi683735 


Macaca 
fascicularis 


endothelin 3 


50 


62 


1287 


gi5689766 


Homo sapiens 


mRNA for zinc fmger 2 (ZNF2 
gene). 


2092 


99 


1287 


gil4602980 


Homo sapiens 


clone MGC: 16594 
IMAGE:41 10322, mRNA, 
complete cds. 


1609 


100 


1287 


gi 13477207 


Homo sapiens 


clone MGC: 12980 
IMAGE:3350363, mRNA, 
complete cds. 


1604 


99 


1288 


AAB44228 


Homo sapiens 


HUMA- Human cancer associated 
protein sequence SEQ ID 
NO: 1673. 


284 


86 


1288 


AAM90208 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 17801. 


43 


50 


1288 


gi733438 


Cepaea 
nemoralis 


NADH dehydrogenase subunit 4L 


62 


31 


1289 


gi 10764264 


synthetic 
construct 


mutated NS5A 


67 


30 


1289 


AAO02625 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 16517. 


58 


26 



t 
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SEQ ID 


Hit ID 


Species 
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S score 


% Identity 


1289 


gil0644188 


"W T * • J • A^% * 

Hepatitis C virus 
type la 


polyprotein 


an 
b / 




1290 


AAG03150 


Homo sapiens 


GEST Human secreted protein, 
SEO ID NO: 7231. 




OR 


1290 


AAW48931 


Homo sapiens 


CEDA- bcnwannomm-Dinaing 
protein C-terminal fragment 




i on 

1UU 


1290 


AAO04324 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 18216. 


63 


33 


1291 


AAB60098 


Homo sapiens 


INCY- Human transport protein 
TPPT-18. 


1822 


92 


1291 


gil537070 


Rattus 
norvegicus 


nucleoponn p54 


1 /o / 


yz 


1291 


gil5214835 


Homo sapiens 


clone MGC: 13407 
IMAGE:3931652, mRNA, 
complete cds. 


lb22 




1292 


AAY94621 


Homo sapiens 


MILL- Epidermal growth factor- 
like variant in skin-2 amino acid 
sequence. 


385 


100 


1292 


AAE06697 


Homo sapiens 


HYSE- Human TGF alpha-like 
protein. 


385 


100 


1292 


AAE06698 


Homo sapiens 


HYSE- Human TGF alpha-like 
splice variant protein. 


385 


100 


1293 


AAW78245 


Homo sapiens 


HUMA- Fragment of human 
secreted protein encoded by gene 
19. 


1018 


98 


1293 


ABB11835 


Homo sapiens 


HYSE- Human secreted protein 
homologue, SEQ ID NO:2205. 


1018 


98 


1293 


AAM79352 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 2998. 


1 r\ 1 o 
lU 1 o 


ya 


1294 


AAM99920 


Homo sapiens 


HUMA- Human polypeptide SbQ 
ID NO 36. 


00/ 


y / 


1294 


gil6552010 


Homo sapiens 


cDNA FLJ32009 fis, clone 
NT2RP700949o, weakly similar to 
FIBULIN-1, ISOFORM A 
PRECURSOR. 


oo/ 


y / 


1294 


AAM99933 


ir -w- * 

Homo sapiens 


HUMA- Human polypeptide bbQ 
ID NO 49. 


oA 1 


oi 


1295 


gi2598167 


Homo sapiens 


zinc finger protein (HZF6) mRNA, 
5' UTR and partial cds. 


2772 


99 


1295 


gi5640019 


Mus musculus 


zinc finger protein ZFP235 


1565 


68 


1295 


gill 84371 


Mus musculus 


zmc finger protein; Method: 
conceptual translation supplied by 
author 


1278 


rr 


1296 


gil5679947 


Homo sapiens 


endothelial zinc finger protein 
induced by tumor necrosis factor 
alpha, clone MGC: 11153 
IMAGE:3840512, mRNA, 
complete cds. 


2734 


100 


1296 


gi9502202 


Homo sapiens 


endothelial zinc finger protein 

lnauceu Dy lurnor necrosis idctor 
alpha (EZFIT) mRNA, complete 
cds. 


2734 


100 


1296 


gil0437767 


Homo sapiens 


cDNA: FLJ21628 fis, clone 
COL08076. 


1713 


77 


1297 


AAM56742 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 28847. 


99 


55 



I 
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SEQID 


Hit ID 


Species 


Description 


S score 


% Identity 


1297 


AAO09197 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 23089. 


66 


45 


1297 


gi 12543402 


Corynebacterium 
glutamicum 


FRXA00675 


79 


26 


1298 


AAM79176 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1838. 


601 


100 


1298 


ABB 11626 


Homo sapiens 


HYSE- Human Fas-associated 
phosphatase homologue, SEQ ID 
NO: 1996. 


559 


94 


1298 


AAM80160 

* 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3806. 


559 


94 


1299 


gil2698338 


Homo sapiens 


matrix metalloproteinase-28 
precursor, mRNA, complete cds. 


2424 


96 


1299 


gil2698852 


Homo sapiens 


matrix metalloprotease MMP25 
mRNA, complete cds. 


2424 


96 


1299 


AAU 12243 


Homo sapiens 


GETH Human PR04339 
polypeptide sequence. 


2424 


96 


1300 


gi 142 10477 


Homo sapiens 


interleukin 18 precursor, mRNA, 
complete cds. 


138 


92 


1300 


AAW31757 


Homo sapiens 


INCY- Interferon gamma inducing 
factor-2 (IGIF-2) R140I variant. 


138 


92 


1300 


gil 0799833 


Ovis aries 


interleukin- 1 8 (IGIF) 


122 


78 


1301 


AAE05302 


Homo sapiens 


MILL- Human TANGO 457 
protein. 


623 


97 


1301 


AAE05303 


Homo sapiens 


MILL- Human mature TANGO 
457 protein. 


611 


100 


1301 


AAE05305 


Homo sapiens 


MILL- Human TANGO 457 
protein cytoplasmic domain. 


605 


100 


1302 


AAM55396 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 27501. 


64 


38 


1302 


AAM57742 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 29847. 


64 


38 


1302 


AAM67792 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 28098. 


64 


38 


1303 


AAM88370 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 15963. 


38 


35 


1303 


gi7330034 


Macaca mulatta 
rhadinovirus 26- 
95 


helicase-primase 


56 


30 


1303 


gi4494949 


Macaca mulatta 

rhadinovirus 

17577 


helicase/primase 


56 


30 


1304 


gil90870 


Homo sapiens 


Human retinoic acid receptor 
gamma 2 mRNA, 5* end. 


274 


100 


1304 


gi297146 


Homo sapiens 


H. sapiens gene for retinoic acid 
receptor gamma-2. 


274 


100 


1304 


gi200660 


Mus musculus 


retinoic acid receptor gamma 2 


252 


92 


1305 


AAM39737 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2882. 


992 


99 


1305 


AAM39736 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2881. 


875 


100 


1305 


AAM41522 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6453. 


875 


100 
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SEQ ID 


Hit ID 
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S score 


% Identity 


1306 


ABB17891 


Homo sapiens 


HUMA- Human nervous system 
related polypeptide oni^ iu i>hj 
6548. 


54 


38 


1306 


AAM88996 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 

oril^ 11J 1N\J. lOOoy. 


D 1 




1306 


AAM65093 


Homo sapiens 


MOLE- Human brain expressed 
single exon prooe encoaea protein 
SEQ ID NO: 37198. 


55 


50 


1307 


AAY19551 


Homo sapiens 


HUMA- Amino acid sequence oi a 
human secreted protein. 


1 


AO 


1307 


AAY75972 


TT J 

Homo sapiens 


(jriiJNii- Human sKm ceil protein, 
SEQ ID 150. 






1307 


AAB559H 


Homo sapiens 


(jxsJNii- olcin ceil protein, or^v^ ii^j 
NO: 150. 


1 H 


AO 


1308 


AAU27671 


Homo sapiens 


ZYMO Human protein 

A T7D'2<:^/1'71 

Arrijj4 / 1. 


486 


100 


1308 


AA012566 ! 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID INU> Zo4Do. 


61 


28 


1308 


gil67933 


Dictyostelium 
discoideum 


thioredoxin 


67 


34 


1309 


AAR15222 


Homo sapiens 


TEX A Chronic myelogenous 
leukaemia-derived myeloid-related 
protein. 


620 


100 


1309 


gil 81527 


Homo sapiens 


Human neutrophil peptide 
(defensin) 1 mRNA, complete cds. 


493 


100 


1309 


gil8l529 


T T • 

Homo sapiens 


Human aerensm 1 protein mKJN A, 
complete cds. 




i no 


1310 


gi29H559 


Human 

- ii _ * 

papillomavirus 

type 77 


E6 protein 


66 


27 


1310 


gi9800324 


rat 

i • 

cytomegalovirus 
Maastricht 


prl09 


62 


25 


1310 


gi3 97007 


Human 

papillomavirus 
type 3 


envelope protein 


ou 


j i 


1311 


A A TT1 r\Z"0 O 

AAU19632 


Homo sapiens 


riUMA- riuman novel 
extraceuuiar matrix protein, oeq 
ID No 282. 






lO 1 1 

1311 


gii iz /y/o 


riomo sapiens 


ri. sapiens jvln/y ior type v i 
collagen alpha3 chain. 


1 Ou 


42 


1 O 1 1 

13 11 


giD /you 


Mus museums 


conagen aipna j cnam type v i 


176 


40 


1312 


gllODUo 1 /0 


riomo sapiens 


small vj i r^-Dinaing luiiior 
suppressor 1 mRNA, complete cds. 


1012 


100 


1312 


gl 1 OD JJJ .34 


riomo sapiens 


Kig protein miviNA, complete cus. 




100 ' 


1312 


gil6508174 


Mus musculus 


small GTP-binding tumor 
suppressor 1 


963 


93 


1313 


AAG73984 


T T * 

Homo sapiens 


HUMA- Human colon cancer 
antieen nrotein SEO ID NO 474 8 

O-i i Ll gull [./I WWlll Ul^/V^ li-/ 1 iV/*T / ~u» 






1313 


gi3041771 


Homo sapiens 


mRNA for perilipin, complete cds. 


83 


32 


1313 


AAY22157 


Homo sapiens 


ABBO Human BS135 protein 
sequence. 


83 


32 


1314 


AAM71801 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 32107. 


1872 


100 


1314 


gi!6549907 


Homo sapiens 


cDNA FLJ30663 fis, clone 


1203 


58 
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Description 


S score 


% Identity 








FCBBF1000598, moderately 
similar to ZINC FINGER 
PROTEIN 84. 






1314 


AAM78565 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1227. 


1151 


55 


1315 


gi32472 


Homo sapiens 


H.sapiens mRNA for high-sulphur 
keratin. 


785 


76 


1315 


gi3228239 


Homo sapiens 


UHS KerA gene. 


774 


76 


1315 


gi34079 


Homo sapiens 


Human gene for ultra high-sulphur 
keratin protein. 


774 


76 


1316 


gil2655446 


Homo sapiens 


mRNA for keratin associated 
protein 4.4 (KRTAP4.4 gene). 


755 


80 


1316 


gil2655460 


Homo sapiens 


mRNA for keratin associated 
protein 4.12 (KRTAP4.12 gene). 


726 


75 


1316 


gil3278825 


Homo sapiens 


Similar to RIKEN cDNA 
1 1 10054P19 gene, clone 
MGC:2782 IMAGE:2959821, 
mRNA, complete cds. 


726 


75 


1317 


gil 2655462 


Homo sapiens 


mRNA for keratin associated 
protein 4.14 (KRTAP4.14 gene). 


1102 


88 


1317 


gil 2655452 


Homo sapiens 


mRNA for keratin associated 
protein 4.7 (KRTAP4.7 gene). 


1081 


84 


1317 


gil 2655460 


Homo sapiens 


mRNA for keratin associated 
protein 4.12 (KRTAP4.12 gene). 


997 


79 


1318 


AAM79404 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3050. 


850 


74 


1318 


AAM39466 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2611. 


844 


77 


1318 


AAM41252 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6183. 


836 


77 


1319 


gil6552172 


Homo sapiens 


cDNA FLJ32133 fis, clone 
PEBLM2000308, moderately 
similar to ZINC FINGER 
PROTEIN 135. 


800 


65 


1319 


gi6467200 


Homo sapiens 


GIOT-1 mRNA for gonadotropin 
inducible transcription repressor- 1, 
partial cds. 


775 


60 


1319 


gi498721 


Homo sapiens 


H.sapiens HZF10 mRNA for zinc 
finger protein. 


770 


63 


1320 


gi3036963 


Ciona savignyi 


CsCDC42 


163 


60 


1320 


gil 5072535 


Schizophyllum 
commune 


small GTPase CDC42 


162 


60 


1320 


gi520533 


Drosophila 
melanogaster 


Dcdc42 


161 


60 


1321 


AAE02058 


Homo sapiens 


HUMA- Human four disulfide core 
domain (FDCD)-containing 
protein. 


517 


43 


1321 


gil 2655452 


Homo sapiens 


mRNA for keratin associated 
protein 4.7 (KRTAP4.7 gene). 


509 


44 


1321 


gi200964 


Mus musculus 


serine 2 ultra high sulfur protein 


494 


42 


1322 


ABB 12490 


Homo sapiens 


HYSE- Human bone marrow 
expressed protein SEQ ID NO: 
329. 


169 


72 


1322 


gil 4647047 


Puntius titteya 


ATP synthase 8 


56 


37 


1322 


gil 4646929 


Barbus 

cyclolepis 

cyclolepis 


ATP synthase 8 


54 


44 
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Hit ID 
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Description 


S score 


% Identity 


1323 


gi5921470 


Homo sapiens 


mRNA for G8 protein (G8 gene, 
located in the class III region of the 

• "t • i * • i _ 1 i _._ _ _ _____ __ 1 _ _ _\ 

major histocompatibility complex). 


405 




1323 


gi5921473 


Homo sapiens 


mRNA for G8 protein (G8 gene, 
located in the class III region of the 
major histocompatibility complex), 
alternative splice variant lacking 
exon 2. 


381 


92 


1323 


AAM39144 


Homo sapiens 


t *r-_ t t~i T T 1 j * 1 _~1 "¥"7 /"V 

HYSE- Human polypeptide SEQ 
ID NO 2289. 


3o 1 


oo 
yz 


1324 


AAM87150 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 14743. 


74 


38 


1324 


AAB59115 


Homo sapiens 


HUMA- Breast and ovarian cancer 
associated antigen protein 
sequence SEQ ID 823. 


oZ 


ia 


1324 


gil5158712 


Agrobactermm 
tumefaciens str. 
C58 (Cereon) 


AGR_L_725p 


40 




1325 


AAY48404 


Homo sapiens 


MET A- Human prostate cancer- 
associated protein 101. 


Dj 


jU 


1325 


AAM59935 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 32040. 




JO 


1325 


AAM72530 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 32oJo. 


55 


38 


1326 


gi466912 


Mycobacterium 
leprae 


tpz; B1549_C2_20o 


60 




1326 


gi 12203 77 


Avian infectious 
bronchitis virus 


nucleocapsia protein 


Oh- 


56 
_ u 


1326 


gil 3 177409 


Ectocarpus 
siliculosus virus 


"~ 7" 1 IOC 

EsV-1-135 


D3 




1327 


gi200964 


Mus museums 


serine 2 ultra nign sullur protein 


ZU / 


_ _ 


1327 


gi200962 


Mus mus cuius 


serine 1 ultra high sulfur protein 


202 


32 


1327 


gi32472 


Homo sapiens 


H.sapiens mRNA for high-sulphur 
keratin. 


196 


32 


1328 


AAR23732 


Homo sapiens 


MINu Gene 519 cDJNA derived 
peptide. 


D ID 


OO 


1328 


gi35065 


Homo sapiens 


Human NKCj5 mRNA, expressed 
in natural killer cells and T-cells. 


_ 14 


OO 


1328 


AAW59874 


Homo sapiens 


HUMA- Amino acid sequence of 
the cDNA clone CAT-1 
(HTXET53). 


D 14 


66 

OO 


1329 


gi200964 


Mus musculus 


serine 2 ultra high sulfur protein 


359 


49 


1329 


AAM39466 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2611. 


o /i 1 
341 


_ l 


1329 


AAM78420 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1082. 


337 


53 


1330 


giliy_77oy 


Homo sapiens 


Similar tr» RT1TP>J rTYNJA 

1200013F24 gene, clone 
MGC: 12197 IMAGE:3997840, 
mRNA, complete cds. 


1256 


100 


1330 


gi7582294 


Homo sapiens 


BM-011 


781 


98 


1330 


AAM79664 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3310. 


255 


31 


1331 


gil47 18451 


Homo sapiens 


sialic acid-binding lectin 1 1 


796 


71 
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% Identity 








(SIGLEC11) mRNA, complete 
cds. 






1331 


AAY41724 


Homo sapiens 


GETH Human PRO940 protein 
sequence. 


535 


50 


1331 


AAB44280 


Homo sapiens 


GETH Human PRO940 (UNQ477) 
protein sequence SEQ ID NO:259. 


535 


50 


1332 


AAT90731_ 
aal 


Homo sapiens 


FARB Human placental bikunin 
cDNA. 


849 


98 


1332 


gil2804515 


Homo sapiens 


serine protease inhibitor, Kunitz 
type, 2, clone MGC:2021 
IMAGE:2959462, mRNA, 
complete cds. 


848 


98 


1332 


gi2065529 


Homo sapiens 


placental bikunin mRNA, complete 
cds. 


848 


98 


1333 


gil4042550 


Homo sapiens 


cDNA FLJ 14779 fis, clone 
NT2RP4000398, moderately 
similar to ZINC FINGER 
PROTEIN 140. 


2165 


98 


1333 


AAB93164 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 12091. 


2165 


98 


1333 


AAM93693 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3604. 


2159 


100 


1334 


gil2804907 


Homo sapiens 


Similar to metaxin 1, clone 
MGC:2518 IMAGE:3546178, 
mRNA, complete cds. 


1512 


100 


1334 


gil326108 


Homo sapiens 


Human metaxin (MTX) gene, 
complete cds. 


1098 


100 


1334 


gi2564913 


Homo sapiens 

~ 


clk2 kinase (CLK2), propinl, 
cotel, glucocerebrosidase (GBA), 
and metaxin genes, complete cds; 
metaxin pseudogene and 
glucocerebrosidase pseudogene; 
and thrombospondin3 (THBS3) 
gene, partial cds. 


1098 


100 


1335 


AAW85614 


Homo sapiens 


GEMY Secreted protein clone 
fr473 2. 


381 


83 


1335 


AAY94865 


Homo sapiens 


PROT- Human protein clone 
HP 10540. 


381 


83 


1335 


AAY36022 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO. 
407. 


365 


80 


1336 


AAB 18447 


Homo sapiens 


MILL- Amino acid sequence of 
human TANGO 216 polypeptide. 


2257 


99 


1336 


AAB 18455 


Homo sapiens 


MILL- A human TANGO 216 
polypeptide clone. 


2257 


99 


1336 


AAU 19662 


Homo sapiens 


HUMA- Human novel 
extracellular matrix protein, Seq 
ID No 312. 


1876 


96 


1337 


AAB 18447 


Homo sapiens 


MILL- Amino acid sequence of 
human TANGO 216 polypeptide. 


2257 


99 


1337 


AAB 18455 


Homo sapiens 


MILL- A human TANGO 216 
polypeptide clone. 


2257 


99 


1337 


AAU19662 


Homo sapiens 


HUMA- Human novel 
extracellular matrix protein, Seq 
ID No 312. 


1876 


96 


1338 


AAY86303 


Homo sapiens 


HUMA- Human secreted protein 
HOGCK20, SEQ ID NO:218. 


2133 


94 
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S score 


% Identity 


1338 


gil4456615 


Homo sapiens 


PIG-T mRNA for phosphatidyl 
inositol glycan class T, complete 
cds. 


2120 


96 


1338 


gi 15929 132 


Homo sapiens 


clone MGC:8909 
IMAGE:3921680, mRNA, 
complete cds. 


2120 


96 


1339 


gil2836893 


; Gallus gallus 


IPR328-like protein 


160 


29 


1339 


gi3093433 


Homo sapiens 


Chromosome 16 BAC clone 
CIT987SK-625P11, complete 
sequence. 


153 


29 


1339 


gi4558766 


Homo sapiens 


neuronal voltage gated calcium 
channel gamma-3 subunit mRNA, 
complete cds. 


153 


29 


1340 


gil2836893 


Gallus gallus 


IPR328-like protein 


158 


29 


1340 


gi3 093433 


Homo sapiens 


Chromosome 16 BAC clone 
CIT987SK-625P11, complete 
sequence. 


151 


29 


1340 


gi4558766 


Homo sapiens 


neuronal voltage gated calcium 
channel gamma-3 subunit mRNA, 
complete cds. 


151 


29 


1341 


AAW85737 


Homo sapiens 


SAGA Polypeptide with 
transmembrane domain. 


692 


100 


1341 


ABB11882 


Homo sapiens 


HYSE- Human transmembrane 
protein homologue, SEQ ID 
NO:2252, 


692 


100 


1341 


AAG89353 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 473. 


692 


100 


1342 


ABB 12032 


Homo sapiens 


HYSE- Human SIGP 2328134 
homologue, SEQ ID NO:2402. 


866 


97 


1342 


AAY21851 


Homo sapiens 


INCY- Human signal peptide- 
contianing protein (SIGP) (clone 
ID 2328134). 


866 


97 


1342 


gi4101574 


Homo sapiens 


54TMp (54tm) mRNA, complete 
cds. ; 


860 


96 


1343 


gi3002925 


Homo sapiens 


T cell receptor beta chain 
(TCRBV 13S1-TCRBJ2S1) 
mRNA, complete cds. 


1658 


100 


1343 


gi2982508 


Homo sapiens 


mRNA for TCR beta chain, 
specific for Mage 3/HLA-A2. 


1527 


94 


1343 


gi3002933 


Homo sapiens 


T cell receptor beta chain 
(TCRBV3S1-TCRBJ2S3) mRNA, 
complete cds. 


1251 


76 


1344 


gi 14973269 


Streptococcus 

pneumoniae 

TIGR4 


cell wall surface anchor family 
protein 


481 


19 


1344 


gil5991793 


Streptococcus 
gordonii 


platelet binding protein GspB 


303 


17 


1344 


gi8885520 


Streptococcus 
gordonii 


streptococcal hemagglutinin 


293 


16 


134:) 


AAY07751 


Homo sapiens 


T TT TTV >r A T T . -i , . 

HUMA- Human secreted protein 
fragment encoded from gene 8. 


293 


100 


1345 


gil 142588 


Trypanosoma 
brucei 


CR3 


85 


42 


1345 


gi3037018 


Bodo saltans 


NADH dehydrogenase subunit 5 


80 


35 


1346 


AAG78000 


Homo sapiens 


BIOW- Human actin 14. 


663 


100 


1346 


ABB17913 


Homo sapiens 


HUMA- Human nervous system 
related polypeptide SEQ ID NO 


644 


98 
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6570. 






1346 


AAY94954 


Homo sapiens 


GEMY Human secreted protein 
clone iw66_l protein sequence 
SEQ ID NO: 114. 


596 


68 


1347 


gi9837433 


Homo sapiens 


sialic acid binding 
immunoglobulin-like lectin 8 long 
splice variant (Siglec8) gene, 
complete cds. 


2206 


88 


1347 


gi6746556 


T T * 

Homo sapiens 


sialic acid-binding 
immunoglobulin-like lectin-8 
(SIGLEC8) mRNA, complete cds. 


2031 


93 


1347 


gi6980022 


TT ' 

Homo sapiens 


siglec SAF2 (SAF2) mKJNA, 
complete cds. 


om 1 

203 1 




1348 


gil 545 1469 


Homo sapiens 


siglec-like protein (SLG2) gene 
and alternatively spliced variants, 
complete cds. 


2689 


99 


1348 


gil5217166 


Homo sapiens 


sialic acid-binding Ig-like lectin 10 
(SIGLEC 10) mRNA, complete 
cds. 


2682 


99 


1348 


gil4164613 


Homo sapiens 


sialic acid binding 
immunoglobulin-like lectin 10 
(SIGLEC 10) mRNA, complete 
cds. 


2356 


no 


1349 


AAB60112 


Homo sapiens 


INCY- Human transport protein 


775 


100 


1349 


gi9oo31 17 


Homo sapiens 


mRNA for organic cation 
transporter. 






1349 


AAB47000 


Homo sapiens 


dUJiin/ Human JbJUUI protein. 




A Q 


1350 


AAY76219 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 96. 


336 


94 


1350 


gi2906006 


Homo sapiens 


WASP interacting protein (WIP) 
mKJNA, partial cas. 


134 


30 


1350 


gi22269 


Zea mays 


cell wall protein (108 AA) 


105 


34 


1351 

* 


AAB08767 


Homo sapiens 


INCY- A human leukocyte and 
blood related protein (LBAF). 


92 


37 


1351 


gi5 76631 


Torpedo 
marmorata 


14 kDa transmembrane protein 


87 


32 


1351 


AAM78542 


T T * 

Homo sapiens 


HYSE- Human protein SbQ ID 
NO 1204. 


11 


5 1 


1352 


gi5817194 


T T * 

Homo sapiens 


mRNA; cDNA DKFZp434F0l 1 
(from clone DKFZp434F011); 
partial cds. 


221 


so 


1352 


gi7576452 


Homo sapiens 


hBOIT mRNA for potent brain 
type organic ion transporter, 
complete cds. 


160 


36 


1352 


AAY06116 


Homo sapiens 


MILL- Human organic cation 
transporter OCT-3. 


154 


35 


1353 


gil6552104 


Homo sapiens 


cDNA FLJ32082 fis, clone 
OCBBF2000231, weakly similar 

DUnCPUHT TP A QT7 A O 
10 rHvJoJrxlL/JLrlr AoH AZ 

INHIBITOR SUBUNIT B 
PRECURSOR. 


2566 


99 


1353 


AAY66713 


Homo sapiens 


GETH Membrane-bound protein 
PRO1309. 


2566 


99 


1353 


AAU12396 


Homo sapiens 


GETH Human PRO1309 
polypeptide sequence. 


2566 


99 


1354 


gil 5559274 


Homo sapiens 


clone MGC: 20205 


502 


40 
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IMAGE:3627858 5 mRNA. 
complete cds. 






1354 


AAB15549 


Homo sapiens 


INCY- Human immune system 
molecule from Incyte clone 
2774913. 


502 


40 


1354 


AAB 19729 


Homo sapiens 


CURA- Human SECX Clone 
4339264-2 encoded protein. 


502 


40 


1355 


AAY99399 


Homo sapiens 


GETH Human PRO 1268 
(UNQ638) amino acid sequence 
SEQ IDNO:214. 


603 


100 


1355 


AAY78808 


Homo sapiens 


PROT- Hydrophobic domain 
containing protein clone HP 10537 
protein sequence. 


603 


100 


1355 


AAB 87570 


Homo sapiens 


GETH Human PRQ1268. 


603 


100 


1356 


AAM78418 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1080. 


1902 


97 


1356 


ABB 11897 


Homo sapiens 


HYSE- Human F22162_l 
homologue, SEQ ID NO:2267. 


1827 


93 


1356 


AAM79402 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3048. 


1820 


93 


1357 


gi397607 


Homo sapiens 


H.sapiens encoding CLA- 1 
mRNA. 


2331 


99 


1357 


AAY49573 


Homo sapiens 


WHED Human CLA-1 protein 
sequence. 


2331 


99 


1357 


AAW97900 


Homo sapiens 

* 


MILL- Human SR-BI class B 
scavenger. 


2318 


98 


1358 


gi854065 


Human 
herpesvirus 6 


U88 


348 


31 


1358 


gi 10434098 


Homo sapiens 


cDNA FLJ 12547 fis, clone 
NT2RM4000634. 


273 


32 


1358 


AAB95124 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17122. 


273 


32 


1359 


AAE05302 


Homo sapiens 


MILL- Human TANGO 457 
protein. 


1521 


96 


1359 


AAE05303 


Homo sapiens 


MILL- Human mature TANGO 
457 protein. 


1397 


96 


1359 


AAE05305 


Homo sapiens 


MILL- Human TANGO 457 
protein cytoplasmic domain. 


1260 


100 


1360 


gil 0129690 


Homo sapiens 


mRNA for mucolipidin (ML4 
gene). 


804 


53 


1360 


gil0438844 


Homo sapiens 


cDNA: FLJ22449 fis, clone 
HRC09609. 


804 


53 


1360 


gil 3477347 


Homo sapiens 


mucolipin 1, clone MGC:3287 
IMAGE:3507836, mRNA, 
complete cds. 


804 


53 


1361 


AAE04122 


Homo sapiens 


HUMA- Human gene 23 encoded 
secreted protein HE80K73, SEQ 
ID NO: 108. 


214 


61 


1361 


AAE04169 


Homo sapiens 


HUMA- Human gene 23 encoded 
secreted protein HE80K73, SEQ 
ID NO:158. 


207 


60 


1361 


AAG00392 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4473. 


117 


43 


1362 


AAY27853 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 101. 


274 


94 


1362 


gi904289 


Phaseolus 
vulgaris 


fungine endopolygalacturonase 
inhibitor 


66 


41 
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1362 


AAB27594 


Homo sapiens 


HUMA- Human secreted protein 
SEQ ID NO: 95. 


60 


58 


1363 


gi 170 16967 


Homo sapiens 


NUANCE (NUA) mRNA, 
complete cds; alternatively spliced. 


3404 


71 


1363 


gil7861384 


Homo sapiens 


nesprin-2 gamma mRNA, 
complete cds. 


3404 


71 


1363 


gi5262574 


Homo sapiens 


mRNA; cDNA DKFZp434G173 
(from clone DKFZp434G173); 
complete cds. 


3404 


71 


1364 


AAB95854 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 18912. 


72 


in 


1364 


gi9621943 


Pelargonium 
senecioides 


NADH dehydrogenase 


58 


42 


1364 


gi9621945 


Pelargonium 
trifidum 


NADH dehydrogenase 


CO 




1365 


AAW29654 


Homo sapiens 


GEMY Human secreted protein 

1 — ^ <» AT A 1 

DM406 1. 


1 Af\ 

140 


/IS 


1365 


gilOl 87870 


Rhodococcus sp. 


onpA transport 




zo 


1365 


gil5559671 


Homo sapiens 


clone MGC:20633 
IMAGE:4761663, mRNA, 
complete cds. 


72 


33 


1366 


gi 105 66471 


Mus musculus 


Ghacolm 


OCA 


(Jo 


1366 


gi 14278927 


Mus musculus 


gliacolin 


850 


68 


1366 


gi3747099 


Mus musculus 


Clq-related factor 


724 


66 


1367 


gi2745756 


Aotus trivirgatus 


ribonuclease k6 precursor 


/JO 1 

431 


n 1 
91 


1367 


gi5730384 


Eulemur fulvus 
collaris 


ribonuclease k6 precursor 


305 


86 


1367 


gi5730382 


Nycticebus 
coucang 


ribonuclease k6 precursor 


279 


o(J 


1368 


AAE09651 


Homo sapiens 


HUMA- Human gene 13 encoded 
lipid metabolism protein HTJNI73, 
SEQ ID NO:45. 


A CI A 

484 


no 


1368 


AAG64355 


Homo sapiens 


UYFU- Human lambda crystallm. 


400 


y / 


1368 


AAY92506 


Homo sapiens 


INCY- Human OXRE-3 with 
identity to lambda crystallin. 


381 


no 


1369 


AAM25241 


Homo sapiens 


HYSE- Human protein sequence 
SEQ IDNO:756. 


A O A 

484 


yo 


1369 


AAE09651 


Homo sapiens 


HUMA- Human gene 13 encoded 
lipid metabolism protein HTJNI73, 
SEQ ID NO:45. 


O CO 

352 


1 AA 


1369 


AAG64355 


Homo sapiens 


UYFU- Human lambda crystallin. 


268 


98 


1370 


AAM79626 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3272. 


214 


1 AA 
1UU 


1370 


AAM79368 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3014. 


70 


A A 

44 


1370 


AAO02702 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 16594. 


65 


36 


1371 


gi6653659 


Oryctolagus 
cunicuius 


chloride channel CLC-6 


3919 


96 


1371 


ABB 11 826 


Homo sapiens 


HYSE- Human CI channel 
homologue, SEQ ID NO:2196. 


3865 


96 


1371 


gi 1770376 


Homo sapiens 


H. sapiens mRNA for chloride 
channel, ClC-6a. 


1620 


100 


1372 


AAG71967 


Homo sapiens 


YEDA Human olfactory receptor 
polypeptide, SEQ ID NO: 1648. 


725 


97 


1372 


AAG71962 


Homo sapiens 


YEDA Human olfactory receptor 


714 


95 
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polypeptide, SEQ ID NO: 1643. 






1372 


gil2007416 


Mus musculus 


m51 olfactory receptor 


553 


72 


1373 


AAM63071 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 35176. 


278 


100 


1373 


AAM75882 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protem 
SEQ ID NO: 36188. 


278 


100 


1373 


AAM67333 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 27639. 


234 


100 


1374 


AAH2653l_ 
aal 


Homo sapiens 


REGC Human 

proton/oligonucleotide transporter 
nPHTl cDNA. 


920 


100 


1374 


AAB82821 


Homo sapiens 


REGC Human 

proton/oligonucleotide transporter 
nJrrll 1 polypeptide. 


815 


100 


1374 


gi2208839 


Rattus 
norvegicus 


peptide/histidine transporter 


721 


82 


1375 


gil904l8 


Homo sapiens 


Human cathepsin L gene, complete 
cds. 


1597 


87 


1375 


gi297l5 


Homo sapiens 


Human mRNA for pro-cathepsin L 
(major excreted protein MEP). 


1597 


87 


1375 


AAW47031 


Homo sapiens 


USSH Human procathepsin L. 


1597 


87 


1376 


gil 6566341 


Homo sapiens 


G protein-coupled receptor 
(GPR101) gene, complete cds. 


2554 


98 


1376 


AAB86428 


Homo sapiens 


BOEN/ Human brain SERALPHA 
protein. 


2554 


98 


1376 


AAU04369 


Homo sapiens 


AREN- Human G-protein coupled 
receptor, hRUP15. 


2554 


98 


1377 


AAY53605 


Homo sapiens 


METR- Peptide of human KChAP 
that binds to KValpha and Kvbeta 
subunits. 


79 


35 


1377 


AAY53608 


T T _ • 

Homo sapiens 


METR- KChAP domain that binds 
to KValpha and Kvbeta subunits. 


78 


35 


1377 


gi6 102853 


T T * 

Homo sapiens 


mRNA; cDNA DKFZp727A05 1 
(from clone DKFZp727A05l); 
partial cds. 


82 


38 


1378 


AAB61616 


Homo sapiens 


PROT- Human protein HP 1 0678. 


2416 


94 


1378 


AAG68126 


Homo sapiens 


FARB Human 7TM-GPCR protein 
sequence SEQ ID NO:6. 


966 


40 


1378 


AAE 12023 


Homo sapiens 


INCY- Human G-protein coupled 
receptor, GCREC-2. 


95 1 


40 


1379 


AAY30735 


Homo sapiens 


TTT TX M A A * " J /> 

HUMA- Ammo acid sequence of a 
human secreted protein. 


280 


100 


1379 


gi333947 


Human 
respiratory 
syncytial virus 


membrane glycoprotein 


48 


39 


1379 


gi222567 


Human 
respiratory 
syncytial virus 


SH protein 


48 


39 


1380 


gi2459682 


Homo sapiens 


MAGE-B2 (MAGE-B2), MAGE- 
B3 (MAGE-B3), MAGE-B4 
(MAGE-B4), and MAGE-Bl 
(MAGE-Bl) genes, complete cds. 


884 


55 


1380 


gi3687199 


Homo sapiens 


Xp22 bins 169-171 BAC GSHB- 
383H3 (Genome Systems Human 


884 


55 
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BAG Library) complete sequence. 






1380 


gi4033512 


Homo sapiens 


DAM 10 exon 3, partial sequence; 
and DAM10=DSS-AHC critical 
interval MAGE superfamily 
protein (DAM10) gene, complete 
cds. 


867 


54 


1381 


gil0198115 


Homo sapiens 


2P domain potassium channel 
TREK2 (KCNK10) mRNA, 
complete cds. 


2697 


100 


1381 


gi8452900 


Rattus 
norvegicus 


potassium channel TREK- 2 


2556 


95 


1381 


g i45 84799 


Mus musculus 


TREK-1 K+ channel subunit 


1244 


65 


1382 


ABB 11297 


Homo sapiens 


HYSE- Human Coxsackie 
adenovirus receptor homologue, 
SEQ ID NO: 1667. 


699 


97 


1382 


gi 14279421 


Danio rerio 


coxsackievirus and adenovirus 
receptor-like protein 


312 


40 


1382 


gi6013133 


Rattus 
norvegicus 


coxsackie-adenovirus-receptor 
homolog 


306 


39 


1383 


gil 70 16394 


Homo sapiens 


cervical cancer 1 proto-oncogene- 
binding protein KG19 (KG19) 
mRNA, complete cds. 


753 


100 


1383 


AAM58441 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 30546. 


99 


28 


1383 


AAB86126 


Homo sapiens 


HUMA- Human MIT-like protein 
fragment encoded by cDNA clone 
HMSMI80. 


99 


28 


1384 


AAM06866 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID NO: 1074. 


1133 


95 


1384 


gi 15099951 


Mus musculus 


diacylglycerol acyltransferase 2 


959 


51 


1384 


gil5099953 


Homo sapiens 


diacylglycerol acyltransferase 2 
mRNA, complete cds. 


951 


50 


1386 


gi338506 


Homo sapiens 


Human salivary statherin gene, 
exons 2-6. 


254 


83 


1386 


gi338508 


Homo sapiens 


Human statherin mRNA, complete 
cds. 


254 


83 


1386 


AAY94527 


Homo sapiens 


INCY- Human statherin protein. 


254 


83 


1387 


gil0435784 


Homo sapiens 


cDNA FLJ13693 fis, clone 
PLACE2000111. 


1011 


100 


1387 


AAB94721 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 15739. 


1011 


100 


1387 


gil3592427 


Caenorhabditis 
elegans 


similar to glycoproteins 


124 


34 


1388 


gil 2654579 


Homo sapiens 


peptidylprolyl isomerase B 
(cyclophilin B), clone MGC:2224 
IMAGE:2966791, mRNA, 
complete cds. 


918 


98 


1388 


gil4250758 


Homo sapiens 


peptidylprolyl isomerase B 
(cyclophilin B), clone MGC:14109 
IMAGE:3502055, mRNA, 
complete cds. 


918 


98 


1388 


gi337999 


Homo sapiens 


Human secreted cyclophilin-like 
protein (SCYLP) mRNA, complete 
cds. 


918 


98 


1389 


AAE07112 


Homo sapiens 


HUMA- Human gene 6 encoded 
secreted protein fragment, SEQ ID 


2473 


99 
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NO: 129. 






1389 


gi936S530 


Homo sapiens 


mRNA full length insert cDNA 
clone EUROIMAGE 363668. 


2457 


99 


1389 


gi 12053 163 


Homo sapiens 


mRNA; cDNA DKFZp434D0727 
(from clone DKFZp434D0727); 
complete cds. 


2378 


99 


1390 


gil6589056 


Homo sapiens 


type II gonadotropin-releasing 
hormone receptor gene, partial cds. 


1021 


99 


1390 


gil7048804 


Homo sapiens 


The CDS shown includes some 
apparent amino acids (from Gly 10 
onw ards) which would be deleted 
in a short intron 


1008 


98 


1390 


gi 14029600 


Cercopithecus 
aethiops 


GnRH receptor II 


944 


92 


1391 


gil 6359249 


Mus musculus 


RIKENcDNA 130001 0M03 gene 


2226 


91 


1391 


AAM93450 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3100. 


575 


37 


1391 


gil 0438431 


Homo sapiens 


cDNA: FLJ22155 fis, clone 
HRC00205. 


596 


34 


1392 


AAE04896 


Homo sapiens 


INCY- Human transporter and ion 
channel-9 (TRICH-9) protein. 


825 


100 


1392 


gil2003980 


Homo sapiens 


spinster-like protein mRNA, 
complete cds. 


695 


52 


1392 


gil4249892 


Homo sapiens 


spinster-like protein, clone 
MGC:15767 IMAGE:3501826, 
mRNA, complete cds. 


695 


52 


1393 


AAG71515 


Homo sapiens 


YEDA Human olfactory receptor 
polypeptide, SEQ ID NO: 1196. 


1051 


94 


1393 


AAG72603 


Homo sapiens 


YEDA Human OR-like 
polypeptide query sequence, SEQ 
ID NO: 2284. 


1051 


94 


1393 


AAU24762 


Homo sapiens 


SENO- Human olfactory receptor 
AOLFR130B. 


482 


47 


1394 


AAB08894 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 4 SEQ 
ID NO:51. 


165 


59 


1394 


gil 5626257 


Buffalopox virus 


p8 protein homologue 


69 


40 


1394 


AAO0645 1 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 20343. 


69 


35 


1395 


gi2792525 


Equus caballus 


connexin 43 


64 


35 


1395 


gil 5 148992 


Human 

immunodefi cienc 
y virus type 1 


vpu protein 


64 


30 


1395 


gi5738572 


Human 

immunodefi cienc 
y virus type 1 


VPU protein 


60 


33 


1396 


AAM83617 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 11210. 


61 


41 


1396 


gi4467773 


Helicobacter 
pylori 


cytotoxin associated protein A 


60 


34 


1396 


gi7248699 


Helicobacter 
pylori 


cytotoxin associated protein CagA 


60 


34 


1397 


gil 1862939 


Mus musculus 


DDM36 


5233 


88 


1397 


gil 1862941 


Mus musculus 


DDM36E 


5224 


88 


1397 


gi7650186 


Mus musculus 


neighbor of Punc ell protein 


5196 


87 
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1398 


gi5596705 


Homo sapiens 


Novel human mRNA similar to C. 
elegans gene WP:CE18674 9 
TR:Q19985. 


1305 


71 


1398 


gi 15292481 


Drosophila 
melanogaster 


SD03655p 


1174 


59 


1398 


AAB88372 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone PSEC0108. 


886 


71 


1399 


gil335598 


Simian sarcoma 
virus 


coding sequence of pl5E 


64 


33 


1399 


gil4039584 


Casuarius 

* 

casuarius 


ATPase 8 


55 


50 


1399 


gil 7427567 


Ralstonia 
solanacearum 


HYPOTHETICAL 
TRANSMEMBRANE PROTEIN 


72 


42 


1400 


AAM88491 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 16084. 


38 


41 


1400 


AAO09674 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 23566. 


38 


A A 

44 


1400 


gi3845106 


Plasmodium 
falciparum 


metal binding protein (DHHC 
domain) 


55 


36 


1401 


gi!0434098 


Homo sapiens 


cDNA FLJ12547 fis, clone 
NT2RM4000634. 


149 


34 


1401 


AAB95124 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17122. 


149 


34 


1401 


AAO09309 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 23201. 


139 


26 


1402 


AAE01249 


Homo sapiens 


HUMA- Human gene 1 8 encoded 
secreted protein HFIIN69, SEQ ID 
NO:lll. 


222 


100 


1402 


AAE01299 


Homo sapiens 


HUMA- Human gene 18 encoded 
secreted protein HFIIN69, SEQ ID 
NO: 162. 


222 


100 


1402 


AAE01332 


Homo sapiens 


HUMA- Human gene 18 encoded 
secreted protein fragment, SEQ ID 
NO: 197. 


222 


100 


1403 


AAM06589 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID NO: 320. 


237 


1 AA 

100 


1403 


gil 0732779 


Mus museums 


APRIL 


56 


43 


1403 


AAM87662 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 15255. 


38 


58 


1404 


AAO09486 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 23378. 


486 


98 


1404 


gi9955912 


Homo sapiens 


GPVI mRNA for platelet 
glycoprotein VI-2, complete cds. 


288 


37 


1404 


AAB40232 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 46 SEQ 
ID NO: 142. 


326 


40 


1405 


AAM06606 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID NO: 337. 


50 


34 


1405 


gi495989 


Homo sapiens 


Human rearranged IgH chain gene, 
VJ6 region, partial cds. 


58 


27 


1405 


AAM85487 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 13080. 


56 


32 


1406 


gil3377867 


Gallus gallus 


claudin-3 


116 


24 


1406 


gil5553371 


Danio rerio 


claudin c 


112 


26 
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1406 


gil5553375 


Danio rerio 


claudin h 


110 


23 


1407 


AAY78801 


Homo sapiens 


PROT- Hydrophobic domain 
containing protein clone HP00631 
amino acid sequence. 


701 


100 


1407 


AAY32204 


Homo sapiens 


INCY- Human receptor molecule 
(REC) encoded by Incyte clone 
2132179. 


701 


100 


1407 


gi5231135 


Homo sapiens 


androgen induced protein (AIG-1) 
mRNA, complete cds. 


695 


99 


1408 


gil 3543940 


Homo sapiens 


Similar to RIKEN cDNA 
2610017G09 gene, clone 
MGC: 12975 IMAGE: 3 3473 12, 
mRNA, complete cds. 


2232 


96 


1408 


AAB12138 


Homo sapiens 


PROT- Hydrophobic domain 
protein isolated from HT-1080 
cells. 


2221 


96 


1408 


AAG81335 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO: 188. 


2218 


96 


1410 


gil 07 19608 


Homo sapiens 


IL-22 receptor (IL22R) mRNA, 
complete cds. 


1699 


100 


1410 


AAW97861 


Homo sapiens 


ZYMO Human cytokine receptor 
11 (Zcytorll). 


1699 


100 


1410 


AAY97045 


Homo sapiens 


MILL- Human TANGO 241. 


1699 


100 


1411 


gil4090278 


Rattus 
norvegicus 


TAT1 


668 


84 


1411 


gi458247 


Homo sapiens 


Human X-linked PEST-containing 
transporter (XPCT) mRNA, partial 
cds. 


414 


51 


1411 


gi458255 


Homo sapiens 


Human X-linked PEST-containing 
transporter (XPCT) gene, exon 6. 


414 


51 


1412 


gi4378057 


Homo sapiens 


organic anion transporter 1 
(OAT1) mRNA, complete cds. 


317 


51 


1412 


gi4579725 


Homo sapiens 


mRNA for h OAT 1-2, complete 
cds. 


317 


51 


1412 


gi5901645 


Homo sapiens 


organic anion transporter 1 
(SLC22A6) mRNA, complete cds. 


317 


51 


1413 


gi3881524 


Caenorhabditis 
elegans 


ZK1067.4 


714 


41 


1413 


gil 9322 


Lycopersicon 
esculentum 


glycine-rich protein 


63 


50 


1413 


gi2204081 


Pinctada fucata 


insoluble protein 


93 


52 


1414 


AAB43682 


Homo sapiens 


HUMA- Human cancer associated 
protein sequence SEQ ID 
NO: 1127. 


1522 


100 


1414 


gil 2654351 


Homo sapiens 


solute carrier family 25 
(mitochondrial carrier; phosphate 
carrier), member 3, clone 
MGC:5280 IMAGE: 29 84 830, 
mRNA, complete cds. 


1522 


100 


1414 


gil2654961 


Homo sapiens 


solute carrier family 25 
(mitochondrial carrier; phosphate 
carrier), member 3, clone 
MGC:5556 IMAGE:3457151, 
mRNA, complete cds. 


1522 


100 


1415 


gil 7640 15 


Ciona intestinal is 


COS41.5 


314 


42 


1415 


AAM42167 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 7098. 


284 


30 
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1415 


AAE03438 


Homo sapiens 


HUMA- Human gene 12 encoded 
secreted protein HETHW90, SEQ 
ID NO: 121. 


258 


30 


1416 


gil359l7U 


Homo sapiens 


immunoglobulin superfamily 
receptor translocation associated 
protein 2c (IRTA2) mRNA, 
complete cds, alternatively spliced. 


1128 


88 


1416 


gil 5277746 


Homo sapiens 


Fc receptor-like protein 5 (FCRH5) 

T% "VTA 1 ■ 1 

mRNA, complete cds. 


1128 


88 


1416 


AAB82315 


T T * 

Homo sapiens 


T T^y^l^V TT ' 111* 

UYCO Human immunoglobulin 
receptor isoform IRTA2c. 


1128 


o o 

88 


1417 


AAV40386_ 
aal 


Homo sapiens 


INCY- Human zinc binding 
protein ZB-2 encoding cDNA. 


525 


OA 

80 


1417 


gil 3278762 


Homo sapiens 


ring finger protein 5, clone 
MGC:2407 IMAGE:2822537, 
mRNA, complete cds. 


525 


80 


1417 


gil 3366064 


Homo sapiens 


mRNA for HsRmal, complete cds. 


525 


80 


1418 


gi3077703 


Oryctolagus 
cuniculus 


mitsugumin29 


1336 


93 


1418 


gi3461888 


Mus musculus 


mitsugumin29 


1314 


91 


1418 


AAU25436 


XT * 

Homo sapiens 


INCY- Human mddt protein from 
clone LG:171377.1:2000MAY19. 


1050 


96 


1419 


gil 34525 08 


Mus musculus 


claudm 14 


371 


A r\ 

40 


1419 


gil2597447 


Homo sapiens 


claudin 14 (CLDN14) mRNA, 
complete cds. 


370 


40 


1419 


gil 5082421 


Homo sapiens 


Similar to claudin 14, clone 
MGC:20195 IMAGE:46 84949, 

Tt "VTA 1 * J 

mRNA, complete cds. 


370 


40 


1420 


AAG00539 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4620. 


173 


49 


1420 


AAU20426 


T T • 

Homo sapiens 


T TT T"fc jTATT j 1 j * 

HUMA- Human secreted protein, 
Seq ID No 418. 


163 


A '—f 

47 


1420 


AAG75413 


T T * 

Homo sapiens 


Tf YT T"A AT X. T T 1 

HUMA- Human colon cancer 
antigen protein SEQ ID NO:6177, 


159 


50 


1421 


gil4486155 


T"» j 

Bos taurus 


Tit jl V% 1 J. ' 

Rh type B glycoprotein 


1703 




1421 


gil5718471 


Homo sapiens 


Rh type B glycoprotein (RHBG) 
gene, exons 9, and 10 and 

1 A. J 

complete cds. 


1448 


80 


1421 


gi9858562 


Homo sapiens 


Rh type B glycoprotein (RHBG) 

T"\ "VTA 1 i "1 

mRNA, complete cds. 


1448 


80 


1422 


A AH Jtf\f\(\ A r\ 

AAM00949 


T T — * 

Homo sapiens 


HYSE- Human bone marrow 
protein, SEQ ID NO: 425. 


215 


A f 

46 


1422 


AAE01852 


T T * 

Homo sapiens 


T" *f*T TTl A* A T T -t -4 1 "1 

HUMA- Human gene 1 1 encoded 
secreted protein fragment, SEQ ID 
NO:175. 


174 


40 


1422 


gil4209834 


Mus musculus 


ATP-binding cassette transporter 
sub-family A member 7 


178 


38 


1423 


gil 2053628 


Homo sapiens 


mRNA for ribonuclease 7. 


718 


99 


1423 


a a ~\r a a 1 no 
AAY44192 


Homo sapiens 


INNO- Human keratinocyte- 
derived RNase-like protein. 


718 




1423 


AAB 10601 


Homo sapiens 


SCHD Human S AP-2 pre-protein. 


718 


99 


1424 


AAM83996 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 11589. 


1153 


99 


1424 


AAG02219 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6300. 


483 


99 
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% Identity 


1424 


gi222902 


Oncorhynchus 
keta 


pituitary- specific transcription 
factor, Pit-l/GHF-1 


98 


32 


1425 


AAB 82485 


Homo sapiens 


ZYMO Human secretin-like 
receptor Zgprl. 


499 


92 


1425 


AAB82487 


Homo sapiens 


ZYMO Human secretin-like 
receptor Zgprl splice variant. 


499 


92 


1425 


AAE03382 


Homo sapiens 


HUM A- Human gene 5 encoded 
secreted protein HEOMX53, SEQ 
ID NO:40. 


499 


92 


1426 


gi6808374 


Homo sapiens 


mRNA; cDNA DKFZp434G0812 
(from clone DKFZp434G0812); 
partial cds. 


3211 


100 


1426 


gi433383 


Tripneustes 
gratilla 


dynein heavy chain isotype 5A 


241 


26 


1426 


gi6706264 


Leishmania 
major 


dynein heavy chain 


271 


26 


1428 


gil 3540300 


Mus musculus 


nucleolar protein C7B 


427 


34 


1428 


gil3561516 


Mus musculus 


nucleolar protein C7 


426 


34 


1428 


AAM25939 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO: 1454. 


156 


90 


1429 


AAU27632 


Homo sapiens 


ZYMO Human protein 
AFP674535. 


117 


49 


1429 


AAM40391 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 3536. 


117 


49 


1429 


gi7022187 


Homo sapiens 


cDNA FLJ10261 fis, clone 
HEMBB1000975. 


85 


42 


1430 


AAY0775 1 


Homo sapiens 


HUMA- Human secreted protein 
fragment encoded from gene 8. 


293 


100 


1430 


gil 142588 


Trypanosoma 
brucei 


CR3 


84 


43 


1430 


AAO05990 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 19882. 


72 


33 


1431 


AAM93525 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3259. 


246 


36 


1431 


AAY66693 


Homo sapiens 


GETH Membrane-bound protein 
PRO 1004. 


191 


38 


1431 


AAB65216 


Homo sapiens 


GETH Human PRO 1004 
(UNQ488) protein sequence SEQ 
ID NO:227. 


191 


38 


1432 


AAB88388 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone PSEC0131. 


316 


44 


1432 


AAB25719 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 6 SEQ 
ID NO:108. 


84 


73 


1432 


AAM62047 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 34152. 


45 


39 


1433 


AAB88388 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone PSEC0131. 


318 


44 


1433 


AAB25719 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 6 SEQ 
ID NO:108. 


86 


66 


1433 


gi5629917 


Homo sapiens 


partial FLN2 gene for ABP-L, 
gamma fllamin, exons 1 to 3. 


77 


34 


1434 


AAG75991 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID NO:6755. 


235 


77 



T 
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% Identity 


^ A *\ A 

1434 


AAM78941 


T T * 

Homo sapiens 


HY SE- Human protein SEQ ID 
NO 1603. 


220 


86 


1 A O A 

1434 


ABB 14802 


TT . • 

Homo sapiens 


HUMA- Human nervous system 
related polypeptide bEQ ID NO 
3459. 


c o 
JO 


/irk 
40 


"\ A*) C 

1435 


gi9621664 


Homo sapiens 


RRBDL gene tor rhomboid-related 
protein. 


1 1 on 


DO 


1435 


gil4336709 


x T • „ 

Homo sapiens 


16pl3.3 sequence section 3 of 8. 


1 1 oo 

1 123 


JO 


1435 


gi3287191 


Homo sapiens 


mRNA for rhomboid-related 
protein, complete CDS. 


1 12J 


JO 


1430 


gi/lUoooo 


Homo sapiens 




one 
zoo 


JD 


1436 


AAU15887 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 840. 


238 


56 


1436 


AAU16342 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 1295. 


238 


56 


1437 


gil4l8942 


Mus museums 


samaphonn G 


5660 


93 


1437 


AAY94990 


Homo sapiens 


ALPH- Human secreted protein 
vb21_l, SEQ ID NO: 20. 


5403 


99 


1437 


gi2772584 


Homo sapiens 


semaphorin F homolog mRNA, 

i i j 
complete cds. 


3560 


59 


1439 


gil 1055322 


Homo sapiens 


vanilloid receptor-related 
osmotically activated channel 
(VROAC) mRNA, complete cds. 


3324 


100 


1439 


AAI66972_ 
aal 


TT 1 

Homo sapiens 


MILL- Human ion channel VR-5 
cDNA sequence. 


3319 


99 


1439 


AAG65787 


T T 

Homo sapiens 


A iCTT T TT ~ 1 -i \ j-t-j r 

MILL- Human ion channel VR-5 
protein sequence. 


3318 


99 


1440 


gi4 155033 


T T 1 * 1 j 

Helicobacter 
pylori J99 


cag island protein 


72 


25 


1440 


AAU69567 


T T 

Homo sapiens 


T*T T A A T T ^""1 1- ' 1 J 

PHAA Human G protem-coupled 
receptor from cDNA Seq-2643. 


77 


23 


1440 


gil3171062 


turkey 
coronavirus 


M protein 


56 


41 


1441 


gil6554186 


Homo sapiens 


cDNA FLJ25409 fis, clone 
TST03074. 


644 


100 


1441 


gi4235228 


Mus musculus 


leucine zipper-EF-hand containing 
transmembrane protein 1 


497 


59 


1441 


gi6599194 


Homo sapiens 


mRNA; cDNA DKFZp434C229 
(from clone DKFZp434C229); 
partial cds. | 


493 


63 


1442 


gil6359165 


Homo sapiens 


clone IMAGE:4645529, mRNA, 
partial cds. 


1270 


100 


1442 


ABB 11242 


Homo sapiens 


HYSE- Human SLIT-2 
homologue, SEQ ID NO: 1612. 


653 


99 


*t A A S"*k 

1442 


A AB 07469 


Homo sapiens 


ZYMO A human leucme-nch i 
repeat protein designated Zlrr3. 


451 


36 


1443 


AAE03245 


Homo sapiens 


HUMA- Human gene 3 encoded 
secreted protein fragment, SEQ ID 
ino:9j. 


804 


100 


1443 


AAE03244 


Homo sapiens 


HUMA- Human gene 3 encoded 
secreted protein fragment, SEQ ID 
NO:94. 


799 


100 


1443 


AAE03204 


Homo sapiens 


HUMA- Human gene 3 encoded 
secreted protein HNGNN78, SEQ 
ID NO:54. 


617 


100 


1444 


gi4062463 


Escherichia coli 


ABC transporter probable ATP- 


697 


91 
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is score 


to laentity 








binding subunit homolog 






1444 


gil 5487341 


Escherichia con 


macrolide-specmc ABC-type 
efflux carrier 


697 


r\ -i 
91 


1444 


gil 77436 10 


Agrobacterium 
tumefaciens str. 
C58 (Dupont) 


ABC transporter, nucleotide 
binding/ ATPase protein 


366 


51 


•f A\ A s~ 

1446 


gi3 168602 


T T ' 

Homo sapiens 


pi 60 mRNA, partial cds. 




OA 


1446 


AAW3 11 85 


Homo sapiens 


DAND Human pi 60 polypeptide 
160.1. 


3169 


87 


1446 


gi3 168604 


Homo sapiens 


proline and glutamic acid rich 
nuclear protein isoform mRNA, 
partial cds. 


1641 


99 


•% A A ^7 

1447 


gil 40425 15 


T T * 

Homo sapiens 


cDNA FLJ 14761 ns, clone 
NT2RP3003302. 


o/ 


Jo 


1447 


AAG67254 


T T ' 

Homo sapiens 


HELI- Amino acid sequence of a 
human liver-associated gene. 


6/ 


io 


1447 


a a T^fc /"\ A A r\ e* 

AAB94495 


TT ' 

Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 15188. 


tin 
6/ 




1448 


AAB24058 


Homo sapiens 


GETH Human PRO290 protein 
sequence SEQ ID NO:7. 


1972 


100 v 


1448 


AAY66639 


Homo sapiens 


GETH Membrane-bound protein 
PRO290. 


1972 


100 


1448 


AAB65162 


Homo sapiens 


GETH Human PRO290 (UNQ253) 
protein sequence SEQ ID NO: 33. 


1972 


100 


1449 


gil4043409 


Homo sapiens 


Similar to procollagen, type IV, 
alpha 3, clone MGC: 1 1337 

TH 4T A /TP OAT " O 1 1 T*\T A 

IMAGE:3953l3l, mRNA, 
complete cds. 


89 


75 


1449 


gi5420387 


Leishmama 
major 


proteophosphoglycan 


86 


24 


1449 


gil61 17372 


Macropodid 
herpesvirus 1 


ICP4 


102 


27 


1450 


AAU27660 


Homo sapiens 


ZYMO Human protein 
AFP671052. 


889 


100 


1450 


AAG74151 


T T * 

Homo sapiens 


TTT T~K K A TT 1 - ~ 

HUMA- Human colon cancer 
antigen protein SEQ ID NO:4915. 


74 Q 


1 AA 


1450 


gi5 670326 


Homo sapiens 


ICERE-1 mRNA, complete cds. 


110 


31 


J JUT *4 

1451 


AAY86519 


TT * 

Homo sapiens 


HUMA- Human gene 71 -encoded 
protein fragment, SEQ ID NO:434. 


46 


60 


1451 


AAM59183 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 31288. 


61 


39 


1451 


AAM71718 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 32024. 


61 


39 


1452 


gil20532l9 


Homo sapiens 


mRNA; cDNA DKFZp434N1235 
(from clone DKFZp434Nl235); 
complete cds. 


1031 


93 


1452 


gil 5559050 


Ethmostigmus 
rubripes 


ADP-ATP translocator 


738 


68 


1452 


gi339723 


Homo sapiens 


Human ADP/ATP translocase 
mRNA, 3' end, clone pHAT8. 


693 


68 


1453 


gil502578l 


Clostridium 
acetobutylicum 


Predicted membrane protein 


136 


32 


1453 


AAM95190 


Homo sapiens 


HUMA- Human reproductive 
system related antigen SEQ ID 
NO: 3848. 


42 


23 
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1453 


AAM55650 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 27755. 


61 


31 


1454 


gi4929597 


Homo sapiens 


CGI-64 protein mRNA, complete 
cds. 


1879 


96 


1454 


gi6995987 


Homo sapiens 


mitochondrial carrier homolog 1 
isoform a mRNA, complete cds; 
nuclear gene for mitochondrial 
product. 


1818 


99 


1454 


gi6995989 


Homo sapiens 


mitochondrial carrier homolog 1 
isoform b (MTCH1) mRNA, 
partial cds; nuclear gene lor 
mitochondrial product. 


1483 


99 


1455 


gil7131893 


Nostoc sp. PCC 
7120 


WD-repeat protein 


248 


26 


1455 


gi8 86024 


Thermomonospo 
ra curvata 


PkwA 


248 


30 


1455 


gi 172252 10 


Podospora 
anserina 


beta transducin-like protein HET- 
D2Y 


250 


25 


1456 


AAB36840 


Homo sapiens 


ZYMO Human insulin receptor- 
related receptor protein with signal 
peptide. 


6733 


98 


1456 


gil 86555 


Homo sapiens 


Human insulin receptor-related 
receptor (IRR) mRNA, 3 1 end. 


6728 


99 


1456 


AAB36836 


Homo sapiens 


ZYMO Human insulin receptor- 
related receptor protein. 


6728 


99 


1457 


gi6453436 


Homo sapiens 


mRNA; cDNA DKFZp586E041 
(from clone DKFZp586E041); 
partial cds. 


10115 


100 


1457 


ABB 11803 


Homo sapiens 


HYSE- Human GPI- 122 
homologue, SEQ ID NO:2173. 


6423 


99 


1457 


AAY50125 


Homo sapiens 


GEMY Human 
glycophosphatidylinositol- 
anchored protein GPI- 122. 


6323 


100 


1458 


AAU00023 


Homo sapiens 


BIO J Human activated T- 
lymphocyte associated sequence 2, 
ATLAS-2. 


3623 


99 


1458 


AAE04546 


Homo sapiens 


INCY- Human G-protein coupled 
receptor-2 (GCREC-2) protein. 


2570 


79 


1458 


ABB 11735 


Homo sapiens 


HYSE- Human vasopressin 
receptor homologue, SEQ ID 
NO:2105. 


2546 


100 


1459 


gi7021924 


Homo sapiens 


cDNA FLJ10081 fis, clone 
HEMBA1002018. 


2742 


100 


1459 


AAB92508 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 10631. 


2742 


100 


1459 


gil 0435862 


Homo sapiens 


cDNA FLJ13751 fis, clone 
PLACE3000339, weakly similar to 
GLUCOAMYLASE S1/S2 
PRECURSOR (EC 3.2. 1. 3). 


2687 


99 


1460 


AAM95163 


Homo sapiens 


HUMA- Human reproductive 
system related antigen SEQ ID 
NO: 382L 


75 


33 


1460 


AAM06875 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID NO: 1083. 


68 


45 


1460 


AAG76978 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID NO:7742. 


65 


39 
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1*tO 1 


ml ^^^16^ 


jviacaca 
fascicularis 


MHL-u 


84 


30 


1461 


gi2655072 


Pan troglodytes 


MHC class I antigen HLA-H 
ortuoiog 


72 


28 


1461 


gi265507S 


Pan paniscus 


MHC class I antigen HLA-H 
ortholog 


72 


28 


1462 


gi 11066090 


Homo sapiens 


matrix metalloprotease MMP-27 
mRNA, complete cds. 


1191 


85 


1462 


AAE10410 


Homo sapiens 


SCHA/ Human full length matrix 
metalloprotemase-251 (MMP-251) 
protein. 


1188 


85 


1469 




tiomo sapiens 


1NCY- Human peptidase, HPEP- 
10 protein sequence. 


1188 


85 


1463 


gi601948 


Drosophila 
meianogaster 


Inscuteable 


126 


25 


146^ 


gnoo /yoz 


urosopniia 
meianogaster 


Nem 


1 15 


26 




gllDZo I0o4 


oactenopnage 
Mx8 


p4 


66 


40 




A A VIA ^17Q 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 38 SEQ 

ID JN\J: 1 JU. 


434 


87 


1464 


gi406058 


Mus musculus 


protein kinase 


219 


54 


1 464 


gl 1 DO J /ZUh- 


xiomo sapiens 


„_T1 XT A J?^,„ ATA OTIAC _ _ 1 x 

mKJNA tor MAST205, complete 
cds. 


216 


54 


1466 


AAY97293 


Homo sapiens 


INCY- Lipid associated protein 
(LIPAP) 3335404CDL 


1894 


77 


1466 
l*f DO 


A A TiO/IO'i 1 


Homo sapiens 


INCY- Human vesicle associated 
protein 10 SEQ ID NO: 10. 


1116 


48 


1466 


AAM39997 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 3142. 


1116 


48 


14o/ 


gil3278855 


T T 

Homo sapiens 


calcium binding atopy-related 
autoantigen 1, clone MGC:2891 
IMAGE:3009677, mRNA, 
complete cds. 


919 


96 


1467 


gi 13278921 


Homo sapiens 


calcium binding atopy-related 
autoantigen 1, clone MGC:4521 
IMAGE:3009677, mRNA, 
complete cds. 


919 


96 


1467 


AAB58329 


Homo sapiens 


Lung cancer associated 
polypeptide sequence SEQ ID 667. 


919 


96 


1468 


gi 12654401 


Homo sapiens i 


Similar to RNA cyclase homolog, 
clone MGC: 1390 
IMAGE: 3 34346 8, mRNA, 
complete cds. 


690 


78 


1468 


gil0434565 


Homo sapiens 


cDNA FLJ12842 fis, clone 
NT2RP2003286, weakly similar to 
PROBABLE RNA 3 -TERMINAL 
PHOSPHATE CYCLASE (EC 
6 5 14) 


690 


78 


1468 


AAB97250 


Homo sapiens 


HOMO RNA cyclase 41 protein. 


690 


78 


1469 


gil2053215 


Homo sapiens 


mRNA; cDNA DKFZp434K2435 
(from clone DKFZp434K2435); 
complete cds. 


273 


98 


1469 


gi2633333 


Bacillus subtilis 


yhaJ 


57 


25 


1469 


gi 15023682 


Clostridium 
acetobutylicum 


Glycosyltransferase involved in 
cell wall biogenesis 


73 


34 
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1470 


gil4290599 


Homo sar>ier»^ 


IMAGE:3855543, mRNA, 

cnmnlpfp rr\^i 


347 


i r\f\ 

100 


1470 


gi65265 


Xenoous laevis 


a ivciiupua ufjoucdin Dinuing iacior 


1 3 1 
LJ 1 




1470 


gi65201 


Xenopus laevis 


RNA polymerase I transcription 

factor 


128 


29 


1471 


gil3 182747 


Homo sapiens 


miero<?nma1 ^ionnl nf*r»tir1ac<» 

subunit mRNA, complete cds. 




yo 


1471 


AAW29660 


Homo saniens 


HntTlft <3nrMP11C C^J-i r ) r 7 1 r»1 r\r\a. 

secreted protein. 


1 A Q 


96 


1471 


gi 164084 


Canis familiaris 


sienal neofiria«;p 91 IrDa cnhimit 


l^f 1 


OA 


1472 


AAG03600 


Homo sapiens 


Human secreted protein, SEQ ID 
NO- 7681 


245 


86 


1472 


gi7770239 


Homo saniens 


PRG28^1 


ion 
1 jy 


"7 1 

/l 


1472 


gi 14026000 


Mesorhizobium 
loti 


transposase 


64 


46 


1473 


gi3 065951 


Homo sapiens 


Notch3 (NOTCH3) gene, exon 33 
and comrVlptp pHq 


99 


58 


1473 


gi2668592 


Homo sapiens 


Notch3 (NOTCH3) mRNA, 

prvmrVlptp r*rlc 


99 


58 


1473 


AAW49698 


Homo sapiens 


Human Notch3 protein. 


99 


58 


1474 


gi458938 


Saccharomyces 
cerevisiae 


Yhrl86cp 


156 


58 


1474 


gi5921144 


| Schizosaccharom 
yces pombe 


mipl 


151 


83 


1474 


£i6459542 


DpinnpnorMio 

radiodurans 


serin e/tnreonme protein ianase- 
related protein 


96 


31 


1475 


ei6562173 




miviNA, CJLJiN/V x^ivrZ,p jooriU3 J 
^iioiii oiune J-^ivrz^p jooi 1UJ j Jj 
narfinl pHc 


295 


91 


1475 


AAB3S280 


Homo saniens 


rTllTTlJITl CPPTf^t^rl rvrrvfo-iri o «a n nonno 
xxuxiictii iJlULCIIl DCCJUCnCS 

encoded bv eerie 20 53 FO TF) 
NO: 136. 




96 


1475 


AAB94892 


Homo sapiens 


Human nrotein Qpnnpnpp ^IFO TFl 
NO: 16234. 




<>y 


1476 


gi7 160973 


Homo sapiens 


mRNA for VNNP5 nrntpin 

i**A^_L 1 Ivl V XtiX'I^J ^/XWL^XXX. 


1 CKzl 
1!?^'+ 


yo 


1476 


gi6 102996 


Mus musculus 


Vanin-3 


1661 


73 


1476 


gi6649540 


Canis familiaris 


TIFF66 


1370 


67 


1477 


gi7959741 


Homo sapiens 


PRO 1051 


134 


76 


1477 


ei 172903 


Sla prri nrnm v^pq 

cerevisiae 


LicuissunpLion ractor iiiA 


113 


26 


1477 


gi786305 


Saccharomyces 
cerevisiae . 


Transcription factor TFIIIA (PIR 
accession number S20050) 


113 


26 


1478 


si 12652825 


TTfYtnn ccmiptiQ 

1XWXXHJ OALJXV'llO 


vouage-oepenaent anion channel 
2, clone MGG5237 
IMAGE: 2901 130, mRNA, 


/loo 

483 


74 


1478 


gil 5277577 


Homo sapiens 


voltage-dependent anion channel 
2, clone MGC:21498 
IlvlAGE:3875077, mRNA, 
complete cds. 


483 


74 


1478 


gi5 114261 


Homo sapiens 


voltage-dependent anion channel 
isoform 2 (VDAC2) gene, exon 10 
and complete cds. 


483 


74 


1479 


gil0435380 


Homo sapiens 


cDNA FLJ13381 fis, clone : 
PLACE1001010. 


T33 


58 
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1479 


AAB94622 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15476. 


133 


58 


1479 


AAB85361 


Homo sapiens 


Human phosphatase (PP) (clone ID 
7472032CD1). 


118 


50 


1480 


gi!2803725 


Homo sapiens 


APG5 (autophagy 5, S. 
cerevisiae)-like, clone MGC:3622 
IMAGE:3609927, mRNA, 
complete cds. 


204 


100 


1480 


gi2995198 


Homo sapiens 


H.sapiens mRNA for apoptosis 
specific protein. 


204 


100 


1480 


gil2006864 


Homo sapiens 


apoptosis-related protein (APG5L) 
mRNA, complete cds, alternatively 
spliced. 


204 


100 


1481 


gil 26 19679 


Conus arenatus 


conotoxin scaffold VI/VII 
precursor 


56 


25 


1481 


gi2661493 


Drosophila 
melanogaster 


56F3.e 


72 


34 


1481 


gi2648663 


Archaeoglobus 
fulgidus 


DNA-directed RNA polymerase, 
subunit H (rpoH) 


55 


27 


1482 


gil 2654623 


Homo sapiens 


RAB31, member RAS oncogene 
family, clone MGC: 1258 
IMAGE:3534853, mRNA, 
complete cds. 


194 


58 


1482 


gil457954 


Homo sapiens 


Human small GTP-bmdmg protein 
rab22b mRNA, complete cds. 


194 


58 


1482 


gil3S8195 


Homo sapiens 


Human low-Mr GTP-bmdmg 
protein (RAB31) mRNA, complete 
cds. 


194 


58 


1483 


gi7959778 


Homo sapiens 


PRO 1546 


163 


65 


1483 


AAG02753 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6834. 


154 


68 


1483 


AAB93922 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 13907. 


137 


59 


1484 


gi6979921 


Drosophila 
melanogaster 


RhoGTPase , 


160 


32 


1484 


gi7263024 


Drosophila 
melanogaster 


G protein RhoBTB 


160 


32 


1484 


gil5291731 


Drosophila 
melanogaster 


LD24835p 


160 


32 


1485 


gil4150450 


Rattus 
norvegicus 


UDP-GalNAc:polypeptide N- 
acetyl galactosaminyltransferase T9 


246 


93 


1485 


gi3047203 


Caenorhabditis 
elegans 


GLY7 


128 


54 


1485 


gi304259 


Bos taurus 


UDP-GalNAc:polypeptide, N- 
acetylgalactosaminyltransferase 


109 


45 


1486 


AAB95830 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18850. 


202 


76 


1486 


AAG03710 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7791. 


192 


73 


1486 


AAG02922 


T T • 

Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7003. 


182 


69 


1487 


AAB38012 


Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HNHCT15. 


1244 


84 


1487 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 
NO: 121. 


1249 


85 


1487 


gi3 39771 


Homo sapiens 


Human transposon LI ,1 with a 


1243 


84 
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% Identity 








base deletion relative to L1.2B 
resulting m a premature stop codon 
in the coding region. 






1488 


gi!3560707 


V T * 

Homo sapiens 


group XIII secreted phospholipase 
A2 mRNA, complete cds. 


o oo 

383 


98 


1488 


gil5824793 


Homo sapiens 


group XIII secreted phospholipase 
A2 


383 


98 


1488 


AAY27572 


Homo sapiens 


Human secreted protein encoded 
by gene No. 6. 


o oo 

383 


98 


1489 


gi2995442 


Homo sapiens 


mRNA for UDPGahGlcNAc bl,4 
galactosyltransferase. 


C\C\ £1 

996 


98 


1489 


gi4520136 


Homo sapiens 


mRNA for beta- 1,4- 
galactosyltransferase II, complete 
cds. 


996 


98 


1489 


gi3869131 


Mus museums 


beta-l,4~galactosyltransferase II 


946 


94 


1490 


AAM06551 


Homo sapiens 


Human foetal protein, SEQ ID NO: 
282. 


140 


49 


1490 


AAB51718 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 45 SEQ ID 
NO:158. 


129 


58 


1490 


gil 196431 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


124 


52 


1491 


gil 20528 84 


Homo sapiens 


mRNA; cDNA DKFZp564C2478 
(from clone DKFZp564C2478); 
complete cds. 


487 


98 


1491 


gi7023332 


Homo sapiens 


cDNA FLJ 10961 fis, clone 
PLACE1000588, highly similar to 
INTERFERON-INDUCED 
GUANYLATE-BINDING 
PROTEIN 1. 


487 


98 


1491 


AAB93371 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12521. 


487 


98 


1492 


AAG00392 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4473. 


150 


81 


1492 


AAB54106 


Homo sapiens 


Human pancreatic cancer antigen 
protein sequence SEQ ID NO:558. 


136 


71 


1492 


gi914110 


Streptococcus 
pyogenes 


EmmL15 


107 


20 


1493 


gi6855513 


Gallus gallus 


syndesmos 


554 


60 


1493 


gil 3623247 


Homo sapiens 


Similar to RIKEN cDNA 
1110001K21 gene, clone 
MGC: 11275 IMAGE:3944355, 
mRNA, complete cds. 


534 


58 


1493 


gil 2544542 


.-—4 1 J • 

Corynebacterium 
glutamicum 


RXA02115 


99 


o o 

28 


1494 


gil2082725 


Mus musculus 


T"» "Ml "t • ... J /■> -» • 

B cell phosphomositide 3 -kinase 
adaptor 


335 


62 


1494 


gil2082811 


Gallus gallus 


B cell phosphoinositide 3 -kinase 
adaptor 


211 


48 


■» A /~\ A 

1494 


gi330842 


Equine 
herpesvirus 1 


myristylated virion protein 


61 


o o 

33 


1495 


AAB43811 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1256. 


273 


100 


1495 


AAB64482 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 13 SEQ ID 
NO:120. 


273 


100 


1495 


gi5 106795 


Homo sapiens 


sec61 homolog mRNA, complete 


273 


100 
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S score 


% Identity 








cds. 






1496 


AAE01786 


Homo sapiens 


Human gene 17 encoded secreted 
protein HWBEM18, SEQ ID 
NO: 107. 


4730 


97 


1496 


gi6650678 


Mus musculus 


nuclear pore membrane 
glycoprotein POM210 


4025 


76 


1496 


gi56463 


Rattus 
norvegicus 


gp210(AA 1-1886) 


4000 


79 


1497 


gil 85996 


Homo sapiens 


Human Ig germline kappa L-chain 
V~region gene (HK1 66), V-kappa- 
1. 


607 


100 


1497 


AAR38651 


Homo sapiens 


Human V- kappa fragment encoded 
by clone vk65.15. 


607 


100 


1497 


AAR62931 


Homo sapiens 


Human V- kappa vk65 . 1 5 region. 


607 


100 


1498 


gi431857 


Homo sapiens 


H. sapiens mRNA for delta 4-3- 
oxosteroid 5 beta-reductase. 


460 


76 


1498 


gil 1640835 


Homo sapiens 


5-beta steroid reductase (SRD5B1) 
gene, exon 9 and complete cds. 


460 


76 


1498 


gi5689216 


Oryctolagus 
cuniculus 


delta4-3-oxosteroid 5beta- 
reductase 


442 


70 


1499 


gil752736 


Saccharomyces 
cerevisiae 


gene required for phosphoylation 
of oligosaccharides/ has high 
homology with YJR061w 


210 


47 


1499 


AAB53977 


Homo sapiens 


Human colon cancer antigen 
protein sequence SEQ ID 
NO:1517. 


178 


71 


1499 


AAB27956 


Homo sapiens 


Human secreted protein SEQ ID 
NO: 110. 


177 


65 


1500 


AAB93159 


Homo sapiens 


Human protein sequence SEQ ID 
NO:1208L 


2209 


64 


1500 


AAB58796 


Homo sapiens 


Breast and ovarian cancer 
associated antigen protein 
sequence SEQ ID 504. 


1455 


65 


1500 


gil3377567 


Vibrio cholerae 


accessory colonization factor AcfD 


105 


26 


1501 


AAB95655 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18417. 


186 


78 


1501 


AAB95596 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18279. 


172 


80 


1501 


gil 196433 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


182 


78 


1502 


AAB58202 


Homo sapiens 


Lung cancer associated 
polypeptide sequence SEQ ID 540. 


427 


100 


1502 


gi4572328 


Homo sapiens 


caveolin-1 gene, exon 3 and 
complete cds. 


345 


100 


1502 


gi6599075 


Homo sapiens 


caveolin-l/-2 locus, Contigl, 
D7S522, genes CAV2 (exons 1, 
2a, and 2b), CAV1 (exons 1 and 
2). 


345 


100 


1503 


gil 99584 


Mus musculus 


MHox 


405 


96 


1503 


gi51362 


Mus musculus 


DNA-binding protein 


405 


96 


1503 


gil 836044 


Rattus sp. 


rHox protein 


405 


96 


1504 


gi8 163762 


Homo sapiens 


membrane cofactor protein CD46 
variant (MCP) mRNA, partial cds. 


639 


83 


1504 


AAB58394 


Homo sapiens 


Lung cancer associated 
polypeptide sequence SEQ ID 732. 


635 


82 


1504 


AAG75528 


Homo sapiens 


Human colon cancer antigen 


635 


82 
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protein obQ ID NU:o29z. 






1505 


gi!5990400 


Homo sapiens 


clone IMAGE: 3954884, mRNA, 
partial cds. 


872 


83 


1505 


gi3523113 


Homo sapiens 


prostate-specific transglutaminase 
(TCjM4) gene, alternative spliced 
variant, exon 2 and partial cds. 


92 


46 


1505 


AAB54389 


Homo sapiens 


Human pancreatic cancer antigen 
protein sequence obQ ID JNU:o41. 


84 


56 


1506 


AAB94891 


Homo sapiens 


Human protein sequence SEQ ID 
NO:16231. 


214 


65 


1506 


gil 196431 


Homo sapiens 


Human factor VIII gene LI 
element insertion DINA. 


197 


66 


1506 


AAB38280 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 20 SEQ ID 
NO:136. 


196 


69 


1507 


gil 184173 


Homo sapiens 


Human nucleoporin 98 (NUP98) 
mKJNA, complete cds. 


1058 


100 


1507 


gil 1414896 


Homo sapiens 


NUP98 mRNA for nucleoporin, 
complete cds. 


1058 


100 


1507 


gi4545101 


Homo sapiens 


cell-line HeLa Nup98-Nup96 
precursor splice variant 1 mKJNA, 
complete cds. 


1060 


88 


1508 


gi5 106521 


Homo sapiens 


K-Cl cotransporter KCC4 mRNA, 
complete cds. 


1062 


96 


1508 


gil 0440500 


Homo sapiens 


mRNA for FLJ00098 protein, 
partial cds. 


1062 


96 


1508 


gi 104405 14 


Homo sapiens 


mRNA for FLJ00105 protein, 
partial cds. 


1062 


96 


1509 


gi6691968 


Homo sapiens 


Human DNA sequence from clone 
RP1-148M19 on chromosome 
Xpll.22-ll.3 Contains ZNF741 
(zinc finger protein), a ribosomal 
protein L23a pseudogene, STSs 
and GSSs, complete sequence. 


293 


81 


1509 


* a f\r\ /TO O /\ 

gi4096339 


Homo sapiens 


Human zinc linger protein 
(ZNF741) mRNA, complete cds. 


293 


ol 


1509 


AAB21033 


Homo sapiens 


Human nucleic acid-binding 
protein, NuABF-37. 


293 


81 


1510 


AAB38012 


Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HNHCT15. 


426 


57 


1510 


gi5052951 


Homo sapiens 


LINE1 element inserted in B- 
globin gene intron 2. 


425 


57 


1510 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 

XT/~i • 1 11 

ss\j, izi. 


426 


57 


1511 


gil6041769 


Homo sapiens 


clone MGG23189 
ljvlA(jrb:4o5451d, mRNA, 
complete cds. 


313 


55 


1511 


gi4454678 


Homo sapiens 


zinc finger protein 4 


308 


55 


iri 1 

15 1 1 


glloO / /4 


riomo sapiens 


Human Kruppel related zinc imger 
protein (HTF10) mRNA, complete 
cds. 


321 


Z>J 


1512 


gil4027838 


Mesorhizobium 
loti 


transcriptional regulator 


91 


26 


1513 


gi48 86463 


Homo sapiens 

V 


mRNA; cDNA DKFZp586G1219 
(from clone DKFZp586G1219); 
partial cds. 


1039 


100 
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1513 


gi4337460 


Homo sapiens 


neuroblastoma-amplified protein 
mRNA, complete cds. 


1039 


100 


1513 


AAB38417 


Homo sapiens 


Fragment of human secreted 
protein encoded by gene 5 clone 
HCGMF16. 


1039 


100 


1514 


gil621611 


Homo sapiens 


Human TRAP family member- 
associated NF-kB activator TANK 
mRNA, complete cds. 


1667 


76 


1514 


gil518018 


Homo sapiens 


Human TRAF-interacting protein 
I-TRAF mRNA, complete cds. 


1655 


77 


1514 


AAW27163 


Homo sapiens 


Human TRAF inhibitor protein I- 
TRAF. 


1655 


77 


1515 


gil2957169 


Mus musculus 


synaptotagmin-like protein 3-b 


142 


60 


1515 


gil 3647085 


Mus musculus 


synaptotagmin-like protein 3-a 
delta 3S-II 


142 


60 


1515 


gi 13647079 


Mus musculus 


synaptotagmin-like protein 3-a + 
3S-I 


142 


60 


1516 


gi6688199 


Homo sapiens 


mRNA for AMP-activated protein 
kinase gamma2 subunit (AMPK 
gamma2 gene). 


1055 


93 


1516 


gi5931569 


Homo sapiens 


mRNA for H91620p, complete 
cds. 


1055 


93 


1516 


gil2642942 


Homo sapiens 


AMP-activated protein kinase 
gamma subunit (PRKAG2) 
mRNA, complete cds. 


1055 


93 


1517 


gi6807587 


Homo sapiens 


Novel human gene mapping to 
chomosome 1. 


2360 


100 


1517 


gil 769491 


Homo sapiens 


Human kruppel-related zinc finger 
protein (ZNF184) mRNA, partial 
cds. 


1135 


49 


1517 


gil 86774 


Homo sapiens 


Human Kruppel related zinc finger 
protein (HTF10) mRNA, complete 
cds. 


904 


41 


1518 


gi9956065 


Homo sapiens 


clone CDABP0092 mRNA 
sequence. 


602 


100 


1518 


gi4038733 


Homo sapiens 


mRNA for beta 2-microglobulin, 
complete cds. 


602 


100 


1518 


gi5725512 


Homo sapiens 


beta-2 microglobulin gene, 
complete cds. 


602 


100 


1519 


gil87177 


Homo sapiens 


Human lamin-like protein in 
Hindlll repetitive element derived 
DNA, 3' end. 


216 


57 


1519 


gi339771 


Homo sapiens 


Human transposon Ll.l with a 
base deletion relative to L1.2B 
resulting in a premature stop codon 
in the coding region. 


223 


57 


1519 


gi5070622 


Homo sapiens 


retrotransposon LI insertion in X- 
linked retinitis pigmentosa locus, 
complete sequence. 


223 


57 


1520 


AAE03963 


Homo sapiens 


Human gene 17 encoded secreted 
protein fragment, SEQ ID NO: 142. 


110 


47 


1520 


gi4097459 


Elephantulus 
edwardii 


reverse transcriptase 


109 


47 


1520 


A AB 94930 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16405. 


109 


46 


1521 


gil0934047 


Mus musculus 


Scot-tl 


1035 


77 


1521 


gil0934052 


Mus musculus 


Scot-t2 


1035 


77 
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1521 


gilo4423 


Sus scrofa 


succmyl-CoA: alpha- Ketoacia 
coenzyme A transferase 


1UZ4 


ol 


1522 


gil2003128 


Eremothecium 
gossypii 


Yerl54p 




o 1 
-5 I 


1522 


gi5052482 


Drosophila 
melanogaster 


BcDNA.GH02220 


102 


29 


1522 


gi6227006 


Arabidopsis 
thaliana 


r IoCjio.o 




Z>\) 


1523 


gi3419880 


t r • 

Homo sapiens 


niKJNA lor MJjwAJJAJVU 1, 
complete cds. 




1 nn 


1523 


gi836683 


TT ' „ „ 

Homo sapiens 


Human 

metalloprotease/disintegrin-like 
^mjjo) gene, partial cas. 






1523 


AAR75352 


Homo sapiens 


Human fetal brain MDC protein. 


2984 


99 


1524 


gi 1109782 


TT ~ • „ 

Homo sapiens 


Human protein-tyrosine 
phosphatase mRNA, complete cds. 


ZD 1 o 


y / 


1524 


gi!781037 


Mus musculus 


neuronal tyrosine threonine 
phosphatase 1 


1 QQA 

lyyo 


o / 


1524 


AAB66436 


Homo sapiens 


Human MAP-kinase phosphatase 
n vxiD. 


883 


99 


1525 


gi757911 


Homo sapiens 


H.sapiens mRNA for A2b 
adenosine receptor. 


442 


100 


1525 


gil78150 


Homo sapiens 


Human adenosine azd receptor 
(ADORA2) mRNA, complete cds. 




1 nn 
1 uu 


1525 


AAR41526 


T T * 

Homo sapiens 


Human A2b adenosine receptor. 




1 


1526 


gil3540160 


Homo sapiens 


TCF12-TEC fusion protein 
mRNA, partial cds. 


335 


79 


1526 


gi 183 930 


Homo sapiens 


Human HEB helix-loop-helix 
protein (HEB) mRNA, complete 
cds. 


337 


75 


1526 


g i 184448 


Homo sapiens 


transcription factor (HTF4) 
mRNA, complete cds. 


337 


75 


1527 


AAB43940 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1385. 


741 


95 


1527 


gi 14602778 


Homo sapiens 


spermine synthase, clone 
MGC:2071 IMACj.b:35 06022, 
mKJNA, complete cds. 


690 


95 


1527 


gi2198557 


TT * 

Homo sapiens 


spermidine aminopropyltransferase 
mRNA, complete cds. 






1528 


gil3436l52 


Homo sapiens 


reticulocalbm 2, Hr-nand calcium 
binding domain, clone MGC:1650 
IMAGE:3505241, mRNA, 
complete cas. 


0 /4 


oi 


1528 


gi469885 


Homo sapiens 


Ksapiens ERC-55 mRNA. 


674 


81 


1528 


AAW21949 


TT _ " 

Homo sapiens 


Eo-Dindmg protein Eo-BPSlJ/. 


0 /4 


ol 


1529 


gil044033l 


Homo sapiens 


cDNA: FLJ23591 fis, clone 
LNCjl4/2y. 


606 


100 


1529 


AAB84327 


Homo sapiens 


Amino acid sequence of a human 
lyase ana associatea protein 
HLYAP-2. 


606 


100 


1529 


gil5l50358 


Mus musculus 


UDP-glucuronic acid 
decarboxylase 


605 


99 


1530 


AAB38012 


Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HNHCT15. 


3094 


92 


1530 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 


3094 


92 
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\T/-\. ini 

NO:121. 






1530 


gi5052951 


Homo sapiens 


LINE1 element inserted in B- 
globin gene intron 2. 


3083 


91 


1531 


gi5596433 


Homo sapiens 


candidate tumor suppressor protein 
NOC2 (NOC2) mRNA, complete 
cds. 


330 


87 


1531 


gi7020579 


Homo sapiens 


cDNA FLJ20462 fis, clone 
KAT06107. 




o / 


1531 


gi 13477353 


Homo sapiens 


Similar to rabphilin 3A-Iike 
(without C2 domains), clone 
MGC:3453 IMAGE: 35 293 17, 
mRNA, complete cds. 


j/y 


O / 


1532 


gil531645 


Rattus 
norvegicus 


C2-HC type zinc imger protein r- 
MyT3 


1 Q 1 zL 
Ly 14 


o / 


1532 


gi29l475l 


Rattus 
norvegicus 


neural zinc finger factor 3; NZF-3 


1 QAQ 


oo 


1532 


gil53l653 


Xenopus laevis 


C2-HC type zinc finger protein X- 
MyTl 


300 


42 


1533 


gil2805043 


Homo sapiens 


clone IMAGE: 346 1487, mRNA, 
partial cds. 


364 


41 


1533 


AAG81328 


Homo sapiens 


Human AFP protein sequence SEQ 
IDNO:174. 


356 


A 1 

41 


1533 


gi2226004 


Homo sapiens 


Human Tiggerl transposable 
element, complete consensus 
sequence. 


/l A 1 

44 1 


1 L 


1534 


gil0436783 


Homo sapiens 


cDNA FLJ 14341 fis, clone 

r-pT T "\ 7 "O f~\ l/^A/^iTylO -. i c 1 1 -, lint* 

1HYK01UUU343, wealtly similar 
to ATROPHIN-1. 


3184 


97 


1534 


AAB95860 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18924. 


3184 


97 


1534 


gil2802l59 


Homo sapiens 


SH3-SAM adaptor protein 
(HACol) mKJNA, complete cds. 


515 


46 


1535 


gi!5928572 


Mus museums 


Similar to leucine rich repeat (in 

TIT* TT\ * l m * J • 

FLU) interacting protein 2 


103 


66 


1535 


gil409l82l 


Oryza sativa 


Putative protein with region similaT 
to cyclin-dependent kinase like 
proteins 


98 


38 


1535 


gi70202l4 


Homo sapiens 


cDNA FLJ20248 fis, clone 
OvJLr 0343. 


97 


63 


1536 


AAG02639 


Homo sapiens 


Human secreted protein, SEQ ID 

JNU: 0/2U. 


160 


71 


1536 


AAG02753 


Homo sapiens 


Human secreted protein, SEQ ID 
NO. Ooi4. 


141 


60 


1536 


gi7959778 


Homo sapiens 


PR01546 


140 


60 


1537 


gi7022610 


Homo sapiens 


cDNA FLJ 10521 lis, clone 
NT2RP2000841. 


10/1 

184 




1537 


AAB92909 


Homo sapiens 


Human protein sequence SEQ ID 

\T/\ i 1 fori 

NO:11539. 


184 


39 


1537 


gill527193 


Drosophila 


Sunday driver 


112 


41 


1538 


gil2654055 


Homo sapiens 


clone IMAGE:3455871, mRNA, 
partial cds. 


849 


85 


1538 


gi3 133291 


Homo sapiens 


mitogen activated protein kinase 
activated protein kinase gene, 
complete cds. 


844 


85 


1538 


gi2911813 


Mus musculus 


mitogen-activated protein kinase- 
activated protein kinase 


841 


84 
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S score 


/o laentixy 


1539 


gill761810 


Arabidopsis 
thaliana 


glutathione dependent 
dehydroascorbate reductase 
precursor 


60 


32 


1540 


gi 13 162677 


Homo sapiens 


GLUT4 enhancer factor mRNA, 
complete cds. 


1399 


99 


1540 


gil2655101 


Homo sapiens 


clone IMAGE: 3 140406, mRNA, 
partial cds. 


1399 


99 


1540 


AAB58934 


Homo sapiens 


Breast and ovarian cancer 
associated antigen protein 
sequence SEQ ID 642. 


1,395 


yy 


1541 


gi30058 


Homo sapiens 


Human mRNA for pro-alpha- 1 
type 3 collagen. 


1606 


98 


1541 


gi!6197601 


Homo sapiens 


type III preprocollagen alpha 1 
chain (COL3A1) gene, exon 1 and 
complete cos. 


1606 


98 


1541 


AAW12842 


Homo sapiens 


Truncated pro-alphal(III) chain. 


1606 


98 


1542 


AAE01436 


Homo sapiens 


T T 1 J J J. J 

Human gene I encoded secreted 
protein HWLFJ10, SEQ ID 
NO:91. 


1622 


r\r\ 

99 


1542 


AAE01464 


XX * 

Homo sapiens 


Human gene 1 encoded secreted 
protein HWLFJ10, SEQ ID 
NO: 119. 


lolo 


9o 


1542 


AAE01515 


Homo sapiens 


Human gene 1 encoded secreted 
protein fragment, SbQ ID NO: 172. 


1618 


98 


1543 


gi 186043 


XX * 

Homo sapiens 


immunoglobulin light chain 
variable region (IGL@) mRNA, 
partial cds. 


54 / 


oO 


1543 


gi219886 


Homo sapiens 


Human Ig kappa light chain gene, 
V- and J-region. 


54.3 


oy 


1543 


gi33248 


Homo sapiens 


H. sapiens gene for Ig kappa light 
chain variable region '012'. 


540 


oy 


1544 


gi76736l8 


Mus musculus 


ubiquitin specinc protease 


1 Oil 


/ j 


1544 


gi7328l68 


Homo sapiens 


mRNA; cDNA DKFZp434K1822 
(from clone DKFZp434K1822); 
partial cds. 


1004 


74 


1544 


gi5 823525 


Drosophila 
melanogaster 


ubiquitin-specific protease nonstop 


957 


41 


1545 


gi6693836 


Rattus 
norvegicus 


SNIP-b 


3975 


85 


1545 


gi6693834 


X^ , j 

Rattus 
norvegicus 


SNIP-a 


39 Id 




1545 


gi30984l8 


Mus musculus 


P140 


3699 


75 


1546 


gil79433 


Homo sapiens 


Human biglycan (BGN) gene, 
exon 8. 


2032 


95 


1546 


gil28032l7 


Homo sapiens 


biglycan, clone MGC:2298 

IMAOJi.o lOZoJJ, mKJNA, 

complete cds. 


15 12 


95 


1546 


gi 13279002 


Homo sapiens 


biglycan, clone MGC: 10461 
complete cds. 


1512 


96 


1547 


gil4718648 


Homo sapiens 


allantoicase mRNA, partial cds. 


111 


95 


1547 


gi9255889 


Mus musculus 


allantoicase 


1039 


58 


1547 


gi4929823 


Xenopus laevis 


allantoicase 


681 


41 


1548 


gil2653161 


Homo sapiens 


ribosomal protein L35, clone 
MGC:8582 IMAGE:2960987, 
mRNA, complete cds. 


203 


78 
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1548 


gi 150 12043 


Homo sapiens 


Similar to ribosomal protein L35, 
clone MGC:l3488 
IMAGE:4251487, mRNA, 
complete cds. 


203 


78 


1548 


gi562074 


Homo sapiens 


Human ribosomal protein L35 
mRNA, complete cds. 


203 


78 


1549 


gi 1903236 


Mus musculus 


capping protein beta 3 subunit 


156 


100 


1549 


gi595257 


T T" * 

Homo sapiens 


Human F-actm capping protein 
beta subunit mRNA, complete cds. 


156 


1 AA 
IUU 


1549 


gi5 00749 


Mus musculus 


capping protein beta subunit, 
isoform 2 


156 


1 AA 


1550 


gil5278186 


Homo sapiens 


MAGI- 1 A mRNA, complete cds, 
alternatively spliced. 


526 


Q1 

y i 


1550 


gi3370998 


Homo sapiens 


mRNA for BAI1 -associated 
protein 1, complete cds. 




y l 


1550 


gil5278182 


Homo sapiens 


MAGI- IB alpha beta mRNA, 
complete cds, alternatively spliced. 




01 


1551 


gi 12654299 


Homo sapiens 


clone IMAGE: 3447394, mRNA, 
partial cds. 




QQ 

yy 


1551 


gil 1095188 


Homo sapiens 


dipeptidyl peptidase 8 (DPP 8) 
mRNA, complete cds. 


1357 


67 


1551 


AAB47187 


T -w- • 

Homo sapiens 


Human DPP8. 


lor 7 

13j / 


o/ 


1552 


gil91012 


Cricetus cricetus 


ornithine decarboxylase 


1 C3 


3o 


1552 


g i49440 


Cricetus cricetus 


ornithine decarboxylase (AA 1- 
455) 


1 <i 
ID3 


JO 


1552 


gi9858179 


Danio rerio 


ornithine decarboxylase 


149 


44 


1553 


gil2053087 


Homo sapiens 


mRNA; cDNA DKFZp434B0819 
(from clone DKFZp434B08l9); 
complete cds. 


168 


IUU 


1553 


gil61 18555 


Homo sapiens 


ELMOl mRNA, complete cds. 


168 


100 


1553 


gil61 18551 


Mus musculus 


ELMOl 


168 


100 


1554 


AAY91640 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 34 SEQ ID 
NO:313. 


238 


81 


1554 


AAB93352 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12476. 


238 


81 


1554 


AAY91484 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 34 SEQ ID 
NO:157. 


233 


1 AA 
IUU 


1555 


AAB38012 


Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HNHCT15. 


726 


87 


1555 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 
NO:l2l. 


726 


87 


1555 


gil 196433 


Homo sapiens 


T T ■ y"TTT T~ 1 

Human factor VIII gene LI 
element insertion DNA. 


727 


87 


1556 


gi4530437 


Homo sapiens 


thyroid hormone receptor- 
associated protein complex 
component i KArz4U mKiNA, 
complete cds. 


1 1 Af 
1125 


90 


1556 


gi7 109237 


Drosophila 
melanogaster 


TRAP240 


213 


44 


1556 


gi7230590 


Drosophila 
melanogaster 


Pap/DTRAP240 


213 


44 


1557 


AAB 64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 


2330 


89 
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S score 


% Identity 








NO: 121. 






1557 


AAB38012 


Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HNHCT15. 


2325 


89 


1557 


gi5052951 


Homo sapiens 


LINE1 element inserted in B- 
globin gene intron 2. 


2309 


88 


1558 


gil 1066463 


Rattus 
norvegicus 


RhoGEF glutamate transport 
modulator GTRAP48 


3450 


72 


1558 


gi71 10160 


Homo sapiens 


guanine nucleotide exchange factor 
(LARG) mRNA, complete cds. 


1063 


50 


1558 


AAW64468 


Homo sapiens 


Human secreted protein from clone 
CW420 2. 


1063 


50 


1559 


gil 0440888 


Morone saxatilis 


myosin heavy chain FM3A 


456 


51 


1559 


gil5982970 


Danio rerio 


myosin IIIA 


476 


55 


1559 


gi7958618 


Homo sapiens 


class III myosin (MY03 A) 
mRNA, complete cds, alternatively 
spliced. 


456 


51 


1560 


AAB93846 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13688. 


1112 


66 


1560 


gi3 834629 


Mus musculus 


diaphanous-related formin; pl34 
mDia2 


565 


28 


1560 


gi3 171906 


Homo sapiens 


mRNA for dia-156 protein. 


559 


30 


1561 


AAB59019 


Homo sapiens 


Breast and ovarian cancer 
associated antigen protein 
sequence SEQ ID 727. 


162 


47 


1561 


AAG74843 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO: 5 607. 


88 


52 


1561 


AAB9575 1 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18660. 


85 


30 


1562 


gil3195147 


Mus musculus 


HCH 


1228 


87 


1562 


AAW03515 


Homo sapiens 


Human DOCK180 protein. 


1308 


56 


1562 


gil339910 


Homo sapiens 


Human DOCK180 protein mRNA, 
complete cds. 


1304 


56 


1563 


AAB52017 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 6 SEQ ID NO: 66. 


266 


100 


1563 


gi6449393 


Callinectes 
sapidus 


copper-specific metallothionein 
CuMT-II 


57 


37 


1563 


gi60 10621 


Human 

calici virus strain 
BAY/2. 1/98/DE 
U 


RNA-dependent RNA polymerase 


51 


36 


1564 


gi540073 


Homo sapiens 


Human agouti gene, exon 3 and 
complete cds. 


362 


97 


1564 


gi608648 


Homo sapiens 


agouti signalling protein (ASP) 
gene, complete cds. 


356 


96 


1564 


AAW10102 


Homo sapiens 


Human agouti signalling protein. 


352 


84 


1565 


gi2463646 


Homo sapiens 


Human 3-hydroxy-3- 
methylglutaryl CoA synthase gene, 
exon 9 and complete cds. 


1277 


100 


1565 


gi6 19877 


Homo sapiens 


H. sapiens mRNA for 3-hydroxy-3- 
methylglutaryl coenzyme A 
synthase. 


1277 


100 


1565 


gil 5928472 


Mus musculus 


3-hydroxy-3-methylglutaryl- 
Coenzyme A synthase 2 


1155 


88 


1566 


gi5912057 


Homo sapiens 


mRNA; cDNA DKFZp434B172 
(from clone DKFZp434B172); 
partial cds. 


350 


98 
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1566 


gi7578787 


T T * 

Homo sapiens 


AD021 protein (AD021) mRNA, 
complete cds. 


157 


A 1 

41 


1566 


AAB57036 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO:l6l4. 


69 


33 


1567 


gi2226005 


Homo sapiens 


Human Tiggerl transposable 
element, complete consensus 
sequence. 


269 


53 


1567 


gi7634783 


T "T • 

Homo sapiens 


HDCMB45P mRNA, partial cds. 


210 


43 


1567 


AAB39252 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 12 SEQ ID 
NO: 132. 


126 


61 


1568 


gil2653985 


Homo sapiens 


glycine cleavage system protein H 
(aminomethyl carrier), clone 
MGC:5190 IMAGE:3451361, 
mRNA, complete cds. 


846 


91 


t c y n 

1568 


gi 184348 


Homo sapiens 


f -f T TT j * T^ "V T k 

Human H-protem mRNA, 
complete cds. 


846 


91 


1568 


gi2 19671 


Homo sapiens 


mRNA for hydrogen earner 
protein, a component of an enzyme 
complex, glycine synthase (EC 
2.1.2.10). 


846 


91 


1569 


gi388109 


Enterococcus 
faecalis 


regulatory protein 


64 


47 


1 c s~ r\ 

1569 


AAY25744 


T T * 

Homo sapiens 


Human secreted protein encoded 
from gene 34. 


mm y*- 

56 


50 


1570 


gi532505 


T T * 

Homo sapiens 


TT 1 *1 '1 VI A A * "1 

Human bile acid CoA: Ammo acid 
N-acyltransferase mRNA, 
complete cds. 


2049 


99 


1570 


gil5215152 


Mus musculus 


Similar to bile acid-Coenzyme A 
dehydrogenase: ammo acid n- 
acyltransferase 


1412 


68 


1570 


gi604902 


Rattus 
norvegicus 


Kan-1 


1394 


68 


1571 


AAG89290 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 410. 


198 


97 


1571 


gil5467674 


Homo sapiens 


OSBP-related protein 1 mRNA, 
complete cds. 


204 


100 


1 CT1 

1571 


AAB38248 


TT— ! ^ 

Homo sapiens 


Human secreted protein sequence 
encoded by gene 46 SEQ ID 
JNO:104. 


62 


51 


1572 


gi4263739 


Homo sapiens 


BAG clone GS1-489L14 from 
7pl4-pl2, complete sequence. 


594 


68 


1572 


gi6102812 


Homo sapiens 


GLI3 gene for GLI3 protein. 


594 


68 


1572 


gi 183248 


Homo sapiens 


Human DNA-binding protein 

//~|T TO\ T»*XT A 1 i 1 

(GLI3) mRNA, complete cds. 


594 


68 


1573 


gil019435 


Trypanosoma 

* 

cruzi 


muc in-like protein 


133 


37 


1573 


gil019433 


Trypanosoma 

« 

cruzi 


mucin-like protein; Method: 
conceptual Translation supplied by 
author 


114 


38 


1573 


gi 1280434 


Drosophila 
melanogaster 


hemomucin 


127 


33 


1574 


AAB76873 


Homo sapiens 


Human lung tumour protein related 
protein sequence SEQ ID NO:798. 


141 


37 


1574 


gi60 15472 


Hylobates 
muelleri 


dopamine receptor D4 


133 


34 
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1574 


gil3421134 


Caulobacter 
crescentus 


translation initiation factor IF-2 


155 


34 


1575 


AAB74709 


Homo sapiens 


Human membrane associated 
protein MEMAP-15. 


1294 


100 


1575 


gil4495648 


Homo sapiens 


clone MGC: 15606 
IMAGE:3163718, mRNA, 
complete cds. 


334 


36 


1575 


gi 15721 997 


Homo sapiens 


zonadhesin (ZAN) gene, complete 
cds, alternatively spliced. 


173 


20 


1576 


gil 181628 


Homo sapiens 


H. sapiens APXL mRNA. 


1624 


78 


1576 


gil773381 


Homo sapiens 


chromosome X clone U177G4, 
U152H5, U168D5, 174A6, 
U172D6, and U186B3 from Xp22, 
complete sequence. 


1624 


78 


1576 

* 


AAW58988 


Homo sapiens 


Homo sapiens fetal kidney clone 
BD335 14 encoded protein. 


317 


71 


1577 


gi6979943 


Homo sapiens 


type 1 tumor necrosis factor 
receptor shedding aminopeptidase 
regulator mRNA, complete cds. 


1499 


86 


1577 


gi6381989 


Homo sapiens 


adipocyte-derived leucine 
aminopeptidase mRNA, complete 
cds. 


1492 


86 


1577 


gi6642987 


Homo sapiens 


aminopeptidase PILS (APPILS) 
mRNA, complete cds. 


1492 


86 


1578 


gil4626461 


Rhizobium 
leguminosarum 


HupE 


82 


28 


1578 


gi897 


Canis familiaris 


endothelin-2 


64 


36 


1579 


gi7657864 


Homo sapiens 


BAG clone RP1 1-236P2 from 2, 
complete sequence. 


4025 


98 


1579 


gil 374698 


Homo sapiens 


mRNA for nuclear protein, NP220, 
complete cds. 


4014 


98 


1579 


AAY07032 


Homo sapiens 


Breast cancer associated antigen 
precursor sequence. 


4014 


98 


1580 


gi551065 


Mus musculus 


protease-nexin 1 


995 


83 


1580 


gil4715029 


Mus musculus 


serine (or cysteine) proteinase 
inhibitor, clade E (nexin, 
plasminogen activator inhibitor 
type 1), member 2 


995 


83 


1580 


gi4 12265 


Rattus 
norvegicus 


glia-derived neurite-promoting 
factor (GdNPF) 


994 


82 


1581 


gi499184 


Felis catus 


neuronal protein 


305 


93 


1581 


gil 0433455 


Homo sapiens 


cDNA FLJ12066 fis, clone 
HEMBB 1002266, moderately 
similar to NEURONAL 
PROTEIN. 


286 


65 


1581 


AAB95041 


Homo sapiens 


Human protein sequence SEQ ID 
NO:16804. 


286 


65 


1582 


AAY13385 


Homo sapiens 


Amino acid sequence of protein 
PR0293. 


3183 


99 


1582 


AAB80253 


Homo sapiens 


Human PR0293 protein. 


3183 


99 


1582 


AAB33472 


Homo sapiens 


Human PRO 133 8 protein UNQ693 
SEQ ID NO:279. 


1676 


55 


1583 


gil2803185 


Homo sapiens 


nucleophosmin (nucleolar 
phosphoprotein B23, numatrin), 
clone MGC: 8463 
IMAGE:2821577, mRNA, 
complete cds. 


268 


75 
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1583 


gi 14250 152 


Homo sapiens 


nucleophosmin (nucleolar 
phosphoprotein B23, numatrin), 
clone MGC: 14826 
IMAGE:4276604, mRNA, 
complete cds. 


268 


75 


1583 


gi 152 14852 


Homo sapiens 


nucleophosmin (nucleolar 
phosphoprotein B23, numatrin), 
clone MGC: 13433 
IMAGE :4097025, mRNA, 
complete cds. 


268 


75 


1584 


gi4406691 


Homo sapiens 


clone 24922 mRNA sequence, 
complete cds. 


69 / 


1 flA 


1584 


gi7023544 


Homo sapiens 


cDNA r LJ 1 1 094 lis, clone 
rVLALJi iuioj / j5, weaKiy simuar xo 
TRNA PSEUDOURIDINE 
SYNTHASE B (EC 4.2.1.70). 


oy / 


i nn 
1UU 


1584 


AAB93492 


Jrlomo sapiens 


Human proxem sequence ojo/\^ lu 
NO: 12796. 




1 00 

L\J\J 


•t c o c 

1585 


gil55999 


Bombyx mori 


silk lioroin 


1 zl7 
l^f / 




1585 


gi930003 


Bombyx mori 


silk fibroin (AA 37 - 252) 


147 . 


39 


1585 


gi765323 


Bombyx mori 


silk iibrom neavy cnam 


ICQ 

13o 


Dy 


1586 


gi 12002682 


T T _ J „ 

Homo sapiens 


rxiKM-contaming protein (^Cv_ri J 
mRNA, alternative splice product, 
complete cds. 


IZoZ 


RQ 


1586 


gi/669988 


Homo sapiens 


mKlN A; cUNA URr/ip/OllN loin 
(from clone DKFZp761N1814). 




OO 


1586 


AAB12318 


Homo sapiens 


Human secreted protein encoded 
by gene 18 clone HE2FL70. 




1 HA 
1 


1587 


gi50!96l8 


Homo sapiens 


HrBJU mKJNA, complete cds. 




Qfi 

yu 


1587 


gi4530066 


Homo sapiens 


androgen receptor associated 
protein 54 (AKA54) mRNA, 
complete cds. 


1005 


90 


1587 


AAY78418 


Homo sapiens 


Human androgen receptor 
coactivator ARA54 SEQ ID NO:2. 


1005 


90 


1588 


AAG73501 


Homo sapiens 


Human secreted protein fragment, 
SEQ ID NO:277. 


285 


94 


1588 


gil 196432 


T T * 

Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 




y4 


1588 


gi339771 


Homo sapiens 


Human transposon LI .1 with a 
base deletion relative to L1.2B 
resulting in a premature stop codon 
in the coding region. 




y4 


1589 


AAB94900 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16288. 


287 


Of 


1589 


gil 196433 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


289 


70 


1589 


gil 916229 


Homo sapiens 


Human line-1 reverse transcriptase 
gene, partial cds, and granulocyte 
chemotactic protem-2 (GCP-2) 

gCllC, CDIIipiCJlC luo. 


278 


70 


1590 


gi6562173 


Homo sapiens 


mRNA; cDNA DKFZp566H033 
(from clone DKF2p566H033); 
partial cds. 


227 


88 


1590 


gil 335 199 


Homo sapiens 


Human Kpnl repetitive sequence 
(T-betaG41) 3kb downstream of 
beta-globin gene. 


227 . 


86 


1590 


AAB38280 


Homo sapiens 


Human secreted protein sequence 


226 


88 
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encoded by gene 20 SEQ ID 
NO: 136. 






1591 


gi 10436007 


Homo sapiens 


cDNA FLJ13859 fis, clone 
THYRO1001033, weakly similar 
to TRANSFORMATION- 
SENSITIVE PROTEIN IEF SSP 
3521. 


1794 


80 


1591 


AAB95593 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18273. 


1794 


80 


1591 


gi7020708 


Homo sapiens 


cDNA FLJ20535 fis, clone 
KAT11013. 


1789 


79 


1592 


gi6599260 


Homo sapiens 


mRNA; cDNA DKFZp434P1721 
(from clone DKFZp434P1721); 
partial cds. 


1184 


76 


1592 


gil0567164 


Homo sapiens 


GASC-1 mRNA, complete cds. 


561 


59 


1592 


AAR66461 


Homo sapiens 


AF-17 protein, N-terminal region 
with similarity to peregrin. 


181 


34 


1593 


gi 182020 


Homo sapiens 


Human elastin gene, exon 1 . 


2405 


89 


1593 


AAB08630 


Homo sapiens 


Amino acid sequence of a human 
elastin polypeptide. 


2392 


88 


1593 


gi 182062 


Homo sapiens 


Human elastin mRNA, complete 
cds. 


2386 


88 


1594 


gi 14424570 


Homo sapiens 


clone MGQ16614 
IMAGE :4 1 1 1 344, mRNA, 
complete cds. 


349 


88 


1594 


gi 1196433 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


349 


89 


1594 


AAB38012 


Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HNHCT15. 


349 


89 


1595 


gi291854 


Homo sapiens 


aminopeptidase A mRNA, 
complete cds. 


934 


100 


1595 


gil518865 


Sus scrofa 


aminopeptidase A 


876 


89 


1595 


gi7673021 


Rattus 
norvegicus 


aminopeptidase A 


847 


85 


1598 


gi7288173 


Homo sapiens 


gene for alphal,6 
fucosyltransf erase, exon 5. 


367 


100 


1598 


gi2055307 


Homo sapiens 


mRNA for N-Acetyl-beta-D- 
glucosaminide, complete cds. 


367 


100 


1598 


gi3451263 


Homo sapiens 


mRNA for glycoprotein 6-alpha-L- 
fucosyltransferase, transcript Bl. 


367 


100 


1599 


AAB93267 


Homo sapiens 


Human protein sequence SEQ ID 
NO:12300. 


149 


46 


1599 


gi295671 


Saccharomyces 
cerevisiae 


selected as a weak suppressor of a 
mutant of the subunit AC40 of 
DNA dependant RNA polymerase 
I and III 


123 


25 


1599 


gill559313 


Halocynthia 
roretzi 


synaptotagmin 


121 


24 


1600 


AAB48139 


Homo sapiens 


Human TANGO 209 variant 1 
polypeptide. 


219 


79 


1600 


AAB48140 


Homo sapiens 


Human TANGO 209 variant 2 
polypeptide. 


219 


79 


1600 


AAB48141 


Homo sapiens 


Human TANGO 209 variant 3 
polypeptide. 


224 


51 


1602 


gil2053353 


Homo sapiens 


mRNA; cDNA DKFZp586D0222 
(from clone DKFZp586D0222); 
complete cds. 


548 


82 
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1602 


gi7023456 


Homo sapiens 


cDNA FLJ11040 fis, clone 
PLACE1004388. 


548 


82 


1602 


gi 10434241 


Homo sapiens 


cDNA FLJ 12633 fis, clone 
NT2RM4001856. 


548 


82 


1603 


gil5082532 


Homo sapiens 


clone MGC: 20434 
IMAGE:4650497, mRNA, 
complete cds. 


212 


85 


1603 


gil3096814 


Mus musculus 


RIKEN cDNA 4932442K08 gene 


165 


72 


1603 


gi9622395 


Plasmodium 
falciparum 


variant surface protein 


66 


44 


1604 


gil321596 


Homo sapiens 


Human (clone HSLV2132) Ig 
lambda light chain V region, 
subgroup II, partial sequence. 


374 


100 


1604 


AAG76014 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:6778. 


362 


97 


1604 


AAB36212 


Homo sapiens 


Human immune system associated 
protein HISAP-10. 


342 


91 


1605 


gi 10434674 


Homo sapiens 


cDNA FLJ1291 1 fis, clone 
NT2RP2004425, highly similar to 
Mus musculus axotrophin mRNA. 


2340 


94 


1605 


AAB95234 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17375. 


2340 


94 


1605 


AAB27239 


Homo sapiens 


Human EXMAD-17 SEQ ID NO: 
17. 


2340 


94 


1606 


gi8918522 


Homo sapiens 


PCCX2 mRNA for protein 
containing CXXC domain 2, 
partial cds. 


2200 


84 


1606 


gil4042116 


Homo sapiens 


cDNA FLJ14534 fis, clone 
NT2RM2000599, weakly similar 
to Homo sapiens F-box protein 
Lilina (LILINA) mRNA. 


2446 


78 


1606 


AAB92702 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11102. 


2446 


78 


1607 


gi3 176762 


Homo sapiens 


receptor for viral semaphorin 
protein (VESPR) mRNA, complete 
cds. 


807 


100 


1607 


AAY13462 


Homo sapiens 


Viral-encoded semaphorin protein 
receptor (VESPR) polypeptide. 


807 


100 


1607 


AAB28522 


Homo sapiens 


Human VESPR. 


807 


100 


1608 


gil373425 


Homo sapiens 


Human bumetanide-sensitive Na~ 
K-2C1 cotransporter (NKCC2) 
mRNA, complete cds. 


1295 


99 


1608 


AAW29683 


Homo sapiens 


Human Na-K-2C1 cotransporter 
NKCC2. 


1295 


99 


1608 


gi5 16000 


Oryctolagus 
cuniculus 


bumetanide-sensitive Na-K-Cl 
cotransport protein splice isoform 
B 


1244 


95 


1609 


AAB95655 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18417. 


229 


61 


1609 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 
NO:121. 


221 


58 


1609 


gi 1196433 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


226 


60 


1610 


AAB88388 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC0131. 


923 


100 


1610 


AAB25719 


Homo sapiens 


Human secreted protein sequence 


114 


100 
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encoded by gene 6 SEQ ID 
NO: 108. 






1610 


AAB25718 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 6 SEQ ID 
NO: 107, 


73 


100 


1611 


AAY04734 


Homo sapiens 


Protein containing PDZ domain 
from clone 38-2-lc. 


269 


98 


1611 


AAY04732 


Homo sapiens 


Protein containing PDZ domain 
from clone 3 8 -2- la. 


269 


98 


1611 


AAY53753 


Homo sapiens 


Amino acid sequence of the 
MMSC2 protein. 


269 


98 


1612 


gi 17942 11 


Homo sapiens 


Human oncostatin-M specific 
receptor beta subunit (OSMRB) 
mRNA, complete cds. 


1127 


92 


1612 


AAR85912 


Homo sapiens 


Oncostatin M receptor-beta 
subunit. 


1127 


92 


1612 


gil5012082 


Homo sapiens 


Similar to oncostatin M receptor, 
clone MGC: 13583 
IMAGE:4043935, mRNA, 
complete cds. 


1127 


92 


1613 


gil2803103 


Homo sapiens 


heterogeneous nuclear 
ribonucleoprotein Al, clone 
MGC:8473 IMAGE: 282 1751, 
mRNA, complete cds. 


406 


82 


1613 


gil50824S6 


Homo sapiens 


Similar to heterogeneous nuclear 
ribonucleoprotein Al, clone 
MGC:20389 IMAGE :45 6465 5, 
mRNA, complete cds. 


406 


82 


1613 


gi496898 


Homo sapiens 


H.sapiens mRNA for hnRNPcore 
protein Al. 


406 


82 


1614 


gi 12653633 


Homo sapiens 


lysyl oxidase-like 2, clone 
MGC: 1709 IMAGE:3347512, 
mRNA, complete cds. 


898 


99 


1614 


gi 1890108 


Homo sapiens 


Human lysyl oxidase-related 
protein (WS9-14) mRNA, 
complete cds. 


898 


99 


1614 


AAB00077 


Homo sapiens 


Human lysyl oxidase related 
protein (Lor). 


898 


99 


1615 


gil2803157 


Homo sapiens 


COX15 (yeast) homolog, 
cytochrome c oxidase assembly 
protein, clone MGC: 8634 
IMAGE:2961532, mRNA, 
complete cds. 


159 


52 


1615 


gil5426569 


Homo sapiens 


clone MGC4234 
IMAGE:2961532, mRNA, 
complete cds. 


159 


52 


1615 


gi3603230 


Homo sapiens 


cytochrome oxidase assembly 
factor (COX15) mRNA, nuclear 
gene encoding mitochondrial 
protein, complete cds. 


159 


52 


1616 


gil215746 


Bos taurus 


vacuolar system associated 
protein-60 


1309 


67 


1616 


gi7672979 


Homo sapiens 


glucosidase II beta subunit mRNA, 
complete cds. 


1656 


71 


1616 


gi!293640 


Homo sapiens 


Human protein kinase C substrate 
80K-H (PRKCSH) gene, exon 15- 
17. 


1652 


70 
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1617 


gil418820 


Homo sapiens 


H. sapiens mRNA for gamma 1 
isoform of 61kDa regulatory 
subunit ofPP2A. 


355 


100 


1617 


gil203812 


Homo sapiens 


Human protein phosphatase 2A 
B 'alpha 1 regulatory subunit 
mRNA, complete cds. 


355 


100 


1617 


gil000892 


Homo sapiens 


protein phosphatase 2 A B56- 
gammal (PP2A) mRNA, V end of 
cds. 


355 


100 


1618 


gil4010930 


Homo sapiens 


BAG clone RP1 1-576F1 from 2, 
complete sequence. 


1868 


100 


1618 


gi7022375 


Homo sapiens 


cDNA FLJ 10379 fis, clone 
NT2RM2002014. 


1863 


99 


1618 


AAB92758 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11220. 


1863 


99 


1619 


gi296665 


Homo sapiens 


Human bone marrow serine 
protease gene (medullasin) 
(leukocyte neutrophil elastase 
gene). 


629 


93 


1619 


gi34533 


Homo sapiens 


Human mRNA for medullasin 
(leukocyte (neutrophil) elastase. 


629 


93 


1619 


gi307123 


Homo sapiens 


Human elastase/medullasin 
mRNA, complete cds. 


629 


93 


1620 


gi9965989 


Homo sapiens 


calcineurin A catalytic subunit 
gamma isoform mRNA, complete 
cds. 


1650 


97 


1620 


gi258001 


human, testis, 
mRNA, 2134 nt]. 
[Homo sapiens 


calcineurin A catalytic subunit 


1639 


96 


1620 


gi 13436077 


Homo sapiens 


clone MGC: 10576 
IMAGE:3677098, mRNA, 
complete cds. 


1623 


94 


1621 


gi 1429374 


Escherichia coli 


DcuC protein 


739 


100 


1621 


gil3360118 


Escherichia coli 
0157:H7 


c4-dicarboxylate anaerobic carrier 
DcuC 


739 


100 


1621 


gil786839 


Escherichia coli 
K12 


transport of dicarboxylates 


739 


100 


1622 


A AB 82485 


Homo sapiens 


Human secretin-like receptor 
Zgprl. 


271 


94 


1622 


AAB66272 


Homo sapiens 


Human TANGO 378 SEQ ID NO: 
29. 


271 


94 


1622 


AAB82487 


Homo sapiens 


Human secretin-like receptor 
Zgprl splice variant. 


271 


94 


1623 


gi4062658 


Escherichia coli 


Flagellar basal body P-ring protein 
precursor 


752 


87 


1623 


gil787320 


Escherichia coli 
K12 


homolog of Salmonella P-ring of 
flagella basal body 


752 


87 


1623 


gil3360922 


Escherichia coli 
0157:H7 


flagellar basal body P-ring protein 
Flgl 


751 


86 


1624 


gil773192 


Escherichia coli 


similar to S. cerevisiae dall 


334 


98 


1624 


gi2735238 


Escherichia coli 


GlxB3 


334 


98 


1624 


gil3360031 


Escherichia coli 
0157:H7 


allantoinase 


333 


96 


1625 


gi40992 


Escherichia coli 


dehydrogenase 


1481 


93 


1625 


gil3363752 


Escherichia coli 
0157:H7 


aspartate-semialdehyde 
dehydrogenase 


1481 


93 
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1625 


gi3859587 


Shigella sonnei 


aspartate semialdehyde 
dehydrogenase 


1481 


93 


1626 


gil46346 


Escherichia coli 


phenylalanyl-tRNA synthetase 
beta-subunit 


735 


94 


1626 


gil742793 


Escherichia coli 


Phenylalanine— tRN A ligase (EC 
6.1.1.20) b chain 


735 


94 


1626 


gil3361887 


Escherichia coli 
0157:H7 


phenylalanine tRNA synthetase 
beta-subunit 


735 


94 


1627 


AAB49502 


Homo sapiens 


Clone HYASC03. 


310 


98 


1627 


gi7020468 


Homo sapiens 


cDNA FLJ20396 fis, clone 
KAT00561. 


144 


48 


1627 


AAB 18980 


Homo sapiens 


Amino acid sequence of a human 
transmembrane protein. 


144 


48 


1628 


gil4021587 


Mesorhizobium 
loti 


transcriptional regulator 


333 


38 


1628 


gil 4523075 


Sinorhizobium 
meliloti 


probable LysR-family protein 


329 


39 


1628 


gi9949248 


Pseudomonas 
aeruginosa 


probable transcriptional regulator 


327 


37 


1629 


gi3 11422 


Escherichia coli 


ORF-2 


282 


100 


1629 


gil5081358 


uncultured 
bacterium 


PufM 


53 


41 


1629 


gil 3362968 


Escherichia coli 
0157:H7 


holin protein 


52 


33 


1630 


gil59333 


Leishmania 
major 


glycoprotein 96-92 


101 


28 


1630 


AAY91958 


Homo sapiens 


Human cytoskeleton associated 
protein 13 (CYSKP-13). 


100 


28 


1630 


gi8 163686 


Streptococcus 
pneumoniae 


surface protein PspC 


105 


29 


1632 


gi8 87820 


Escherichia coli 


UUG start; possible frameshift at 
end? 


820 


71 


1632 


gi466651 


Escherichia coli 


No definition line found 


343 


82 


1632 


gil 742360 


Escherichia coli 


Phosphinothricin acetyltransferase 
(EC 2.3.1.-). 


248 


83 


1633 


gi7022678 


Homo sapiens 


cDNA FLJ 10565 fis, clone 
NT2RP2002954. 


290 


100 


1633 


AAB92950 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11 629. 


290 


100 


1633 


gil755198 


Cavia porcellus 


nitric oxide synthase 


71 


33 


1634 


AAY48563 


Homo sapiens 


Human breast tumour-associated 
protein 24. 


342 


100 


1634 


gil2804499 


Homo sapiens 


ribonuclease 6 precursor, clone 
MGC:1360 IMAGE:2959583, 
mRNA, complete cds. 


440 


66 


1634 


gil2804759 


Homo sapiens 


ribonuclease 6 precursor, clone 
MGG3554 IMAGE:2959583, 
mRNA, complete cds. 


440 


66 


1635 


gil0438872 


Homo sapiens 


cDNA: FLJ22471 fis, clone 
HRC 10529. 


1620 


100 


1635 


AAY86509 


Homo sapiens 


Human gene 70-encoded protein 
fragment, SEQ ID NO:424. 


696 


100 


1635 


AAY86510 


Homo sapiens 


Human gene 70-encoded protein 
fragment, SEQ ID NO:425. 


436 


100 


1636 


gi8096340 


Homo sapiens 


mRNA for RERE, complete cds. 


365 


97 


1636 


gi7413896 


group A 


NSP3 protein 


71 


29 
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rotavirus 








1636 


gi9368393 


Human rotavirus 


NSP3 protein 


65 


27 


1637 


AAB58426 


Homo sapiens 


Lung cancer associated 
polypeptide sequence SEQ ID 764. 


514 


94 


1637 


AAB00191 


Homo sapiens 


Breast cancer protein BCN5. 


514 


94 


1637 


AAB85481 


Homo sapiens 


Human 23553 sulfatase 
polypeptide. 


514 


94 


1638 


AAY42750 


Homo sapiens 


Human calcium binding protein 1 
(CaBP-1). 


741 


100 


1638 


gil2060826 


Homo sapiens 


serologically defined breast cancer 
antigen NY-BR-20 mRNA, partial 
cds. 


673 


89 


1638 


AAY07006 


Homo sapiens 


Breast cancer associated antigen 
precursor sequence. 


637 


88 


1639 


gi 10434205 


Homo sapiens 


cDNA FLJ12612 fis, clone 
NT2RM4001582, highly similar to 
Mus musculus COP9 complex 
subunit 7b (COPS7b) mRNA. 


908 


86 


1639 


AAB94175 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 14484. 


908 


86 


1639 


gi 152 15085 


Mus musculus 


Similar to COP9 (constitutive 
photomorphogenic), subunit 7b 
(Arabidopsis) 


900 


85 


1640 


gil334836 


Human 
herpesvirus 4 


BCRF2 3072 repeat, reading frame 
1 


98 


29 


1640 


gil334837 


Human 
herpesvirus 4 


BWRF1 reading frame 2 


98 


29 


1640 


gil334838 


Human 
herpesvirus 4 


BWRF1 reading frame 3 


98 


29 


1641 


gil3161011 


Homo sapiens 


sclerostin gene, complete cds. 


758 


100 


1641 


gil3161020 


Homo sapiens 


sclerostin mRNA, complete cds. 


758 


100 


1641 


gi 132364 18 


Homo sapiens 


SOST (SOST) mRNA, complete 
cds. 


758 


100 


1642 


gil3436023 


Mus musculus 


RIKEN cDNA B2301 14J08 gene 


279 


94 


1642 


AAY36125 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 510. 


279 


94 


1642 


AAY99458 


Homo sapiens 


Human PRO 193 amino acid 
sequence SEQ ID NO:410. 


279 


94 


1643 


gil5680215 


Homo sapiens 


Similar to ribosomal protein L23a, 
clone MGC:23063 
IMAGE:4872948, mRNA, 
complete cds. 


428 


76 


1643 


gi 15990426 


Homo sapiens 


Similar to cadherin 1, type 1, E- 
cadherin (epithelial), clone 
MGC:23017 IMAGE: 3 95 9 042, 
mRNA, complete cds. 


428 


76 


1643 


gi404015 


Homo sapiens 


Human ribosomal protein L23a 
mRNA, partial cds. 


428 


76 


1644 


gil2653271 


Homo sapiens 


thyroid hormone receptor 
interactor 13, clone MGQ8565 
IMAGE:2822981, mRNA, 
complete cds. 


514 


82 


1644 


gi2232019 


Homo sapiens 


HP VI 6 El protein binding protein 
mRNA, complete cds. 


514 


82 


1644 


gi703102 


Homo sapiens 


thyroid receptor interactor 
(TRIP 13) mRNA, partial cds. 


513 


82 
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1646 


gil 3544066 


Homo sapiens 


Similar to mini chromosome 
maintenance deficient 2 (S. 
cerevisiae), clone MGC: 13220 
IMAGE: 3 959276, mKJNA, 
complete cds. 


273 


9o 


1646 


gil4043350 


TT ' 

Homo sapiens 


clone MLrCrzi 23 
IMAGE.3143264, mRNA, 
complete cos. 






1646 


gil40440l6 


Homo sapiens 


clone MGC: 14281 

TAT A. /^TJ./l 1 "2 1 *-k-.OMA 

lMA\jrii:41 J iy43, mKJNA, 

complete cds. 


273 


98 


1648 


gi7960207 


Oncorhynchus 
mykiss 


vitelline envelope protem aipna 


ZjZ 




1648 


gil60l98 


Plasmodium 
knowlesi 


circumsporozoite protein 


1 Q1 

iy 1 




1648 


gii5oo4/y>> 


Mycoplasma 
bovis 


vanaoie sunace npoproiem 
Vsp422-8 


1 / / 


^4 


1650 


gi474280 


Mus mus cuius 


mannosyi-oiigosaccnariQe aipna- 
i ,z-mannosiaase 


Ql 1 


60 


1650 


gil 5929672 


Mus musculus 


Similar to mannosidase 1, alpha 


911 


88 


1650 


gi2154997 


Sus scrofa 


Man9-mannosidase 


869 


83 


1651 


gi59 15662 


TT 1 

Homo sapiens 


integrin alpha 1 1 subunit precursor 
(ITGA11) mRNA, complete cds. 


j>4Z 


QA 


1651 


AAB30929 


Homo sapiens 


Amino acid sequence of a human 
alpha 11 integrin chain. 


542 


84 


1651 


AAB50085 


Homo sapiens 


Human A259. 


542 


84 


1652 


gi45 12295 


Homo sapiens 


DNA for immunoglobulin heavy- 
chain variable region, complete 
sequence, 3 of 5. 


619 


100 


1652 


gi296657 


Homo sapiens 


Human Ig heavy chain gene 
variable region V(lZLr-l ) (v(n;-iv 
family). 


oli 


yy 


1652 


gil85579 


Homo sapiens 


H.sapiens lmmunoglouulin 
germline heavy chain gene, V 
region. 


OU 1 


07 


1654 


AAE04841 


Homo sapiens 


Human SGP039 phosphatase 
polypeptiae. 


667 


92 


1654 


gi2582800 


Medicago sativa 


protein phosphatase 2C 


107 


41 


1654 


gi8778653 


Arabidopsis 
thaliana 


ryClo.o 


1 fW 

1U / 




1655 


gi9928511 


Mycobacterium 
tuberculosis 


SEQ ID NO 18B* 


89 


27 


1655 


gil3897999 


Galleria 
mellonella 


silk protease inhibitor 2 precursor 


50 


40 


1655 


gi204419 


Rattus 
norvegicus 


giycam 1 


nr 


Z4 


1656 


gil2653509 


Homo sapiens 


DKFZP564K1964 protein, clone 
MGC:8605 IMAGE:2961267, 
mRNA, complete cds. 


221 


AC 

95 


1656 


gov lziyy 


riomo sapiens 


mKJNA; CJJJNA jJivJ H Z.pjo4lvl yO'H- 

(from clone DKFZp564K1964); 
complete cds. 


ZZ / 




1656 


gi4530587 


Homo sapiens 


TADA1 protein mRNA, complete 
cds. 


227 


95 


1658 


gil 2654931 


Homo sapiens 


protein disulfide isomerase-related 
protein, clone MGC:55 17 
IMAGE:3454007, mRNA, 


2281 


100 
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complete cds. 






1658 


gil 136743 


Homo sapiens 


Human mRNA for protein 
disulfide isomerase-related protein 
P5, complete cds. 


2281 


100 


1658 


AAW25154 


Homo sapiens 


Human disulphide epimerase like 
enzyme, EP52. 


2281 


100 


1659 


gi6694278 


Homo sapiens 


cell recognition molecule Caspr2 
(CASPR2) mRNA, complete cds. 


356 


95 


1659 


gil3624214 


Homo sapiens 


contactin-associated protein 2 
(CNTNAP2) mRNA, complete 
cds. 


356 


95 


1659 


gil 857708 


Homo sapiens 


contactin associated protein 
(Caspr) mRNA, complete cds. 


140 


37 


1660 


AAG02922 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7003. 


136 


96 


1660 


gi34373 


Homo sapiens 


Human DNA for LINE-1 
transposable element ORFI and II. 


135 


96 


1660 


gi7959889 


Homo sapiens 


PR02221 


127 


88 


1661 


gi4588087 


Homo sapiens 


PTH-responsive osteosarcoma B 1 
protein (Bl) mRNA, complete cds. 


569 


87 


1661 


AAG74841 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:5605. 


569 


87 


1661 


gi4588085 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein (Dl) mRNA, partial cds. 


312 


84 


1662 


gi4754907 


Homo sapiens 


hi stone deacetylase 4 mRNA, 
complete cds. 


2090 


94 


1662 


AAB49957 


Homo sapiens 


Human histone deacetylase 
HDAC-4. 


2090 


94 


1662 


gil4495171 


Gallus gallus 


histone deacetylase-4 


1790 


78 


1663 


AAB74704 


Homo sapiens 


Human membrane associated 
protein MEMAP-10. 


183 


100 


1663 


gi2621542 


Methanothermob 
acter 

thermautotrophic 
us 


conserved protein 


83 


70 


1663 


gil 0440349 


Homo sapiens 


mRNA for FLJ00009 protein, 
partial cds. 


82 


35 


1664 


gil5159543 


Agrobacterium 
tumefaciens 


AGR_LJ2143p 


99 


27 


1664 


gi5360174 


Gallus gallus 


NOTCH- 1 


98 


31 


1664 


gi4960212 


Bos taurus 


cone-rod home ob ox 


85 


27 


1665 


AAB32388 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 1 8 SEQ ID 
NO:74. 


359 


100 


1665 


AAY91419 

> 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 6 SEQ ID 
NO: 140. 


83 


36 


1665 


gi860970 


Homo sapiens 


H. sapiens mRNA for HPS protein. 


84 


35 


1666 


gil5012095 


Homo sapiens 


Similar to protease inhibitor 3, 
skin-derived (SKALP), clone 
MGC: 13613 IMAGE:4083155, 
mRNA, complete cds. 


621 


100 


1666 


gi28712 


Homo sapiens 


H. sapiens encoding skin-derived 
antileukoproteinase . 


621 


100 


1666 


gi2 19615 


Homo sapiens 


Human gene for elafm, complete 
cds. 


621 


100 
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1667 


gi6650233 


Homo sapiens 


zinc finger protein 74 (ZNF74) 
gene, exon 3, alternative splice 
products and complete cds. 


259 


58 


1667 


gil5081398 


Homo sapiens 


kruppel-like zinc finger protem 
(ZNF30O) mRNA, complete cds. 


246 


73 


1667 


gi 1769491 


Homo sapiens 


Human kruppel-related zinc finger 
protein (ZNF184) mRNA, partial 
cds. 


246 


64 


1668 


AAB38280 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 20 SEQ ID 
NO:l36. 


142 


62 


1668 


gill 96431 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


131 


58 


1668 


gi6562173 


Homo sapiens 


mRNA; cDNA DKFZp566H033 
(from clone DKFZp566H033); 
partial cds. 


147 


60 


1669 


AAG03136 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7217. 


179 


75 


1669 


AAG02563 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6644. 


77 


60 


1669 


AAG73365 


Homo sapiens 


Human gene 20-encoded secreted 
protein HFCAA91, SEQ ID 
NO: 136. 


76 


63 


1670 


gil2804499 


Homo sapiens 


ribonuclease 6 precursor, clone 
MGC:1360 IMAGE: 29595 83, 
mRNA, complete cds. 


376 


83 


1670 


gil2804759 


Homo sapiens 


ribonuclease 6 precursor, clone 
MGC:3554 IMAGE:2959583, 
mRNA, complete cds. 


376 


83 


1670 


gi5091495 


Homo sapiens 


ribonuclease 6 precursor, mRNA, 
complete cds. 


376 


83 


1672 


AAG63163 


Homo sapiens 


Amino acid sequence of 
carcmoembryonic antigen-like 
polypeptide. 


2187 


99 


1672 


AAR54714 


Homo sapiens 


Carcmoembryonic antigen CEA- 
(c). 


A) S~ 

463 


34 


1672 


AAB43688 


Homo sapiens 


Human cancer associated protem 
sequence SEQ ID NO: 1133. 


At A*" ^ 

466 


31 


1673 


gi 13097624 


Homo sapiens 


clone 1MAGE:36080S4, mRNA, 
partial cds. 


760 


89 


1673 


gil0438279 


Homo sapiens 


cDNA: FLJ22029 fis, clone 
HEP08661. 


411 


51 


1673 


AAB93267 


Homo sapiens 


Human protem sequence SEQ ID 
NO: 12300. 


379 


47 


1674 


gi339776 


Homo sapiens 


Human transposon LI .2. 


686 


85 


1674 


gi5070621 


Homo sapiens 


retrotransposon LI insertion in X- 
linked retinitis pigmentosa locus, 
complete sequence. 


686 


85 


1674 


gi5052950 


Homo sapiens 


LINE1 element inserted in B- 
globin gene intron 2. 


684 


85 


1675 


AAB56206 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 130 SEQ ID 
NO:300. 


44 


43 


1675 


AAB56350 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 130 SEQ ID 
NO:444. 


52 


39 


1675 


AAW33908 


Homo sapiens 


Peptide analogue of human insulin- 


57 31 



I 
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like growth factor- 1 (IGF-1). 






1676 


gi34234 


Homo sapiens 


H. sapiens mRNA for laminin- 
binding protein. 


290 


67 


1676 


gi 13529269 


Homo sapiens 


laminin receptor 1 (67kD, 
ribosomal protein SA), clone 
MGC: 12521 IMAGE :3 9970 19, 
mRNA, complete cds. 


289 


67 


1676 


gi 14250794 


Homo sapiens 


laminin receptor 1 (67kD, 
ribosomal protein SA), clone 
MGC:16750 IMAGE:4 130936, 
mRNA, complete cds. 


289 


67 


1677 


gi475782 


Homo sapiens 


Human GS2 gene, exon 7 and 
complete cds. 


1249 


97 


1677 


gi458226 


Homo sapiens 


Human GS2 mRNA, complete cds. 


1249 


97 


1677 


AAG00737 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4818. 


531 


94 


1678 


gi3 089427 


Homo sapiens 


SSC6 rearranged T cell receptor 
beta chain (TCRBV17) gene, 
complete cds. 


444 


100 


1678 


gi3089433 


Homo sapiens 


SSC9 rearranged T cell receptor 
beta chain (TCRBV17) gene, 
complete cds. 


444 


100 


1678 


gi3 002927 


Homo sapiens 


T cell receptor beta chain 
(TCRBV1 7S 1-TCRBJ1S5) 
mRNA, complete cds. 


444 


100 


1679 


gil5929119 


Homo sapiens 


clone MGC: 8834 
IMAGE: 3920437, mRNA, 
complete cds. 


455 


100 


1679 


gi70221 59 


Homo sapiens 


cDNA FLJ10242 fis, clone 
HEMBB1 000630. 


455 


100 


1679 


AAB92624 


Homo sapiens 


Human protein sequence SEQ ID 
NO:10919. 


455 


100 


1680 


gil0186503 


Homo sapiens 


sialic acid-specific acetylesterase II 
mRNA, complete cds, alternatively 
spliced. 


817 


100 


1680 


gi6808138 


Homo sapiens 


mRNA; cDNA DKFZp761A051 
(from clone DKFZp761A051); 
partial cds. 


817 


100 


1680 


gi 10242345 


Homo sapiens 


sialic acid-specific 9-Q- 
acetylesterase I mRNA, complete 
cds. 


753 


100 


1681 


gi4768463 


Homo sapiens 


clone 118 T-cell receptor beta 
chain (TCRBV10S1P) mRNA, 
partial cds. 


76 


44 


1681 


gi4768574 


Homo sapiens 


clone 179 T-cell receptor beta 
chain (TCRBV10S1P) mRNA, 
partial cds. 


71 


42 


1681 


gi4768459 


Homo sapiens 


clone 115 T-cell receptor beta 
chain (TCRBV10S1P) mRNA, 
partial cds. 


69 


44 


1682 


gi4097459 


Elephantulus 
edwardii 


reverse transcriptase 


68 


48 


1683 


gi 12203 15 


Homo sapiens 


early placenta insulin-like peptide 
EPIL (INSL4) mRNA, complete 
cds. 


349 


100 


1683 


AAR89134 


Homo sapiens 


Human early placental insulin-like 
protein. 


349 


100 
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1683 


AAW17676 


Homo sapiens 


Human relaxin-r elated factor~2 
(RRF-2). 


349 


100 


1684 


gi23365 


Homo sapiens 


Human mRNA for 17-beta- 
hydroxysteroid dehydrogenase (17- 
HSD) (EC 1.1.1.62). 


800 


83 


1684 


gi975895 


Homo sapiens 


Human 17-beta-hydroxysteroid 
dehydrogenase (EDH17B2) gene, 
complete cds. 


800 


83 


1684 


gil77127 


Homo sapiens 


Human 17-beta-hydroxysteroid 
dehydrogenase (EDH17B1 and 
EDH17B2) genes, complete coding 
regions and flanks. 


800 


83 


1685 


AAB18919 


Homo sapiens 


A novel polypeptide designated 
PRQ4356. 


1336 


99 


1685 


AAB31206 


Homo sapiens 


Amino acid sequence of human 
polypeptide PR04356. 


1336 


99 


1685 


gi5834584 


Homo sapiens 


mRNA encoding rat C4.4-like 
protein. 


231 


31 


1686 


gil655963 


Homo sapiens 


Human transforming growth 
factor-beta type II receptor gene, 
exon 7 and complete cds. 


1634 


99 


1686 


gi339570 


Homo sapiens 


Human TGF-beta type II receptor 
mRNA, complete cds. 


1634 


99 


1686 


AAR36601 


Homo sapiens 


TGF-beta 1 receptor type II (clone 
3FF). 


1634 


99 


1687 


gi508260 


Homo sapiens 


Human type 1 vasoactive intestinal 
peptide receptor (VI RG) gene, 
exon 13 and complete cds. 


390 


76 


1687 


gi407462 


Homo sapiens 


H.sapiens HIVR mRNA for 
vasoactive intestinal peptide (VIP) 
receptor. 


390 


76 


1687 


gi292904 


Homo sapiens 


Human vasoactive mtestial peptide 
receptor mRNA, complete cds. 


390 


76 


1688 


gil81268 


Homo sapiens 


Human c-yes-1 mRNA. 


520 


90 


1688 


AAY24421 


Homo sapiens 


Human yesl protein. 


520 


90 


1688 


AAB84663 


Homo sapiens 


Amino acid sequence of human 
tyrosine kinase protein Yes. 


520 


90 


1689 


AAG02314 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6395. 


139 


55 


1689 


gil3325174 


Homo sapiens 


clone MGC: 10997 
IMAGE:3638158, mRNA, 
complete cds. 


104 


70 


1689 


gi9652123 


Mus musculus 


disrupter of silencing SAS10 


86 


40 


1690 


gi 135 16467 


Homo sapiens 


HDNBl(nomzygously deleted m 
neuroblastoma- 1)/UFD2 mRNA, 
complete cds. 


1217 


89 


1690 


gi4 104976 


Homo sapiens 


ubiquitin-fusion degradation 
protein 2 (UFD2) mRNA, 
complete cds. 


1217 


89 


1690 


gi 14582754 


XX ■ 

Homo sapiens 


ubiqmtm-fusion degradation 
protein 2 mRNA, complete cds. 


1217 


89 


1691 


gi2463632 


Homo sapiens 


monocarboxylate transporter 
homologue MCT6 mRNA, 
complete cds. 


326 


96 


1691 


AAG73420 


Homo sapiens 


Human gene 21 -encoded secreted 
protein HBWBR94, SEQ ID 
NO: 192. 


63 


35 
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1691 


gi9246437 


Staphylococcus 
aureus 


fmtA-like protein 


83 


26 


1692 


AAB58175 


Homo sapiens 


Lung cancer associated 
polypeptide sequence SEQ ID 513. 


394 


98 


1692 


gil4211500 


Homo sapiens 


secretory protein SEC8 mRNA, 
complete cds. 


394 


98 


1692 


gil4042555 


Homo sapiens 


cDNA FLJ14782 fis, clone 
NT2RP4000524, highly similar to 
Mus musculus Sec8 mRNA. 


394 


98 


1693 


gi3786312 


T T * 

Homo sapiens 


mRNA for extracellular matrix 
protein, complete cds. 


498 


39 


1693 


gi 13937865 


T X * 

Homo sapiens 


lumican, clone MGC: 12410 
IMAGE:3950745, mRNA, 
complete cds. 


312 


29 


1693 


gi699577 


Homo sapiens 


Human lumican mRNA, complete 
cds. 


312 


29 


1694 


gi3002588 


Mus musculus 


Plenty ofSH3s; POSH 


219 


93 


1694 


gi7230620 


Rattus 
norvegicus 


SH3 domam-containing adapter 
protein isoform SETA- 1x23 


130 


34 


1694 


AAW34246 


T T * 

Homo sapiens 


SH3 domain of human clone 53 
protein. 


114 


50 


1695 


gil0645308 


T " 1 * 

Leishmama 
major 


L8453.1 


151 


27 


1 /"7 C\ C 

1695 


gil5419013 


Toxoplasma 
gondii 


subtilisin-like protein 


147 


30 


1695 


gu2018147 


Chlamydomonas 
reinhardtii 


vegetative cell wall protein gpl 


143 


30 


1696 


AAB43791 


X X * 

Homo sapiens 


XT • i "1 j * 

Human cancer associated protein 
sequence SEQ ID NO: 1236. 


353 


98 [ 


1697 


gil56368 


Caenorhabditis 
elegans 


metallothionein-2 


56 


39 


1697 


gil56381 


Caenorhabditis 
elegans 


metallothionein-like protein 


-B 


39 


1697 


gi6782 


Caenorhabditis 
elegans 


metall othionein-II 


56 


39 


1698 


gi9858855 


XX * 

Homo sapiens 


HPT protein (HPT) mRNA, 
complete cds; nuclear gene for 
mitochondrial product. 


1318 


85 


1698 


AAB29653 


T T * 

Homo sapiens 


X X 1 * i 1 

Human membrane-associated 
protein HUMAP-10. 


1318 


85 


1698 


AAB32389 


Homo sapiens 


TT j j , 

Human secreted protein sequence 
encoded by gene 1 9 SEQ ID 
NO:75. 


13 1 8 


85 


1699 


gi6841138 


Homo sapiens 


HSPC099 mRNA, partial cds. 


275 


100 


1699 


gi7022824 


Homo sapiens 


cDNA FLJ10656 fis, clone 
NT2RP2006038. 


123 


23 


1 s~ f~\r\ 

1699 


AAB93037 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11 816. 


123 


23 


1 (\)\) 


A A T3Q£^ C7 

AAooODo / 


riomo sapiens 


riuman rLiiArll-y protein 
sequence SEQ ID NO:9. 


585 


5j 


1700 


gi7023841 


Homo sapiens 


cDNA FLJ11280 fis, clone 
PLACE1009459. 


581 


54 


1700 


AAB93652 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13161. 


581 


54 


1702 


gil2330618 


Human 

immunodeficienc 


envelope glycoprotein 


71 


29 
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y virus type 1 








1703 


AAR48975 


Homo sapiens 


Human beta 1 , 6-N- 

acetylglucosaminyltransferase 

protein. 


413 


100 


1703 


gi870752 


Homo sapiens 


Human mRNA for N~ 
acetylglucosaminyltransferase V, 
complete cds. 


413 


100 


1703 


gi4545222 


Homo sapiens 


alpha- l,3(6)-mannosyl glycoprotein 

beta-l,6-N-acetyl- 

gluc os aminyltransf erase (MGAT5) 

mRNA, complete cus. 


413 


100 


1704 


gil 26545 35 


Homo sapiens 


HSPC025, clone MGC:735 
IMAGE:3507964, mRNA, 
complete cds. 


1251 


81 


1704 


gil3960140 


Homo sapiens 


HSPC025, clone MGC:4223 
1MAGE:2959747, mKJNA, 
complete cds. 


1251 


81 


1704 


gi4679028 


Homo sapiens 


HSPC021 


1ZD 1 




1706 


gil4250636 


Homo sapiens 


nuclear factor of kappa light 
polypeptide gene enhancer in B- 
cells inhibitor-like 2, clone 
MGC:339o IMAOL:3ozo3 /4, 
mKJNA, complete cds. 


2605 


86 


1706 


gi6580428 


Homo sapiens 


partial NFKBIL2 gene for 
IkappaBR, exons 1-13. 


2751 


95 


1706 


gi746415 


Homo sapiens 


Human I kappa BR mRNA, 
complete cds. 


1668 


72 


1707 


AAB95830 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18850. 




/z 


1707 


gi7959889 


Homo sapiens 


PR02221 


137 


49 


1707 


gi2072969 


Homo sapiens 


Human LI element LI. 24 p40 
gene, complete cds. 


133 


48 


1708 


gi5901529 


Homo sapiens 


C2H2 type Kruppel-like zinc 
finger protein splice vanant b 
(ZNr23o) mKJNA, complete cds. 


565 


99 


1708 


gi5705917 


human, MOLT 4 
T-cells ? mRNA, 
79 o ntj. [Homo 

• 

sapiens 


HKR-Tl=Kruppel-like zinc finger 
protein 


223 


45 


1708 


gi49o /Jo 


Homo sapiens 


in. sapiens rLCirzf in ix in /a ioi z.mo 
finger protein. 




48 


i /uy 


gilooD / ID 


jdos Taurus 


pi UCOlldgCIi J. lN**pi ULCillaoC 




42 


1709 


AAW47029 


Homo sapiens 


Human N-proteinase (70 kDa short 


254 


43 


1709 


AAW47030 


Homo sapiens 


Bovine N-proteinase. 


254 


42 


1710 


gil 28623 92 


Mus museums 


JJoo 




/ o 


1710 


gi763113 


Homo sapiens 


H.sapiens ERK3 mRNA. 


872 


92 


1710 


gil294779 


Homo sapiens 


ERK3 protein kinase mRNA, 
complete cds. 


OHO 

872 


yz 


171 i 
1/11 


gO /ZOoU 


xiscnencnia con 


uuLtL j\.ciud.uyi-d.uyi i/ciiiici jjiuucni 

synthase 


jij 


91 


1711 


gi664S70 


Escherichia coli 


beta-ketoacyl-acyl carrier protein 
synthase II 


315 


91 


1711 


gi4062664 


Escherichia coli 


3-oxoacyl- synthase (EC 2.3.1.41) 
II 


315 


91 


1712 


AAR89952 


Homo sapiens 


Insulin-like growth factor binding 
protein-3. 


726 


95 
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1712 


gil2652547 


Homo sapiens 


insulin-like growth factor binding 
protein 3, clone MGC:2305 
IMAGE:3506666, mRNA, 
complete cds. 


722 


95 


1712 


gi398164 


Homo sapiens 


H. sapiens mRNA for insulin-like 
growth factor binding protem-3. 


722 


95 


1713 


gil 0435700 


Homo sapiens 


cDNA FLJ13633 fis, clone 
PLACE 1011114, weakly similar to 
PROBABLE ATP-DEPENDENT 
RNA HELICASE HAS1. 


710 


97 


1713 


AAB99891 


Homo sapiens 


Human RNA helicase gene 
helicain B protein sequence SEQ 
ID NO:4. 


710 


97 


17 13 


AAB99892 


T T * 

Homo sapiens 


T T T4"V T A 1 1 " 

Human RNA helicase gene 
helicain C protein sequence SEQ 
ID NO:6. 


710 


97 


1714 


gil 53 84740 


T T * 

Homo sapiens 


mRNA for Paralemmin-2 (PALM2 
gene). 


1652 


100 


1714 


gil5384742 


Homo sapiens 


mRNA for Palm2-AKAP2 fusion 
protein (PALM2/AKAP2 gene). 


1577 


100 


1714 


gil4041780 


Homo sapiens 


mRNA for AKAP-2 protein. 


415 


60 


1715 


gi3600073 


Homo sapiens 


Human endogenous retrovirus K 
clone 7. 1 polymerase mRNA, 
partial cds. 


573 


48 


1715 


gi3 600067 


Homo sapiens 


TT 1 j * Tr 

Human endogenous retrovirus K 
clone 10.9 polymerase mRNA, 
partial cds. 


572 


48 


1715 


gil 780973 


Human 
endogenous 
retrovirus K 


pol protein 


572 


48 


1716 


AAU00025 


Homo sapiens 


Human activated T-lymphocyte 
associated sequence 4, ATLAS-4. 


315 


74 


1716 


gi5880909 


Drosophila 
melanogaster 


RNA-binding protein Smaug 


87 


29 


1716 


gi/3 80929 


Drosophila 
melanogaster 


smaug protein 


87 


29 


1717 


gi35825 


T T • 

Homo sapiens 


Human mRNA for pregnancy zone 
protein. 


3998 


93 


1717 


gi579594 


T T • 

Homo sapiens 


alpha 2-macroglobulm 690-740 


2841 


64 


1717 


AAR1 1749 


TT * 

Homo sapiens 


TT 11- O 1 1 1 ■ 1 *j_ 

Human alpha- 2 macroglobulm bait 
region mutant. 


2832 


64 


1718 


gi4760578 


Mus musculus 


PB-Cadherin 


1033 


84 


1718 


gil398906 


T"» J l 

Rattus 

• 

norvegicus 


long type PB-cadhenn 


1027 


84 


1718 


gil398912 


Rattus 
norvegicus 


short type PB-cadherin 


1027 


84 


1719 


gi339776 


Homo sapiens 


Human transposon LI .2. 


312 


62 


1719 


gi5070621 


Homo sapiens 


retrotransposon LI insertion in X- 
iiiiKcG retinitis pigmentosa locus, 
complete sequence. 


312 


62 


1719 


gi7959889 


Homo sapiens 


PRQ2221 


308 


71 


1720 


AAE06588 


Homo sapiens 


Human protein having 
hydrophobic domain, HP 10778. 


687 


100 


1720 


gil 5072402 


Raja erinacea 


organic solute transporter alpha 


357 


45 


1720 


AAB38348 


Homo sapiens 


Human secreted protein encoded 
by gene 28 clone HLDOW79. 


328 


100 
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% Identity 


1721 


AAG81345 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:208. 


525 


90 


1721 


AAB93797 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 13560. 


525 


90 


1721 


AAB44681 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 41 SEQ ID 
NO: 146. 


1 c\r\ 

199 


100 


1722 


gi 1809225 


Homo sapiens 


Human BAC clone RG161K23 
from 7q21, complete sequence. 


2183 


83 


1722 


gi 1698396 


Homo sapiens 


Human lanosterol 14-demethylase 
cytochrome P450 (CYP51) 
mRNA, complete cds. 


2183 


83 


1722 


gi871883 


Homo sapiens 


Human mRNA for lanosterol 14- 
demethylase, complete cds. 


2183 


83 


1723 


gi 161 82326 


Drosophila 
melanogaster 


GH01206p 


108 


29 


1723 


AAG77172 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:7938. 


90 


39 


1723 


gi3 878064 


Caenorhabditis 
elegans 


H19J13.1 


72 


25 


1724 


AAY53040 


Homo sapiens 


Human secreted protein clone 
kj320_l protein sequence SEQ ID 
NO:86. 


2480 


100 


1724 


gi35 10639 


Rattus 
norvegicus 


UDP-GalNAc:polypeptide N- 
acetyl gal actosammyl transferase T5 


1351 


59 


1724 


gi6688167 


Homo sapiens 


partial mRNA for GalNAc-T5 
(GALNT5 gene). 


1082 


100 


1725 


gil4603092 


Homo sapiens 


Similar to CD47 antigen (Rh- 
related antigen, integrin-associated 
signal transducer), clone 
MGC: 15298 IMAGE :43 03 5 34, 
mKJNA, complete cds. 


1329 


97 


1125 


gi 152775 80 


Homo sapiens 


clone MGC: 9240 
IMAGE:385791 1, mRNA, 
complete cds. 


i no 


y / 


1725 


gi396705 


Homo sapiens 


H. sapiens integrin associated 
protein mRNA, complete CDS,. 


1329 


97 


1726 


gil864011 


Homo sapiens 


mRNA for SHPS-1, complete cds. 


858 


98 


1726 


gi2052056 


Homo sapiens 


H.sapiens mRNA for SIRP-alphal. 


858 


98 


1726 


gi65 18913 


Homo sapiens 


Bit mRNA, complete cds. 


858 


98 


1727 


gi2707601 


Homo sapiens 


A4 differentiation-dependent 
protein (A4), triple LIM domain 
protein (LM06), and 
synaptophysin (SYP) genes, 
complete cds; and calcium channel 
alpha- 1 subunit (CACNA1F) gene, 
partial cds. 


1656 


100 


1727 


gi6180176 


Homo sapiens 


transcription factor IGHM 
enhancer 3, JM1 1 protein, JM4 
protein, JM5 protein, T54 protein, 
JM10 protein, A4 differentiation- 
dependent protein, triple LIM 
domain protein 6, and 
synaptophysin genes, complete 
cds; and L-type calcium channel 
alpha- 1 subunit gene, partial cds, 
complete sequence. 


1656 


100 
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^ score 


/o lueniiiy 


1727 


gi899301 


Homo sapiens 


Human mRNA for synaptophysin 
(p38). 


1 £AO 


1UU 


1728 


gil5590682 


TT * 

Homo sapiens 


nistone deacetylase ya (HDAOyj 
mRNA, complete cds 9 alternatively 
spliced. 


/COO 




1728 


gil2060992 


Mus musculus 


MEF2-interacting transcription 
repressor MITR 


626 


95 


1728 


gil3l830l7 


Mus musculus 


nistone deacetyiase-relatea protein 




V4 


1729 


gi59H884 


Homo sapiens 


mRNA; cDNA DKFZp434N126 
(rrom clone JJKrZp4J4JNl2oj. 


1605 


99 


1729 


gi 159 12209 


Arabidopsis 
thaliana 


AtlgZ /dZv/ 1 1 /xio_Z 


OOQ 


'XA 


1729 


gil4 164377 


Mus musculus 


1 ype 11 membrane protein 01 
ER~mouse gene similar to alpha- 
mannosidase 


0 1 £ 


JO 


1730 


gi21842 


Triticum 
aestivum 


proline-rich protein 


403 


33 


1730 


gi4138732 


Zea mays 


proline-rich protein 


398 


30 


1730 


gil 1610622 


Rattus 
norvegicus 


GABA-A epsilon subunit splice 
variant 


/i 1 1 


^o 


1731 


gil 89222 


Homo sapiens 


Human neurokinin-2 receptor 
(TAC2R) gene, exon 5. 


1733 


95 


1731 


AAW80135 


Homo sapiens 


Human recombinant neurokinin-2 
(NK-2) receptor protein. 


1733 


95 


1731 


gil89135 


Homo sapiens 


TT 1 ! " A ^ ^ _„ 

Human neurokinin A receptor 
(NK-2R) mRNA, complete cds. 


1 /iZ 


yD 


1732 


AAB75594 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 37 bEQ ID 
NO: 148. 


678 


99 


1732 


AAB75542 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 37 SEQ ID 
NO:96. 




1 C\f\ 


1732 


gil864011 


Homo sapiens 


mKJNA tor iSHFS-l 5 complete cds. 


ZD 1 


4J 


1733 


AAY66648 


T T ' 

Homo sapiens 


Membrane-bound protein 
PRO 1120. 






1733 


AAB65171 


Homo sapiens 


Human rRUl (UJN^jyj 
protein sequence SEQ ID NO: 84. 




yj 


1733 


AAE01440 


Homo sapiens 


Human gene 5 encoded secreted 
protein HE9QN39, SEQ ID 
NO:yD. 




O'? 

yj 


1734 


gil3161409 


Mus musculus 


family 4 cytochrome P450 


539 


78 


1734 


gil 55947 


Blaberus 
discoidalis 


cytochrome P450 


Ovl Q 


A1 


1734 


gi3249041 


Diploptera 
punctata 


corpora allata cytochrome P450 

• 


233 


42 


1735 


A A Y7 1159 


Homo sapiens 


Human phosphodiesterase 
interacting protein, myomegalin. 


682 


45 


1735 


gi4761644 


Rattus 

* 

norvegicus 


myomegalm 


484 


55 


1735 


gi53782 


Mus musculus 


profilin (AA 1-140) 


414 


74 


1736 


gi4959382 


Homo sapiens 


human endogenous retrovirus 
HERV-H19 pol protein (pol) gene, 
partial cds; env protein (env) gene, 
complete cds; and 3* LTR, 
complete sequence. 


301 


39 


1736 


gi8439396 


HERV-H/env62 


envelope protein 


301 


39 
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% Identity 


1736 


gi5442112 


Simian retrovirus 
type 2 


envelope glycoprotein 


Zyo 


/IT 

4Z 


1737 


AAY76177 


Homo sapiens 


Human secreted protein encoded 
by gene 54. 


Zoo 


i no 


1738 


gi3688630 


Homo sapiens 


hdkk-4 gene, exon3, exon4 ana 
complete cds. 


oy 


1 f\f\ 


1738 


gi6049610 


Homo sapiens 


dickkopf-4 (DKK-4) mRNA, 
complete cds. 


75y 


i nn 
1UU 


1738 


AAW73017 


Homo sapiens 


Human cysteine-rich secreted 
protein CRSP-2. 


759 


100 


1739 


gi 10439926 


Homo sapiens 


cDNA: FLJ23293 fis, clone 
HEP10514. 


1786 


99 


1739 


gi4406632 


Homo sapiens 


clone 25221 mRNA sequence, 
complete cds. 


1253 


64 


1739 


gil0435296 


Homo sapiens 


cDNA FLJ13321 fis, clone 
OVARC1001703, weakly similar 
to Mus musculus ARL-6 
interacting protein-2 (Aip-2) 
mRNA. 


1084 


62 


1740 


AAB49278 


Homo sapiens 


Protein encoded by zsig8 1 cDN A 
fragment. 


/33 


y4 


1740 


gi 133 5033 


-w- -— - • 

Homo sapiens 


Human mRNA for collagen VI 
alpha- 2 alternative C-terminal 
domain. 


oy 


Z4 


1740 


AAY40063 


Homo sapiens 


Peptide sequence derived from a 
human secreted protein. 


63 


35 


1741 


gil4714807 


Mus musculus 


Similar to transporter-like protein 


565 


83 


1741 


AAG81264 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:46. 


OD / 


1 fin 


1741 


AAY66673 


Homo sapiens 


Membrane-bound protein 
PROH15. 


657 


100 


1742 


gi 145 83077 


Homo sapiens 


PAS-kinase (PASK) mRNA, 
complete cds. 


2227 


yy 


1742 


AAB65630 


Homo sapiens 


Novel protein kinase, SEQ ID NO: 
156. 


2223 


99 


1742 


gil5487238 


Mus musculus 


PASKIN protein 


1632 


/Z 


1743 


gil3879899 


Mycobacterium 
tuberculosis 
CDC 1551 


PPE family protein 


118 


31 


1743 


gi 13 34643 


Xenopus laevis 


APEG precursor protein 


yu 




1743 


gil 84511 


Homo sapiens 


Human zinc-finger DNA-binding 
motifs (IA-1) mRNA, complete 
cds. 


86 


32 


1744 


gi5931718 


Chlamydomonas 
reinhardtii 


1 -alpha dynem heavy chain 


1 QHA 

lo /4 


oz 


1744 


gi9409781 


Chlamydomonas 
reinhardtii 


1 beta dynein heavy chain 


986 


36 


1744 


gi514215 


Chlamydomonas 
reinnaruiii 


dynein beta heavy chain 


978 


36 


1745 


AAY13392 


Homo sapiens 


Amino acid sequence of protein 
PR0328. 


1168 


57 


1745 


AAB01373 


Homo sapiens 


Neuron-associated protein. 


1168 


57 


1745 


AAY95343 


Homo sapiens 


Human PR0328 antitumour 
protein. 


1168 


57 


1746 


AAY99398 


Homo sapiens 


Human PRO1301 (UNQ667) 
1 amino acid sequence SEQ ID 


677 


94 



I 
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NO:212. 






1746 


AAW88501 


T T * 

Homo sapiens 


Human stomach carcinoma clone 
HP10415-encoded protein. 


, 675 


94 


1746 


AAB24255 


Homo sapiens 


Human cytochrome P450 
(HUCYP) protein SEQ ID NO:l. 


675 


94 


1747 


AAG02314 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6395. 


78 


66 


1747 


AAB45051 


Homo sapiens 


Human secreted protem encoded 
by gene 27. 


60 


30 


1747 


gi2585662 


Human 

immunodeficienc 
y virus type 1 


envelope glycoprotein 


58 


38 


1748 


AAE03560 


Homo sapiens 


Human differentially expressed 
kidney cDNA 22360 encoded 
protein. 


1142 


88 


1748 


gil5637151 


Beta vulgaris 


glycine decarboxylase subunit P 


62 


36 


1749 


gil3506805 


T T • 

Homo sapiens 


thymic stromal co-transporter 
mRNA, complete cds. 


2309 


100 


1749 


gil3506808 


Mus musculus 


thymic stromal co-transporter 


1789 


77 


1749 


AAE04906 


Homo sapiens 


Human transporter and ion 
channel- 1 9 (TRICH- 1 9) protem. 


305 


34 


1750 


gil0438815 


Homo sapiens 


cDNA: FLJ22427 Us, clone 
HRC09013. 


4170 


98 


1750 AAB01381 


Homo sapiens 


Neuron-associated protein. 


2964 


95 


1750 


gi6650377 


Mus musculus 


pecanex l 


2288 


73 


1751 


gi825663 


Homo sapiens 


H.sapiens GLASTl gene for glilial 
glutamate transporter, exonl, exon 
2. 


411 


100 


1751 


gi487339 


Homo sapiens 


Human excitatory amino acid 
transporter 1 mRNA, complete cds. 


411 


100 


1751 


gi825504 


Homo sapiens 


Human mRNA for glutamate 
transporter, complete cds. 


411 


100 


1752 


gi 162 1607 


Homo sapiens 


Human neogenin mRNA, complete 
cds. 


593 


100 


1752 


gi641966 


Gallus gallus 


neogenin 


591 


98 


1752 


gil785999 


Rattus 
norvegicus 


* 

neogenin 


pi* v"" 

586 


97 


1753 


gi7020927 


Homo sapiens 


cDNA FLJ20674 fis, clone 
KAIA4450. 


293 


25 


1753 


AAP94014 


Homo sapiens 


Carcmoembryonic cell surface 
antigen. 


254 


27 


1753 


AAR60619 


Homo sapiens 


Carcmoembryonic antigen 
glycoprotein. 


250 


27 


1754 


AAY19507 


Homo sapiens 


Amino acid sequence of a human 
secreted protem. 


343 


91 


1754 


AAY19654 


Homo sapiens 


SEQ ID NO 372 from 
W09922243. 


85 


100 


1754 


gil5075730 


Sinorhizobium 

lllGlllUll 


HYPOTHETICAL 

TR ATsJQA/TP'M'PP ANTE TVD fYTTTTM' 


65 


30 


1755 


gi2506078 


Mus musculus 


tetracycline transporter-like protein 


1120 


99 


1755 


AAY29332 


Homo sapiens 


Human secreted protein clone 
pe584_2 protein sequence. 


806 


78 


1755 


AAB75295 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 
NO: 114. 


806 


78 


1756 


AAE06608 


Homo sapiens 


Human protein having 


1065 


98 
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I.J 1 _ 1 ■ 1 _ _ ' TTT1 1 

hydrophobic domain, JdPlU/9o. 






1756 


AAB88469 


Homo sapiens 


Human membrane or secretory 
protein clone PSBC0027. 


1065 


98 


1756 


gi9664030 


Drosophila 
melanogaster 


aquaporin 


115 


22 


i ft t* ft 
1757 


gi8925284 


TT 1 _ 

Homo sapiens 


phosphatidyhnositol 
polyphosphate 5-phosphatase type 
IV mKJNA, complete cas. 


2598 


An i 

99 


i 'ten 

1757 


gi9295353 


Mus museums 


inositol polyphosphate 5- 
phosphatase 




/4 


1757 


gi53 60761 


T"l — . //. - _ 

Rattus 
norvegicus 


pharbm 


1 on 


/ / 


1 H C O 

1758 


gi395207 


Bos taurus 


potassium channel vrKjro) 


/!> / 




1758 


gi 186669 


Homo sapiens 


Human potassium channel mRNA, 
complete cds. 


754 


83 


1758 


gl304652 


Canis familiaris 


delayed rectifier K+ channel 




CO 

oZ 


1759 


gi7023003 


Homo sapiens 


cDNA FLJ 10769 fis, clone 

TV TT1 T> Ti vf A (\f\ 1 f 1 

N 1 2RP4000 15 1 . 


647 


85 


1759 


AAB93147 


Homo sapiens 


Human protein sequence SEQ ID 
NO:12057. 


647 


85 


1759 


AAB38451 


Homo sapiens 


Fragment of human secreted 
protein encoded by gene 23 clone 
HEGAK44. 


484 


84 


1760 


A AB 87763 


Homo sapiens 


Human T2R33 amino acid 
sequence SEQ ID NO:56. 


693 


83 


1760 


AAB87780 


Homo sapiens 


Human T2R50 amino acid 
sequence SEQ ID NO:76. 


671 


82 


1760 


AAE03828 


X X • 

Homo sapiens 


Human gene 1 1 encoded secreted 
protein HHAUQ28, SEQ ID NO: 
74. 


f r s~ 

656 


89 


1761 


gi 13543624 


Homo sapiens 


uncharacterized hematopoietic 
stem/progenitor cells protein 
MDS029, clone MGC: 14612 
IMAGE:4051044, mRNA, 
complete cds. 


514 


87 


1761 


gil3937872 


Homo sapiens 


uncharacterized hematopoietic 
stem/progenitor cells protein 
MDS029, clone MGC: 12437 
IMAGE:3930701, mRNA, 
complete cds. 


514 


87 


1761 


gil4250122 


Homo sapiens 


uncharacterized hematopoietic 
stem/progenitor cells protein 
MDS029, clone MGC: 14755 
IMAGE:4283253, mRNA, 
complete cas. 


514 


87 


1762 


gi2739094 


Homo sapiens 


sodium/myo-inositol cotransporter 
(SLC5A3) gene, complete cds. 


3621 


100 


1762 


gi662843 


XT * 

Homo sapiens 


Na+/myo-inositol cotransporter 
(SLC5A3) gene, complete cds. 


3616 


99 






r>os taurus 


iNa+/myo-inositoi cotransporter 


j4oO 




1763 


AAW84596 


Homo sapiens 


Amino acid sequence of the human 
Tango-79 protein. 


1212 


44 


1763 


gi 15029689 


Homo sapiens 


clone MGC: 17422 
IMAGE:42 14343, mRNA, 
complete cds. 


1209 


44 


1763 


AAB74705 


Homo sapiens 


Human membrane associated 
protein MEMAP-11. 


1209 


44 
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1764 


gi5923786 


T T • 

Homo sapiens 


zinc metalloprotease ADAMTS6 
(ADAMTS6) mRNA, complete 
cds. 


614 


43 


1764 


AAB72282 


Homo sapiens 


Human ADAMTS-6 amino acid 
sequence. 


614 


43 


1764 


gil2053709 


Homo sapiens 


mRNA for ADAMTS 12. 


1078 


41 


1165 


gil695682 


Homo sapiens 


mRNA for hepatic triglyceride 
lipase, complete cds. 


152 


64 


1765 


gi32498 


Homo sapiens 


Human mRNA for hepatic 
triglyceride lipase (HTGL). 


152 


64 


1765 


gi307129 


Homo sapiens 


Human hepatic lipase mRNA, 
complete cds. 


152 


64 


1766 


gi8 86282 


Homo sapiens 


glycoprotein lb alpha (GPIb) gene, 
partial cds. 


48 


42 


1766 


AAB64868 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 43 SEQ ID 
NO: 154. 


66 


39 


1766 


gi 10798865 


Homo sapiens 


zinc finger transcription factor 
BTEB2 gene, partial cds. 


65 


29 


1767 


gi8886005 


Homo sapiens 


lysophosphatidic acid 
acyltransferase-delta (LPAAT- 
delta) mRNA, complete cds. 


324 


90 


1161 


AAY96592 


T T * 

Homo sapiens 


TT 1 1 1 j_ * 1 " "1 

Human lysophosphatidic acid 
acyltransferase delta. 


324 


90 


1161 


AAY66665 


Homo sapiens 


Membrane-bound protein 
PRO1016. 


324 


90 


1768 


gill60183 


Homo sapiens 


H.sapiens mRNA for metabotropic 
glutamate receptor type 4. 


926 


99 


1768 


gil 935039 


Homo sapiens 


Human metabotropic glutamate 
receptor 4 mRNA, complete cds. 


926 


99 


1768 


gi2298840 


unidentified 


HMGLUR4 


926 


99 


1769 


gil 3279140 


Homo sapiens 


Similar to synaptotagmin 11, 
clone MGC: 10881 
IMAGE:3621175, mRNA, 
complete cds. 


2210 


99 


1769 


gil5489165 


Homo sapiens 


clone MGC: 17226 
IMAGE:4 179089, mRNA, 
complete cds. 


2210 


99 


1769 


gil 4042290 


Homo sapiens 


cDNA FLJ14634 fis, clone 
NT2RP2001081, moderately 
similar to SYNAPTOTAGMIN 
IV. 


2201 


98 


1770 

• 


gil4249942 


Homo sapiens 


Similar to RIKEN cDNA 
0610008P16 gene, clone 
MGC: 15937 IMAGE: 35 3 7224, 

T» "VTA 1 j J 

mRNA, complete cds. 


239 


78 


1770 


AAB564S7 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO: 1065. 


239 


78 


1 / /(J 


AAd 12 


Homo sapiens 


Human transterase HI 1^8-19, SEQ 
ID NO: 19. 


239 


78 


1771 


gi7678873 


Homo sapiens 


mRNA for vascular cadherin-2, 
complete cds. 


5369 


99 


1771 


gi7407150 


Homo sapiens 


protocadherin 12 (PCDH12) 
mRNA, complete cds. 


5369 


99 


1771 


gi8 164037 


Homo sapiens 


vascular endothelial cadherin 2 
mRNA, complete cds. 


5369 


99 



l 
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1772 


gil5082281 


Homo sapiens 


Similar to steroid dehydrogenase 
homolog, clone MGC: 10252 
IMAGE:3622879, mRNA, 
complete cds. 


231 


93 


1772 


gi 152 14803 


Homo sapiens 


Similar to steroid dehydrogenase 
nomolog, clone M<jrC:1332y 
IMAGE:4281565, mRNA, 
complete cos. 


231 


93 


1772 


gi5531815 


Homo sapiens 


steroid dehydrogenase homolog 


231 


93 


1112 


AAG73650 


XT ' 

Homo sapiens 


TT 1 „ i " „ , 

Human colon cancer antigen 
protein SEQ ID NO:4414. 


265 


DO 


1773 


A A T^rt /loni 

AAB94891 


Homo sapiens 


Human protein sequence SbQ 1U 
NO: 16231. 


203 


OH 


1773 


gi 1335205 


Homo sapiens 


TT . t^v"v t A J? T TXTT7 1 

Human DNA tor LINE- 1 
transposable element ORFI and II. 


273 


4o 


111 A 


AAW71708 


Homo sapiens 


Human integral membrane protein 
TMP-1. 


657 


5U 


1774 


gi3603459 


Homo sapiens 


tetraspan NET-5 mRNA, complete 
cds. 


626 


90 


Ml 4 


AAB93885 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13815. 


626 


90 


1775 


AAG71597 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1278. 


ybl 




1115 


AAG71587 


r r ! 

Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1268. 


/46 


75 


1775 


gi 15293749 


TT * 

Homo sapiens 


clone UR1UR2 oltactory receptor 
gene, partial cds. 


IZZ> 


/5 


1776 


A A T» TOA1 1 

AAB38011 


TT • 

Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HPJCX13. 




o5 


1776 


AAB64888 


T T * 

Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 kiiQ ID lNU:oo. 


252 


o5 


1776 


gi 10432803 


Homo sapiens 


cDNA rl_J 11531 lis, clone 
jirSMoA 1 UUZoo 1 . 


1 O') 

1V2 


54 


1 HHH 

VI 1 I 


glIo2o5 L 


riomo sapiens 


riuman uuoz protein gene, 
complete cos. 






1777 


gi 182853 


Homo sapiens 


Human GOS2 gene, 5' flank and 

COS. 


497 


100 


1777 


gil213013 


Mus musculus 


G0S2-like protein 


377 


77 


1 770 
1 / /o 


gl4UZ fy\jD 


riomo sapiens 


v Aivii^j mKiNA, complete cas. 


H- /-> 




1 *7"7 0 

1 //o 


gl4o /yuuo 


riomo sapiens 


VAMrj-iiKe protein 




IVJV 


1778 


a A TXTA/t 1 01 

AAW U4iol 


Homo sapiens 


Cellubrevm-z. 




1UU 


Illy 




±5 os tauius 


ov-AJ-sponam 






1779 


gi3 059229 


Aspergillus 
oryzae 


HAPB 


74 


35 


1 T~IC\ 

1119 


AAY cs4dvo 


Homo sapiens 


Fragment of human pre-pro- 
artemin polypeptide. 


/I c 


4U 


1 TO/1 

1780 


gii3yy3zi 


Macaca mulatta 


MHC class 1 antigen Mamu H^Ub 


o /y 


/2 


1780 


gil399319 


Macaca mulatta 


MHC class I antigen Mamu B*07 


674 


67 


1 7Qfl 

L /oil 


rriQ 1 1 77QQ 

gio 11/ /yy 


.ran irogiooyres 


ivinv^ ciass i antigen 


Oil 




1781 


gi 15929602 


Homo sapiens 


clone MGC: 17861 
IMAGE:3903313, mRNA, 
complete cds. 


607 


51 


1781 


gil5099953 


Homo sapiens 


diacylglycerol acyltransferase 2 
mRNA, complete cds. 


607 


51 


1781 


AAY99421 


Homo sapiens 


Human PR01433 (UNQ738) 
amino acid sequence SEQ ID 


607 


51 
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NO:292. 






1782 


gi2706518 


Homo sapiens 


PMP69 gene, exon 1 (and joined 
CDS). 


1602 


98 


1782 


gi2343157 


Homo sapiens 


peroxisomal membrane protein 69 
(PMP69) mRNA, complete cds. 


1602 


98 


1782 


gil5215442 


Homo sapiens 


Similar to ATP -binding cassette, 
sub-family D (ALD), member 4, 
clone MGC:4125 
IMAGE:2960427, mRNA, 
complete cds. 


1594 


98 


1783 


AAB35235 


Homo sapiens 


Human neurotransmitter 
transporter protein GC42. 


3571 


98 


1783 


gi546769 


human, 

substantia nigra, 
mRNA, 2364 nt]. 
[Homo sapiens 


glycine transporter type lb 


3557 


98 


1783 


AAB35236 


Homo sapiens 


Human glycine transporter type lc. 


3557 


98 


1784 


AAU00017 


Homo sapiens 


Human Plexin-Dl. 


3361 


97 


1784 


gil3097621 


Homo sapiens 


clone IMAGE: 3607457, mRNA, 
partial cds. 


2470 


100 


1784 


AAU00015 


Homo sapiens 


Human Plexin-B2. 


1290 


47 


1785 


gi6 19726 


Homo sapiens 


Human nuclear factor I (NFI) 
mRNA, clone ATI, complete cds. 


953 


99 


1785 


gi6 19730 


Homo sapiens 


Human nuclear factor I (NFI) 
mRNA, clone CT1, partial cds. 


953 


99 


1785 


gi305357 


Mesocricetus 
auratus 


nuclear factor 1-like protein 


953 


99 


1786 


gil3810568 


Homo sapiens 


TLR5 mRNA for Toll-like receptor 
5, complete cds. 


4482 


100 


1786 


gi3 132526 


Homo sapiens 


Toll/interleukin- 1 receptor-like 
protein 3 (TIL3) mRNA, complete 
cds. 


4464 


99 


1786 


gi7648687 


Mus musculus 


toll-like receptor 5 


3235 


72 


1787 


AAB56473 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO:1051. 


474 


100 


1787 


AAB60119 


Homo sapiens 


Human transport protein TPPT-39. 


456 


98 


1787 


gil 1463949 


Homo sapiens 


hUGTrel7 mRNA for UDP- 
glucuronic acid, complete cds. 


308 


62 


1788 


gi4929765 


Homo sapiens 


CGI- 148 protein mRNA, complete 
cds. 


744 


89 


1788 


gi7578785 


Homo sapiens 


NPD008 protein (NPD008) 
mRNA, complete cds. 


744 


89 


1788 


gil4250060 


Homo sapiens 


clone MGC: 14598 
IMAGE:4292664, mRNA, 
complete cds. 


737 


88 


1789 


gil736785 


Escherichia coli 


Acriflavin resistance protein F 
(EnvD protein). 


2265 


100 


1789 


gil5980819 


Yersinia pestis 


AcrB/AcrD/AcrF family 
membrane protein 


1854 


79 


1789 


gil 736782 


Escherichia coli 


Acriflavin resistance protein F 
(EnvD protein). 


1821 


100 


1790 


gil5160166 


Agrobacterium 
tumefaciens 


AGR_L_3181p 


549 


59 


1790 


gi4981492 


Thermotoga 
maritirna 


ribose ABC transporter, permease 
protein 


316 


40 
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1790 


gi 12724646 


Lactococcus 
lactis subsp. 
lactis 


ribose ABC transporter permease 
protein 


314 


35 


1791 


gi41587 


Escherichia coli 


glycerol-3-phosphatase transporter 
(AA 1 - 452, glpT) 


1073 


100 


1791 


gil799587 


Escherichia coli 


glycerol-3-phosphate transport 
protein 


1073 


100 


1791 


gil788573 


Escherichia coli 
K12 


sn-glycerol-3-phosphate permease 


1073 


100 


1792 


gi 148200 


Escherichia coli 


similar to arylsulfatases of 
BGebsiella pneumoniae and Homo 
sapiens 


679 


98 


1792 


gil3364207 


Escherichia coli 
0157:H7 


arylsulfatase 




no 


1792 


gil790233 


Escherichia coli 
K12 


arylsulfatase 


679 


98 


1793 


gi9657461 


Vibrio cholerae 


sulfate permease family protein 


267 


51 


1793 


gi2635979 


Bacillus subtilis 


similar to transporter 


249 


40 


1793 


gi 14024597 


Me sorhizobium 
loti 


sulfate transporter family protein 


231 


45 


1794 


gil799719 


Escherichia coli 


similar to 


922 


100 


1794 


gil5156677 


Agrobacterium 
tumefaciens 


AGR_C_2926p 


452 


50 


1794 


gi 15074970 


Sinorhizobium 
meliloti 


HYPOTHETICAL 
TRANSMEMBRANE PROTEIN 


A 1 A 

414 


47 


1795 


gi 13325242 


Homo sapiens 


clone MGC:4033 
IMAGE:2820092, mRNA, 

1 j i 

complete cds. 


228 


81 


1795 


gi 1644366 


Rattus 
norvegicus 


ninjurinl 


228 


81 


1795 


gi3 077901 


Mus musculus 


ninjurin 


228 


81 


1796 


gi 15987491 


Homo sapiens 


tumor endothelial marker 5 
precursor (TEM5) mRNA, 
complete cds. 


5742 


89 


1796 


AAB71863 


Homo sapiens 


Human hl5571 GPCR. 


5742 


89 


1796 


gil5987499 


Mus musculus 


tumor endothelial marker 5 
precursor 


5030 


79 


1797 


gil3938575 


Homo sapiens 


Similar to RIKEN cDNA 
2610511E22 gene, clone 
MGC:4251 IMAGE: 3028940, 
mRNA, complete cds. 


1331 


100 


1797 


AAY91598 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 8 SEQ ID 
NO:271. 


1322 


100 


1797 


gil5029776 


Mus musculus 


RIKEN cDNA 261051 1E22 gene 


1317 


98 


1798 


gi42989 


Escherichia coli 


SecY (PrlA) polypeptide (aa 1- 
443) 


953 


100 


1798 


gi606234 


Escherichia coli 


secY 


953 


100 


1798 


gil 5978329 


Yersinia pestis 


preprotem translocase bee Y 
subunit 


OA 1 




1800 


gil0435708 


Homo sapiens 


cDNA FLJ13639 fis, clone 
PLACE101 1219, weakly similar to 
PROBABLE 

OXIDOREDUCTASE (EC 1.-.-.-). 


289 


84 


1800 


AAB94698 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15680. 


289 


84 
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1800 


gil5292559 


Drosophila 
melanogaster 


SD07613p 


101 


67 


1801 


gi7415511 


Homo sapiens 


mRNA for peptide transporter 3, 
complete cds. 


1281 


89 


1801 


gtl38 10437 


Rattus 
norvegicus 


peptide histidine transporter 1 
homolog rPHT2 


1066 


83 


1801 


AAY86224 


Homo sapiens 


Human secreted protein 
HDPWU34, SEQ ID NO: 139. 


1280 


100 


1802 


gi387012 


Homo sapiens 


Human pepsinogen gene, exon 9. 


615 


97 


1802 


gi387013 


Homo sapiens 


Human pepsinogen A (15.0) gene, 
exon 9, clone cgHGP2. 


615 


97 


1802 


AAB66589 


Homo sapiens 


Human pepsin. 


615 


97 


1803 


gil5680159 


Homo sapiens 


Similar to claudin 2, clone 
MGC:20191 IMAGE:4645075, 
mRNA, complete cds. 


1113 


99 


1803 


gil0503980 


Homo sapiens 


clone SP82 claudin 2 mRNA, 
complete cds. 


1113 


99 


1803 


gi9755009 


Homo sapiens 


claudin-2 mRNA, complete cds. 


1113 


99 


1804 


gi476222 


Homo sapiens 


Human anion exchanger 3 brain 
isoform (bAE3) mRNA, complete 
cds. 


317 


74 


1804 


gil 0953762 


Mus musculus 


anion exchanger 3 cardiac isoform 


317 


74 


1804 


gi309095 


Mus musculus 


AE3 protein 


317 


74 


1805 


AAB29632 


Homo sapiens 


Human pollinosis-associated gene 
581-encodedprotein, SEQ ID 
NO: 12. 


2518 


55 


1805 


AAY70023 


Homo sapiens 


Human Protease and associated 
protein-17 (PPRG-17). 


2068 


58 


1805 


gil3529590 


Mus musculus 


Similar to ubiquitin specific 
protease 20 


1924 


56 


1806 


AAY99363 


Homo sapiens 


Human PRO1380 (UNQ717) 
amino acid sequence SEQ ID 
NO:79. 


1773 


98 


1806 


gil2656639 


Homo sapiens 


equilibrative nucleoside transporter 
3 (ENT3) mRNA, complete cds. 


1767 


98 


1806 


AAY82285 


Homo sapiens 


Human ENT1 receptor SEQ ID 
NO:l. 


1767 


98 


1807 


AAG81410 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:338. 


155 


27 


1807 


gi9623190 


Mus musculus 


sphingosine- 1 -phosphate 
phosphohydrolase 


159 


30 

i 


1807 


gil 5778670 


Mus musculus 


sphingosine- 1 -phosphate 
phosphatase 


159 


30 


1808 


gil5082281 


Homo sapiens 


Similar to steroid dehydrogenase 
homolog, clone MGC: 10252 
IMAGE:3622879, mRNA, 
complete cds. 


231 


.93 


1808 


gil5214803 


Homo sapiens 


Similar to steroid dehydrogenase 
homolog, clone MGC: 13329 
IMAGE:4281565, mRNA, 
complete cds. 


231 


93 


1808 


gi5531815 


Homo sapiens 


steroid dehydrogenase homolog 


231 


93 


1810 


gil4270513 


Homo sapiens 


partial SLC22A3 gene for organic 
cation transporter 3, exon 1 and 
joined CDS. 


775 


99 


1810 


gi3581982 


Homo sapiens 1 


mRNA for extraneuronal 


775 


99 



t 
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monoamine transporter. 






1810 


gi4454795 


Mus musculus 


organic cation transporter 3 


720 


89 


1811 


gil212965 


Homo sapiens 


H.sapiens mRNA for 
transmembrane protein rnp24. 


266 


91 


1811 


AAB74750 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 1 8 SEQ ID 
NO:59. 


266 


91 


1811 


AAB74759 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 18 SEQ ID 
NO:68. 


266 


91 


1812 


gil5082375 


Homo sapiens 


Similar to transmembrane 7 
superfamily member 1 
(upregulated in kidney), clone 
MGC:20076 IMAGE :46432 16, 
mRNA, complete cds. 


711 


98 


1812 


gil3096836 


Mus musculus 


Similar to transmembrane 7 
superfamily member 1 
(upregulated in kidney) 


690 


95 


1812 


AAY69992 


Homo sapiens 


Human receptor-associated protein 
from Incyte clone 786873. 


412 


72 


1813 


gil2653811 


Homo sapiens 


zinc finger protein 219, clone 
MGC:1124 IMAGE:3347777, 
mRNA, complete cds. 


500 


38 


1813 


gi6899807 


Homo sapiens 


mRNA for zinc finger protein 219, 
complete cds. 


496 


38 


1813 


gi 14549 186 


Mus musculus 


zinc finger protein 219 


490 


36 


1814 


gi4324468 


Homo sapiens 


LAG1 protein (LAG1) gene, exon 
7 and complete cds. 


1385 


93 


1814 


gi 183051 


Homo sapiens 


Human growth/differentiation 
factor l (GDF-l) mRNA, complete 
cds. 


1385 


93 


1814 


AAR20230 


Homo sapiens 


hUOG-l. 


1385 


93 


1815 


gi6009515 


Xenopus laevis 


epidermis specific serine protease 


338 


41 


1815 


gi9757698 


Xenopus laevis 


embryonic serine protease- 1 


314 


46 


1815 


gil 3277969 


Mus musculus 


Similar to protease, serine, 8 
(prostasm) 


310 


38 


1816 


gi41875 


Escherichia coli 


KefC potassium efflux system 


706 


100 


1816 


gi2 16472 


Escherichia coli 


kefc potassium efflux system 


706 


100 


1816 


gil7S6232 


Escherichia coli 
K12 


K+ efflux antiporter, glutathione- 
regulated 


706 


100 


1817 


gil 054578 


Escherichia coli 


dioxygenase 


1510 


100 


1817 


gil 786565 


Escherichia coli 
K12 


taurine dioxygenase, 2- 
oxoglutarate-dependent 


1510 


100 


1817 


gil3359879 


Escherichia coli 
0157:H7 


taurine dioxygenase 


1506 


99 


1818 


gil 279401 


Escherichia coli 


SapB protein 


928 


98 


1818 


gil742115 


Escherichia coli 


Peptide transport system permease 
protein SapB. 


928 


98 


1818 


gil3361335 


Escherichia coli 
0157:H7 


homolog of Salmonella peptide 
transport permease protein 


928 


98 


1819 


gi9652147 


Homo sapiens 


transmembrane-type protein 
tyrosine phosphatase H (PTPRH) 
gene, exon 20 and complete cds. 


4639 


100 


1819 


gi475004 


Homo sapiens 


mRNA for protein tyrosine 
phosphatase precursor, complete 
cds. 


4308 


93 
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1819 


gil321659 


Rattus 

• 

norvegicus 


brain-enriched membrane- 
associated protein tyrosine 
phosphatase (BEM)-2 


920 


77 


1820 


gi7141125 


Homo sapiens 


tissue-type heart Ellis-van Creveld 
syndrome protem (EVC) mRNA, 
complete cds. 


1277 


100 


1820 


gi7141127 


Homo sapiens 


tissue-type brain Ellis- van Creveld 
syndrome protein (EVC) mRNA, 
complete cds. 


1277 


100 


1820 


gi7271903 


Homo sapiens 


DWF-1 mRNA, complete cds. 


1271 


99 


1821 


AAG71453 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1134. 


f /TO 

568 


98 


1821 


AAG72370 


T T * 

Homo sapiens 


Human OR-like polypeptide query 
sequence, ojqv^ iu ^uji. 




no 


1821 


AAE04556 


Homo sapiens 


Human G-protein coupled 
receptor- 12 (GCREC-12) protein. 


558 


100 


1822 


gi 10440040 


Homo sapiens 


cDNA: FLJ23375 fis, clone 
HEP 16206. 


757 


98 


1822 


AAB95094 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17042. 


98 


43 


1822 


gi6002197 


Homo sapiens 


H.sapiens NDUFV3 gene, exon 1 . 


78 


34 
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O 

scure 


o/ 
/o 

xuentiiy 


912 


gi601918 


Homo sapiens 


glutathione ^-transterase tneta ^ 




i on 


912 


gi769703 


Rattus 
norvegicus 


glutathione b-transrerase suounit i rs 


QQ^ 


/ o 


912 


gi220757 


Rattus 

• 

norvegicus 


glutathione S-transterase Yrs-Yrs 




7R 
to 


913 


gil3872813 


Homo sapiens 


fibulin-6 


4538 


93 


913 


gi 14575679 


Homo sapiens 


nemicentm 






913 


ABB 1 0492 


Homo sapiens 


Human clJiNA ^rii^ ID JN(J. oUU. 




on 


914 


A A TTOAOTO 

AAU80378 


TT „ „ „ „ ! ^, 

Homo sapiens 


Human JJUx'tJ-liKe protein jnuvi. 






914 


AAR83048 


Homo sapiens 


Human macrophage migration 
mmoitory iactor~o ^lviir-jj. 


450 


94 


914 


gi2104581 


Homo sapiens 


pnenylpyruvate tautomerase 11 




OA 


915 


AAR83048 


Homo sapiens 


Human macrophage migration 
inhibitory factor-3 (MIF-3). 


JH-O 


OO 


915 


gi2104581 


Homo sapiens 


piienylpyruvate tautomerase 11 


JHO 


OO 


915 


gi3047378 


Homo sapiens 


U-dopachrome tautomerase 


DHO 


OO 


916 


AAR89197 


TT * 

Homo sapiens 


Human nepatocelluiar growth tactor 
single chain precursor protein. 




Ql 

y i 


916 


gi2l968l 


Homo sapiens 


rlLrr activator precursor 




01 


916 


gi4 190954 


Homo sapiens 


hepatocyte growth factor activator 


3466 


91 


917 


gi5441937 


T T * 

Homo sapiens 


laminm beta precursor; similar to 
AAB92586 (PID:g2708707) 




i nn 
1UU 


917 


AAY15457 


Homo sapiens 


Human laminin beta 4 protein. 






917 


AAY15459 


Homo sapiens 


SEQ ID 5 of W09919347. 


6042 


97 


918 


AAG65887 


Homo sapiens 


Ammo acid sequence or GSK gene 
Id 14936. 


5050 


1 An 


918 


AAG65888 


Homo sapiens 


Ammo acid sequence of GSK gene 
Id 14936. 


45S2 


O /I 

o4 


918 


AAG68261 


Homo sapiens 


Human POLYS protein sequence 
SEQ ID NO: 16. 


yl C A o 

4543 


83 


919 


ABB74955 


Homo sapiens 


Human lung tumour L524S variant 
protein sequence SEQ ID NO: loo. 


905 


100 


919 


ABB74954 


Homo sapiens 


Human lung tumour L524S variant 
protein sequence 1JU> JN\J: ico. 


905 


100 


919 


AAE23751 


Homo sapiens 


Human parathyroid related peptide, 
PTHrP (1-173). 


905 


100 


920 


gi20269l29 


Homo sapiens 


MEGF6 


2584 


95 


920 


AAE17919 


T T * 

Homo sapiens 


Human gene 1 encoded serine 
protease, HMGBM65. 


z40o 


yy 


920 


AAY72091 


Homo sapiens 


Human serine protease #2 encoded 
by clone HMGBM65. 


24oo 




921 


ABB90774 


Homo sapiens 


Human Tumour Endothelial Marker 
polypeptiae obQ iu JNU 1 . 


340 


1 t\r\ 


921 


ABB90773 


TT * _ 

Homo sapiens 


Human 1 umour Endothelial Marker 
polypeptide biii^ iu jnw z /y. 




1 nn 


921 


ABB90772 


Homo sapiens 


Human l umour Jtsnaotheiiai ivrarKer 
poiypeptiae od^ iu rs\j z / / . 


^4fT 


1 nn 


yj,z, 


/\/v u y /yjjy 




Human TP nrntein TP190 


2054 


89 


922 


ABG30500 


Homo sapiens 


Human Carboxypeptidase A. 


2054 


89 


922 


AAB47565 


Homo sapiens 


Protease PRTS-7. 


2054 


89 


923 


AAM52240 


Homo sapiens 


Human MFAP4 SEQ ID NO 3. 


1198 


99 


923 


gi790817 


Homo sapiens 


microfibril-associated glycoprotein 4 


1198 


99 


923 


AAM52239 


Homo sapiens 


Human MAG4V SEQ ID NO 1. 


1197 


100 


924 


AAU81960 


Homo sapiens 


Human PR0536. 


1648 


100 


924 


AAB65173 


Homo sapiens 


Human PR0536 (UNQ337) protein 


1648 


100 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


% 
Identity 








sequence SEQ ID NO:97. 






924 


AAB94830 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15991. 


1648 


100 


925 


AAU 10497 


Homo sapiens 


Human Apohpoprotein A-II, 
APOA2. 


438 


90 


925 


gil3528981 


Homo sapiens 


apolipoprotein A-II 


438 


90 


925 


gi21429231 


Homo sapiens 


apolipoprotein All 


438 


90 


926 


AAY76156 


Homo sapiens 


Human secreted protein encoded by 
gene 33. 


419 


90 


927 


gi 13097252 


Homo sapiens 


Similar to FK506 binding protein 2 
(13 kDa) 


648 


91 


927 


gi337370 


Homo sapiens 


rapamycin- and FK506-binding 
protein 


648 


91 


927 


AAR93551 


Homo sapiens 


Human FKBP-13 immunophilin 
protein. 


628 


90 


928 


AAG73789 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:4553. 


230 


95 


928 


AAB53360 


Homo sapiens 


Human colon cancer antigen protein 
sequence SEQ ID NO:900. 


230 


95 


930 


gi 17982522 


Brucella 
melitensis 


(S)-2-hydroxy-acid oxidase chain D 


1151 


49 


930 


gi 177405 10 


Agrobacterium 
tumefaciens str. 
C58 (U. 
Washington) 


FAD dependent oxidoreductase 


1149 


49 


930 


gil5157181 


Agrobacterium 
tumefaciens str. 
C58 (Cereon) 


AGR_C_3718p 


1149 


49 


931 


ABB89770 


Homo sapiens 


Human polypeptide SEQ ID NO 
2146. 


366 


70 


931 


AAB75367 


Homo sapiens 


Human secreted protein #26. 


366 


70 


931 


AAU04353 


Homo sapiens 


Mammalian toxicological response 
marker protein #5. 


366 


70 


932 


ABB55723 


Homo sapiens 


Human polypeptide SEQ ID NO 52. 


1107 


99 


932 


AAU39014 


Homo sapiens 


Human secreted protein yc2 1. 


1107 


99 


932 


AAM94621 


Homo sapiens 


Human reproductive system related 
antigen SEQ ID NO: 3279. 


203 


67 


933 


gil6648246 


Drosophila 
melanogaster 


GH27263p 


228 


33 


933 


gil711197 


Xenopus laevis 


Xfringe2 


184 


25 


933 


gil 679784 


Xenopus laevis 


radical fringe 


184 


26 


934 


AAY73383 


Homo sapiens 


HTRM clone 2280456 protein 
sequence. 


1571 


100 


934 


AAG75282 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6046. 


619 


99 


934 


AAM41045 


Homo sapiens 


Human polypeptide SEQ ID NO 
5976. 


89 


29 


935 


AAG65916 


Homo sapiens 


Amino acid sequence of GSK gene 
Id 239881. 


3041 


99 


935 


gil5705411 


Homo sapiens 


peptidoglycan recognition protein L 
precursor 


3041 


99 


935 


AAG65915 


Homo sapiens 


Amino acid sequence of GSK gene 
Id 239881. 


2892 


99 


936 


AAU97218 


Homo sapiens 


Human G protein-coupled receptor, 
TGR25. 


850 


99 



I 
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SEQ 
ID 


Hit ID 1 


Species 


Description 


c 


/o 

a uc ii my 


936 


AAE23415 


Homo sapiens 


Human G-protein coupled receptor-7 


850 


99 


936 


ABB 12463 


Homo sapiens 


Human bone marrow expressed 
protein or*Kl ID jncj. d\jz. 


769 


100 


937 


gil7H232 


Homo sapiens 


1 4-3-3 protein eta cnam 




OA 


937 


gi460779 


Homo sapiens 


14-3-3 eta subtype 


1 094 


94 


937 


gil 477931 


Homo sapiens 


14.3.3 eta chain 


iuy4 


y4 


938 


gil61 18441 


Oryctolagus 
cuniculus 


S-100 calcium-binding protein beta 
subunit 


479 


100 


938 


gil2804681 


Homo sapiens 


SI 00 calcium-binding protein, beta 
(neural) 


479 


100 


938 


gi337730 


Homo sapiens 


SI 00 protein beta subunit 


479 


100 


939 


AAW75082 


XT * 

Homo sapiens 


Human secreted protein encoded by 
gene Zo clone Jn.lLr.bV iz. 




09 

i 


939 


AAM95398 


Homo sapiens 


Human reproductive system related 
antigen bEQ ID Nu: 4Udo. 


71 


50 


939 


AAU29184 


Homo sapiens 


Human PRO polypeptide sequence 
#161. 


/I 


JU 


940 


AAU29315 


TT TT * 

Homo sapiens 


Human PRO polypeptide sequence 

#2yz. 


07/10 


QQ 


940 


ABB75753 


Homo sapiens 


Human pancreas GP354. 


3104 


100 


940 


ABB75751 


T T * 

Homo sapiens 


Human immunoglobulin supertamily 
member GP354. 






941 


AA021477 


T T * 

Homo sapiens 


Human NgR2 protein sequence. 






941 


AA021482 


Homo sapiens 


Mature human NgR2 protein 
sequence. 


2082 


100 


941 


gi20987877 


Mus musculus 


similar to Nogo receptor 


885 


58 


942 


AAB25674 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 10 SEQ ID NO:63. 


/DO 


yo 


942 


AAB36613 


Homo sapiens 


Human FLEXHT-35 protein 
sequence SEQ ID NO:35. 


0 


yu 


942 


gil 4603247 


Homo sapiens 


Similar to RlisJiJN cUNA 
5730409G15 gene 


Zo5 


QQ 

yu 


943 


AAE22093 


XX * 

Homo sapiens 


Human kidney specific renal cell 
carcinoma (RoKa^oj protein. 


1 


ys 


943 


A A xrnr /'no 

AAY85678 


Homo sapiens 


Jtluman Kiuney aisease associaxea 
protein SEQ ID 10. 






943 


gi3 127193 


Rattus 
norvegicus 


kidney-specific protein 


OoO 




944 


ABP41513 


Homo sapiens 


Human ovarian antigen HCOOX52, 
SEQ ID NO:2645. 


2558 


100 


944 


gil 197499 


Homo sapiens 


CI inhibitor 


2527 


100 


944 


AAW18207 


Homo sapiens 


Wild-type CI inhibitor. 


2524 


99 


945 


gil 79 82052 


Brucella 
melitensis 


RIBOSOMAL PROTEIN LI l 
METHYLTRANSFERASE 


241 


36 


945 


gil5157854 


Agrobactenum 
tumefaciens str. 
\^do ^cereon) 


AGR_C_4799p 


207 


36 


945 


gil 8 145768 


Clostridium 
perfringens str. 
13 


probable ribosomal protein LI l 
methyltransferase 


118 


37 


946 


AAB03948 


Homo sapiens 


Human mesenchymal stem cell 
polypeptide. 


462 


98 


946 


AAB64909 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO: 87. 


165 


63 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


% 
Identity 


946 


AAU01649 


Homo sapiens 


Human secreted protein 
immunogenic epitope encoded by 
gene #12. 


165 


63 


947 


AAE13809 


Homo sapiens 


Human lung tumour-specific protein 
SAL-82. 


2906 


99 


947 


AAB44467 


Homo sapiens 


Human lung tumour- specific antigen 
encoded by cDNA #103. 


2906 


99 


947 


gil3958036 


Homo sapiens 


RUFY1 


zyUo 


on 

yy 


949 


AAB 88326 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC0021. 


541 




949 


AAB92475 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 10549. 


541 


74 


949 


AAE03226 


Homo sapiens 


Human gene 7 encoded secreted 
protein HN ID.L21, SEQ ID NO: /o. 


541 


74 


950 


ABP47877 


Homo sapiens 


Human polypeptide SEQ ID NO 307. 


1303 


100 


950 


ABP47870 


Homo sapiens 


Human polypeptide SEQ ID JNU 3UU. 


yoy 


yy 


950 


ABP48038 


Homo sapiens 


Human polypeptide SEQ ID NO 468. 


835 


98 


951 


AAU12382 


Homo sapiens 


Human PR0792 polypeptide 
sequence. 


1 1 o o 

H38 


87 


951 


AAB24416 


Homo sapiens 


Human PR0792 protein sequence 
SEQ ID NO:155. 


1138 


o/ 


951 


AAB24055 


Homo sapiens 


Human PR0792 protein sequence 
SEQ ID NO:31. 


1138 


87 


952 


gi7670746 


Homo sapiens 


UDP-glucose glycoprotein 
glucosyltransferase 1 precursor 


7974 


98 


952 


gi 13275621 


synthetic 
construct 


Rat RUGT 


7317 


91 


952 


gi7677176 


Rattus 
norvegicus 


UDP-glucose 

glycoprotein: glucosyltransferase 
precursor 


7317 


91 


953 


gi208 10499 


Homo sapiens 


Similar to RIKEN cDNA 
2900074C18gene 


1237 


100 


953 


AAM42005 


Homo sapiens 


Human polypeptide SEQ ID NO 
6936. 


404 


48 


953 


AAB64390 


Homo sapiens 


Ammo acid sequence of human 
intracellular signalling molecule 
INTRA22. 


402 


50 


954 


gi 18676660 


Homo sapiens 


FLJ00229 protein 


2292 


oo 


954 


AAM40342 


Homo sapiens 


Human polypeptide SEQ ID NO 
3487. 


1454 




954 


AAM42128 


Homo sapiens 


Human polypeptide SEQ ID NO 
7059. 


765 


60 


955 


gi2 17072 16 


Homo sapiens 


LOC146556 


2100 


99 


955 


AAG81399 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:316. 


1910 


100 


955 


AAB61421 


Homo sapiens 


Human TANGO 300 protein. 


1904 


99 


956 


gi2 1667020 


Homo sapiens 


mutant I beta- 1,6-N- 
acetylglucosammyltransferase C 
form 


1637 


99 


956 


gi2 16670 11 


Homo sapiens 


Ibeta-1,6-N- 

acetylglucosaminyltransferase C 
form 


1637 


99 


956 


gi2 16670 15 


Homo sapiens 


mutant I beta-l,6-N- 
acetylglucosaminyltransferase C 
form 


1633 


99 
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ID 


xlll 1U 


o pt;C lea 




s 

score 


Identity 


G^7 


glZUOD 1 OD 


nomo sapiens 


collagen iypc -/VI v 


773 


93 


957 


AAG00322 


Homo sapiens 


Human secreted protein, SEQ ID 
>jn- <d4ni 

XNW. ntuJ. 


482 


98 


957 


gi2S8875 


Gallus gallus 


collagen XIV 


461 


63 


958 


AACj/55Uy 


riomo sapiens 


riuman coion cancer anugen protein 


74R7 


Q4 


95c 


AAxJo4oUo 


nomo sapiens 


nllllllO aCIQ. ScmJCnt/C yJL illctLIi.A 

metalloproteinase collagenase 1. 


748? 


04 


93o 


A A t? 1 H/l 1 C 


riomo sapiens 


riuman mau/ix mexduopiuLinaoc-i 
(MMP-1) protein. 


74S? 


94 


y:>y 


AAYZDoDo 


nomo sapiens 


Wnman c£*r*t~^4'^f'i nrrttpin "Rrsi om f^trl - 

nuniaii scurctcu. pruioni iictgniwiiL 

encoded from gene 57. 




99 




gllD 14j /y / 


oiis scroia 


DaSic proime~i proLciii 


oo 


42 


959 


gi3413810 


Mus musculus 


Bassoon 


84 


31 


960 


AACj77o07 


Homo sapiens 


riuman rioiri_»Dy serme/inreonme 
pnospndLase pioicin bCL^uciice. 


47^ 


OR 


you 


A A D^OQQ 


riomo sapiens 


r^ung Lancer d.i>L>uL»id,icu. pviij'pcpi.iuc 
sequence SEQ ID 626. 




98 


yoo 


A AV^Q7/1/I 


riomo sapiens 


Unman Mrtft^dl rt^/otiotl tlCClTP* ft t*1*1 "t/f^ f^l 

nunisn norniai ovariaii uoouc ucnvcu 
protein 21. 


47^ 


98 


yoi 


gl4Uiy24/ 


Aienne 
herpesvirus 3 


myuiiuiiic iviiiabc 


71 

/ 1 


46 


yoz 


giyyjoyjo 


^ A 1 1 tr* m no /%i line 

ivius muscuius 


C 11 /',r C , r ^'2_Q Vir»mn1ocr < 3nv^OTi9 


L J 1 U 


86 


962 


AAM78701 


Homo sapiens 


Human protein SEQ ID NO 1363. 


1236 


100 


962 


AAWlo2ol 


riomo sapiens 


v^nroniaLin regulator protein 
oUv jyri. 


S14 
o it 




963 


AAG03840 


Homo sapiens 


Human secreted protein, SEQ ID 

KID- 7071 


439 


80 


yoi 


giioyooDo 


nomo sapiens 


iNAurn, udil[uiiiuiic uaiuui euuuiaoo 
subunitB13 


419 


80 


yoi 


rwi 1 O^^A AO'} 

gll/0D4UZJ 


nomo sapiens 


LyJrxLJxi. tlcny til vJ^CilaoC ^UUl(^[Uiiiuiiw^ 

1 alpha subcomplex, 5 (13kD, B13) 


419 


80 


yo4 


rrJ '271/17 
gU/34/ 


riomo sapiens 


i xvv_J gamma Cllalll ^jrVrv l-Jtuy 


1 657 


94 


yo4 


gl33y4U / 


riomo sapiens 


ii antigen *^t-j»o~d.baut'i<xLt;(j. ptuLcm 

pi CU Ui o\JI 


1677 


91 


yo4 


glo / U 1 o 


nuiiio oopiCllb 


pi C~g«llllllct C'lldlll y^rx/Tk. l i HJ y 


1436 


87 


965 


AAE06606 


Homo sapiens 


Human protein having hydrophobic 

remain WP107QJ. 

uornain, rur 


566 


96 


965 


AAM94615 


Homo sapiens 


Human reproductive system related 


566 


96 


965 


gil 8490535 


Mus musculus 


RIKEN cDNA 26 10528 Jl 1 gene 


388 


78 


yoo 


glL3o040oy 


r>os taurus 


I 1 firl 11/11 If/Ml T /l f"*^ ^7~\1 avQ C 1 i^i 

, JL^-giuutiroriyi y^ .j cpimcrcist; 


11 


97 


966 


gi 13442978 


Mus musculus 


D-glucuronyl C5-epimerase 


3139 


95 


yoo 


gil lyjj I / / 


ivius muscuius 


nepann/neparan sunate. glucuronic 
acid C5 epimerase 


11 14 


y*j 


967 


A A P770A/1 


nomo sapiens 


numan onactory receptor 
poiypepiiGe, oriv^ ijj inu. iooj. 


1 167 


11 


967 


A AG71875 


Homo saniens 


Human olfactorv receotor 
polypeptide, SEQ ID NO: 1556. 


1138 


74 


967 


AAG71816 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1497. 


1102 


72 


968 


A AB 73679 


Homo sapiens 


Human oxidoreductase protein ORP- 
12. 


1918 


100 


968 


gil2655133 


Homo sapiens 


Similar to CGI-63 protein 


1918 


100 


968 


ABB89788 


Homo sapiens 


Human polypeptide SEQ ID NO 


1907 


99 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


% 
Identity 








2164. 






969 


AAY28995 


Homo sapiens 


Tumour suppressor Del-27 protein 
sequence. 


2309 


59 


969 


AAY15344 


Homo sapiens 


Tumour suppressor protein del-27. 


2309 


59 


969 


gi6062874 


Homo sapiens 


candidate tumor suppressor protein 
DICE1 


2309 


59 


970 


gi387011 


Homo sapiens 


pyruvate dehydrogenase El -alpha 
precursor 


2187 


99 


970 


gi35381 


Homo sapiens 


pyruvate dehydrogenase El' subunit 
(AA 1 - 390) 


2049 


100 


970 


gi2 19982 


Homo sapiens 


alpha subunit 


2049 


100 


971 


gi2275569 


Homo sapiens 


TCRBV23S1 


660 


99 


971 


gi2104755 


Homo sapiens 


T cell receptor V-beta 23 


660 


99 


971 


gi495404 


Pan troglodytes 


T cell receptor beta chain 


657 


88 


972 


AAM95505 


Homo sapiens 


Human reproductive system related 
antigen SEQ ID NO: 4163. 


1469 


99 


972 


ABB96188 


Homo sapiens 


Human testicular antigen SEQ ID 
NO: 1572. 


1469 


99 


972 


AAB75360 


Homo sapiens 


Human secreted protein #19. 


1272 


100 


973 


AAE02937 


Homo sapiens 


Human TFRP protein. 


3040 


94 


973 


gi4996563 


Homo sapiens 


protein inhibitor of activatied STAT3 


3040 


94 


973 


gil2654633 


Homo sapiens 


protein inhibitor of activated STAT3 


3040 


94 


974 


gil 5099957 


Homo sapiens 


diacylglycerol acyltransferase 2-like 
protein 


855 


93 


974 


gil 5099955 


Mus musculus 


diacylglycerol acyltransferase 2-like 
protein 


675 


59 


974 


AAY94889 


Homo sapiens 


Human protein clone HP02485. 


576 


56 


975 


AAB92669 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11033. 


1432 


100 


975 


ABG40833 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
30498. 


399 


98 


975 


AAM31319 


Homo sapiens 


Peptide #5356 encoded by probe for 
measuring placental gene expression. 


399 


98 


976 


gi4210474 


Navel orange 
infectious 
mottling virus 


polyprotein 


71 


21 


977 


gil 209685 


Homo sapiens 


salivary peroxidase 


3754 


96 


977 


gil63307 


Bos taurus 


lactoperoxidase 


3166 


80 


977 


gil 1990122 


Camelus 
dromedarius 


peroxidase 


3153 


81 


978 


AAY01604 


Homo sapiens 


Amino acid sequence of the human 
defensin (Def-X) protein. 


501 


97 


978 


gi29735 


Homo sapiens 


corticostatin/defensin HP-4 precursor 


214 


46 


978 


AAR15222 


Homo sapiens 


Chronic myelogenous leukaemia- 
derived myeloid-related protein. 


211 ■ 


47 


979 


AAG81415 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:348. 


848 


100 


980 


gi6633820 


Arabidopsis 
thaliana 


F1N19.20 


79 


26 


980 


gi6 180001 


Capra hircus 


pregnancy-associated glycoprotein- 8 


73 


30 


980 


gi20071290 


Mus musculus 


Similar to solute carrier family 26, 
member 7 


71 


25 


981 


gi4309953 


Homo sapiens 


T cell receptor gamma chain; similar 
to PID:g339160 


951 


95 
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ID 


Hit ID 
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Description 

— 


S 

score 


% 
laentity 


981 


gi4309950 


T T • 

Homo sapiens 


T cell receptor gamma chain; match 
to ISUoJzo (JrlL):gluo4/UJ 


y i / 


DO 


981 


ABG39493 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe ID 
29158. 


679 


77 


982 


gil223888 


synthetic 
construct 


T cell receptor alpha chain 


1 r»/C£ 
lUOO 


/y 


982 


gi338766 


TT_ « _ 

Homo sapiens 


T-cell receptor precursor 


yb / 


ID 


982 


gi3089419 


Homo sapiens 


T cell receptor alpha chain 


974 


71 


983 


ABB07518 


Homo sapiens 


Human drug metabolizing enzyme 
(DME) (ID: 7474438CD1). 


A 1 O 

41o 


zy 


983 


AAB73512 


TT * 

Homo sapiens 


tt„ -i t* tTTIjC 1 A O T~? (~\ 

Human transferase HTTo-19, oLQ 

JUL) JN<J: iy. 


A AO 

4uy 


OA 


983 


gi 14249942 


Homo sapiens 


Similar to KlivbN cjjna 
uoiuuuor lo gene 




jU 


984 


A A Hi lot 

AAG71286 


Homo sapiens 


Human gene y-encoaea secreiea 
protein HMSDL37, SEQ ID NO: 135. 


Oil 


yu 


9o4 


A A rf^"71 Off 1 

AA071251 


Homo sapiens 


Human gene y-encoueo secreiea 
proiem o.ivioLJi_»j / 3 ocv iu L\\j,yy. 


-5 11 


QH 


no A 

ys4 




ivius mus cuius 


ijimi iar to naorin 


74 


*T 1 T 


985 


gi45 19541 


Mus musculus 


thrombospondin type 1 domain 


1296 


88 


985 


ABrolo46 


Homo sapiens 


Human poiypeptiae oiiv iu jnl^ zuu. 


AAA 


to 


985 


AAB99220 


Homo sapiens 


Human thrombospondin-30. 


644 


46 


986 


AAG81417 


Homo sapiens 


Human AFP protein sequence SEQ 

TT\ VTA.') C ^> 

ID NO:352. 


287 


100 


987 


ABB53264 


Homo sapiens 


Human polypeptide #4. 


1904 


100 


987 


gi 18565270 


T X * 

Homo sapiens 


Lib 


1904 


i aa 
100 


987 


gil 8565266 


Rattus 

* 

norvegicus 


Lib 


1621 


83 


988 


AAB59022 


Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence SEQ ID 
730, 


1385 


100 


988 


gi 14124958 


Homo sapiens 


nucleotide binding protein 2 (E.coli 
MinD like) 


1385 


100 


j~i 

988 


gil3559170 


T T * 

Homo sapiens 


C447E6. 1 (nucleotide binding 
protein 1 (E.coli MinD like) ) 


1 1 O A 


1 AA 

100 


>-v 

989 


AAU99292 


T T * 

Homo sapiens 


Human chordm-like associated 
protein ffl. 


391o 


1 AA 

100 


989 


AAU99293 


Homo sapiens 


Human chordin-like associated 
protein #2. 


3073 


81 


989 


AAE07119 


Homo sapiens 


Human gene 12 encoded secreted 
protein iragrnent, briv 1L> iNvJ.ioO. 


241 1 


98 


990 


AAB26105 


Homo sapiens 


Human DAN/Cerberus-related 
protein 6 (hDCKo) #1. 


1693 


87 


990 


AAE 17089 


Homo sapiens 


Human osteolevin protein. 


Aid 

4jy 


00 


i"V /"X /X 

990 


ABB07209 


Homo sapiens 


Human cloaked- 2 polypeptide 
sequence. 


43 y 


oo 


991 


giyyo4uyy 


^niamydia 
trachomatis 


inclusion mem Drane protein 




OD 


992 


gi21619848 


Homo sapiens 


Similar to immunoglobulin lambda 
joining 3 


1002 


89 


992 


gi4490538 


Homo sapiens 


lambda-immunoglobulin liglit chain 


927 


81 


992 


gi33746 


Homo sapiens 


immunoglobulin lambda light chain 


914 


80 


993 


AAG03466 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7547. 


97 


100 
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993 


gi4063766 


Aspergillus 
niuuians 


cmtmase 


01 

( 




993 


gi36 17766 


Anopheles 
gamoiae 


ICHIT protein 


88 


27 


994 


gl2U:524o 


Kattus 
norvegicus 


ecu auiiesion-iiKe moiecuic 




97 


994 


gl5143 /4 


Homo sapiens 


opioio-Dinamg ceu aanesiun 
molecule 


1 665 

1 UUJ 


Q7 


994 


gi586 


Bos taurus 


put. pre-UJr t-Alvl (AA 1 - jhD J 






995 


AAB53088 


Homo sapiens 


Human angiogenesis-associated 
protein rKUj/o, otiv i^ inlj.jl.jz. 


2391 


91 


995 


AAB 80260 


Homo sapiens 


riuman rKU32o protein. 


9^Q1 




995 


AAU12351 


Homo sapiens 


Human PR0328 polypeptide 
sequence. 


2391 


91 


996 


AAB85144 


Homo sapiens 


Human NKCR polypeptide (clone ID 

rllVl o LJLvlj j ) . 


1205 


83 


996 


AAY96226 


Homo sapiens 


Human high affinity Fc receptor, 
r cgammaKJ . 


354 


41 


996 


gi31332 


Homo sapiens 


FcRI (AA 1-374) 


354 


41 


997 


ABB50835 


Homo sapiens 


Human secretea protein encoaea oy 
gene 80 SEQ ID NO:788. 


1A 
/4 




998 


ABP61434 


Homo sapiens 


Human JNr-K±> activating protein 
SEQ ID NO 21. 


Hi / 


i c\n 

L\J\J 


998 


A AT A A *~\ 

AAU07442 


Homo sapiens 


Human Wnti upreguiateo. protein z 
CWUP2). 


HI 1 


1 nn 


998 


* AT T/f"7 A A t 

AAU07441 


Homo sapiens 


Human wnti upreguiatea protein i 

^ W Uri ). 


J.1 7 


1 on 


999 


AAB08732 


Homo sapiens 


Amino acid sequence of a human 

f~\T T\ K -r>^Urr\o-n+! Acs. 

poiypeptiue. 


1671 


85 


999 


■ o /~\ n '■toon 
gl2U3 72922 


Homo sapiens 


polynucleotide pnospnoryid.se-iiKc 
protein 


1 671 


S5 


999 


AAB92684 


Homo sapiens 


Human protein sequence ^tzy^ lu 
NO: 11065. 


IZDJ 


o o 


1000 


gi 1995 82 


Mus musculus 


i5(^z ^-microgioDuiin 






1000 


gi50105 


Mus musculus 


beta2-microglobulin precursor (aa - 
zu to y y ^ 


614 


95 


1000 


gil99576 


Mus musculus 


B(2)-microglobulin 


609 


94 


1001 


AAB93335 


Homo sapiens 


Human protein sequence oxiv^ iu 

JNU.IZ44I. 


679 
O /Z 


QQ 

yy 


1001 


AAE21620 


Homo sapiens 


Human gene 7 encoded secreted 
protein, oJb^ li-' iNu.yz. 


427 


91 


1001 


AAE216U4 


Homo sapiens 


Human gene / encoaea secreteu 
protein niruiioj, ohk,/ llj imu. /u. 


40? 


QO 


1002 


gil5559608 


Homo sapiens 


Similar to zinc finger protein 16 

^JS-WyV ¥ ) 


2079 


100 


1002 


gi488555 


Homo sapiens 


zinc finger protein ZNF135 


749 


65 


1002 


AAB21006 


Homo sapiens 


Human nucleic acia-bmaing protein, 
NuABP-10 


7J.7 




1003 


ABB79480 


Homo sapiens 


Human zinc finger protein 75.68. 


1689 


56 


1003 


gil 769491 


Homo sapiens 


kruppel-related zinc finger protein 


1672 


53 


1003 


AAM39130 


Homo sapiens 


Human polypeptide SEQ ID NO 
2275. 


1669 


52 


1005 


AAB23641 


Homo sapiens 


Human secreted protein SEQ ID NO: 
97. 


609 


100 


1005 


gil 3 129458 


Oryza sativa 


| polyprotein 


89 


31 
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[Oryza sativa 

(japonica 

cultivar-group)] 








1005 


gi7228457 


Oryza sativa 

(japonica 

cultivar-group) 


Similar to Sorghum bicolor Gypsy- 
Ty3 type retrotransposon RetroSorl, 
polyprotein. (AF098806) 


87 


29 


1006 


gil 1493473 


Homo sapiens 


PR02225 


163 


67 


1007 


AAM52305 


Homo sapiens 


Human zyxine. 


2815 


91 


1007 


AAG68191 


Homo sapiens 


Zyxm protein SEQ ID NO: 107. 


■O O 1 c 

2815 


91 


1007 


gil 545954 


Homo sapiens 


zyxin 


2815 


91 


1008 


AAM0O955 


Homo sapiens 


Human bone marrow protein, SEQ 
ID NO: 431. 


704 


1UU 


1008 


gil 8028488 


Homo sapiens 


cytosolic leucme-nch protein 


631 




1008 


gi2 16663 64 


Bos taurus 


leucine-rich protein 


588 


90 


1009 


AAE01420 


Homo sapiens 


Human secreted protein fragment, 
SEQ ID NO:l44. 


612 


100 


1009 


gil 7225457 


Homo sapiens 


autism-related protein l 


76 


38 


1009 


AAM79126 


Homo sapiens 


Human protein SEQ ID NO 1 78 8. 


72 


29 


1010 


gil6877231 


Homo sapiens 


Similar to RIKEN cDNA 
27000 19D07 gene 


997 


100 


1010 


AAM39593 


Homo sapiens 


Human polypeptide SEQ ID NO 
2738. 


94 


100 


1010 


gi7302568 


Drosophila 
melanogaster 


CG15073-PA 


91 


19 


1011 


AAB43434 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 879. 


413 


78 


1011 


AAY07039 


Homo sapiens 


Breast cancer associated antigen 
precursor sequence. 


413 


78 


1011 


gil 7932966 


Homo sapiens 


ribosomal protein PI 


413 


78 


1012 


AAW88457 


Homo sapiens 


Human lysophospholipase IHLP. 


1125 


/AO 

93 


1012 


ABP51416 


Homo sapiens 


Human MDDT SEQ ID NO 438. 


843 


100 


1012 


AAB75386 


Homo sapiens 


Human secreted protein #45. 


825 


100 


1013 


AAG81374 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:266. 


919 


98 


1013 


gi20977549 


Danio rerio 


DT1P1 AlO-like protein 


354 


41 


1013 


gi665970 


Saccharomyces 
cerevisiae 


Ylr435wp 


169 


26 


1014 


gi20988991 


Mus musculus 


RIKEN cDNA 2810405K02 gene 


920 


88 


1014 


AAM93895 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
4031. 


111 


100 


1014 


ABB84903 


Homo sapiens 


Human PRO 1198 protein sequence 
SEQ ID NO: 174. 


187 


36 


1015 


AAB56791 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1369. 


496 


98 


1015 


gil9401678 


Giardia 
intestinalis 


endosomal AAA ATPase-like protein 


76 


38 


1016 


ABB44579 


Homo sapiens 


Human wound healing related 
polypeptide SEQ ID NO 36. 


655 


100 


1016 


AAB53427 


Homo sapiens 


Human colon cancer antigen protein 
sequence SEQ ID NO: 967. 


655 


100 


1016 


gi2286227 


Bos taurus 


myocardial vascular inhibition factor 


655 


100 


1017 


AAB81188 


Homo sapiens 


Human zinc finger protein 52 (ZFP- 
52). 


2389 


98 


1017 


AAB95368 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17684. 


1712 


100 
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1017 


ABP51456 


Homo sapiens 


Human MDDT SEQ ID NO 478. 


1561 


99 


1018 


gi6434346 


Caenornabditis 
elegans 


Y105E8B.4 


T7 
/ / 


'X'X 

DO 


1019 


AAG66831 


T T * 

Homo sapiens 


Human DN A~aependent protein 
kinase y. 




yo 


1019 


AAB90816 


Homo sapiens 


iluman snear stress-iesponse protein 




TOO 


1019 


AAG66832 


Homo sapiens 


xiuman jjjn A-uepenaenx proxem 
icnase y iN-ierminai peptiuc 






1020 


gi 1377897 


Homo sapiens 


heart protein 


1631 


100 


1020 


gi7209525 


Homo sapiens 






QQ 

yy 


1020 


gi5 825391 


Mus rnusculus 


four and half LIM domain protein 2 


1524 


91 


1021 


gil 8 139947 


TT . ? 

Homo sapiens 






QQ 


1021 


gi7770259 


Homo sapiens 


PRQ2975 


794 


100 


1021 


gil237130 


Escherichia coh 


O antigen polymerase 






1022 


gil 92637 12 


Homo sapiens 


Similar to LOC 146557 


972 


100 


1022 


AAG81348 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:214. 


coo 


02 


1022 


AAM88837 


Homo sapiens 


Human immune/haematopoietic 
antigen SEQ ID NO: 16430. 


1 70 

i /y 


oU 


1023 


AAG01390 


Homo sapiens 


Human secreted protein, SEQ ID 

X T/~\ . C A *7 1 

NO: 5471. 


297 


100 


1023 


gi456681 


Pseudorabies 
virus 


helicase 


OA 

oU 


Zj 


1023 


AAM85692 


X T 

Homo sapiens 


Human imrnune/haernatopoietic 
antigen SEQ ID NO:l3285. 


/o 


£1 
OJ 


1024 


gi4235144 


XX * 

Homo sapiens 


BC39498_1 




60 


1024 


gi21265141 


X X 

Homo sapiens 


Similar to zinc finger protein 91 
(HPF7, HTF10) 




60 
OU 


1024 


gil4348591 


XT ' 

Homo sapiens 


KRAB zinc finger protein 


I4U3 


JO 


1025 


gil 8490643 


XT * 

Homo sapiens 


Similar to recombination activating 
gene 2 




QQ 

yy 


1025 


gil65680 


Oryctolagus 
cuniculus 


recombination activating protein 


2666 


91 


1025 


gi2576246 


Mus rnusculus 


RAG-2 protein 


2594 


88 


1026 


AAG75278 


XX • 

Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6042. 


1 n^o 


Q^ 


1026 


gi600255 


Gallus gallus 


caldesmon 


1 An 




1026 


gi211896 


Gallus gallus 


h-caldesmon 


147 


28 


1027 


AAY87341 


Homo sapiens 


XX _ ■ -i i • 1 _ __„j • * „ 

Human signal peptide containing 
protein HSPP-1 18 SEQ ID NO: 118. 


oyy 


QQ 

yy 


1027 


gil2311853 


Leishmania 
major 


possible surface antigen 


it 


1 1 


1028 


AAM92844 


TT • 

Homo sapiens 


XT . T * - »' j * * . 

Human digestive system antigen 
SEQ ID NO: 2193. 


OO 


3o 


1028 


ABB11195 


T T — ■ 

Homo sapiens 


Human transmembrane protein 
homologue, SEQ ID NO: 1565. 


CO 

oz 




102o 


A 1 *\ 

Ah>VJDOol3 


nomo sapiens 


riuiiicin pruoLdLc spe^mc protein 
DEX0283 123. 


75 


60 


1029 


AAE02058 


Homo sapiens 


Human four disulfide core domain 
(FDCD)-containing protein. 


587 


41 


1029 


AAM79986 


Homo sapiens 


Human protein SEQ ID NO 3632. 


579 


41 


1029 


AAM79002 


Homo sapiens 


Human protein SEQ ID NO 1664. 


579 


41 


1031 


ABP42929 


Homo sapiens 


Human ovarian antigen HPDRS87, 
SEQ ID NO:4061. 


820 


93 
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1031 


ABP41129 


Homo sapiens 


Human ovarian antigen HE2RG21, 
SEQ ID NO:2261. 


820 


93 


1031 


AAU01195 


Homo sapiens 


Human cyclophilin A protein. 


820 


93 


1032 


AAE06643 


Homo sapiens 


Human G-protein coupled receptor 
(NGPCR) #2, 


671 


99 


1032 


AAE06642 


Homo sapiens 


Human G-protein coupled receptor 
(NGPCR) #1. 


671 


99 


1032 


gi7291589 


Drosophila 
melanogaster 


CG18679-PA 


214 


34 


1033 


AAG03055 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7136. 


269 


100 ' 


1034 


AAB38043 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 10 clone 
HWHGP71. 


125 


36 


1034 


gi5305335 


Mycobacterium 
tuberculosis 


proline-rich mucin homolog 


105 


33 


1034 


gi59 17666 


Zea mays 


extensin-like protein 


104 


37 


1035 


AAM93942 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
4126. 


3226 


99 


1035 


ABB 11422 


Homo sapiens 


Human Zn finger protein homologue, 
SEQ ID NO: 1792. 


2760 


96 


1035 


gi6467206 


Homo sapiens 


gonadotropin inducible transcription 
repressor-4 


2032 


57 


1036 


AAB95007 


Homo sapiens 


Human protein sequence SEQ ID 
NO:16685. 


518 


86 


1036 


gi21410398 


Mus musculus 


RIKEN cDNA 2610034E13 gene 


81 


32 


1036 


gi45906 


Proteus vulgaris 


hlyC protein (AA 1-54) 


72 


45 


1037 


AAY27616 


Homo sapiens 


Human secreted protein encoded by 
gene No. 50. 


562 


99 


1037 


gil7902598 


Rice black 
streaked dwarf 
virus 


P6 protein 


71 


25 


1038 


gil6588681 


Homo sapiens 


anion transporter/exchanger-9 


4295 


95 


1038 


AAE21166 


Homo sapiens 


Human TRICH-10 protein. 


3612 


91 


1038 


gi 13344999 


Homo sapiens 


solute carrier family 26 member 6 


1298 


37 


1039 


AAU 12254 


Homo sapiens 


Human PR04343 polypeptide 
sequence. 


780 


100 


1039 


AAM40835 


Homo sapiens 


Human polypeptide SEQ ID NO 
5766. 


780 


100 


1039 


AAY76141 


Homo sapiens 


Human secreted protein encoded by 
gene 18. 


780 


100 


1040 


ABB89694 


Homo sapiens 


Human polypeptide SEQ ID NO 
2070. 


622 


77 


1040 


AAY59672 


Homo sapiens 


Secreted protein 108-006-5-0-E6-FL. 


622 


77 


1040 


AAB94543 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15290. 


618 


76 


1041 


AAY92710 


Homo sapiens 


Human membrane-associated protein 
Zsig24. 


704 


97 


1041 


ABB89722 


Homo sapiens 


Human polypeptide SEQ ID NO 
2098. 


566 


99 


1041 


AAY87250 


Homo sapiens 


Human signal peptide containing 
protein HSPP-27 SEQ ID NO:27. 


566 


99 


1042 


AAU29316 


Homo sapiens 


Human PRO polypeptide sequence 
#293. 


2932 


99 


1042 


ABB05749 


Homo sapiens 


Human G protein-coupled receptor 


1591 


44 
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NOVla protein SEQ ID NO:2. 






1042 


gil4572521 


Homo sapiens 


NEPH1 


ij iy 


^ i 
D l 


1043 


AAU00688 


Homo sapiens 


Human CD59 protein. 


710 


100 


1043 


AAW26318 


Homo sapiens 


Human CD59. 


710 


1 A.A 


1043 


AAR80240 


Homo sapiens 


Human membrane attack complex 
inhibition factor. 


710 


100 


1044 


gil7390957 


Mus musculus 


Similar to RIKEN cDNA 
2010001E11 gene 


1 AC C 

1455 


/4 


1044 


gi6841140 


Homo sapiens 


HSPC100 


498 


1 AA 
100 


1044 


gil 7985273 


Brucella 
melitensis 


GLUCOSE/GALACTOSE 
TRANSPORTER 


124 


22 


1045 


AAB56632 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1210. 


3377 


99 


1045 


gil 3097708 


Homo sapiens 


ribopnorm II 


O 1 CO 

3152 


1 AA 


1045 


gi4730801 


Homo sapiens 


dJ343K2.2.1 (nbophorm II (isoform 

D) 


O 1 -CO 

3152 


1 AA 


1046 


AAB70690 


Homo sapiens 


Human hDPP protein sequence SbQ 
ID NO: 7. 


coo 




1046 


AAG89279 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 399. 


C AO 


1 AA 


1046 


gil 31 82757 


Homo sapiens 


w 1 -1 — ■■ PP» P^ w 1 — W — 

HTPAP 


cao 

598 


1 AA 


1047 


gi2276448 


Homo sapiens 


MHC class I HLA-A 


1794 




1047 


gi6815812 


Homo sapiens 


MHC class I antigen heavy chain 


1794 




1047 


gil245460 


Homo sapiens 


MHC class I HLA-A 


1786 


no 
92 


1048 


ABP41629 


Homo sapiens 


Human ovarian antigen HOOJQ91, 
SEQ ID NO:2761. 


675 


90 


1048 


AAB95392 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17743. 


564 


78 


1048 


AAM79768 


Homo sapiens 


Human protein SEQ ID NO 3414. 


CCA 

564 


/o 


1049 


gil 40 17773 


Mus musculus 


Cg 10671 -like 


1517 


96 


1049 


gil4017764 


Mus musculus 


CGI 0671 -like 


1517 


9o 


1049 


ABB89676 


Homo sapiens 


Human polypeptide SEQ ID NO 
2052. 


957 


OA 

89 


1050 


AAG81431 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:380. 


503 


97 


1050 


AAE23305 


Homo sapiens 


Human nectin-4 protein #4. 


128 


32 


1050 


gil9353148 


Mus musculus 


Similar to pohovirus receptor-related 
4 


127 


27 


1051 


gi20072551 


Mus musculus 


RIKEN cDNA 493 0511 Jll gene 


/l OA 

420 




1051 


gil7974542 


Homo sapiens 


voltage-dependent calcium channel 
gamma- 8 subunit 


147 


25 


1051 


gil2836893 


Gallus gallus 


IPR328-like protein 


147 


29 


1052 


ABB84978 


Homo sapiens 


Human PRO4430 protein sequence 
SEQ ID NO:324. 


436 


70 


1052 


ABB95584 


Homo sapiens 


Human angiogenesis related protein 
PRO4430 SEQ ID NO: 324. 


436 


70 


1052 


AAU29273 


Homo sapiens 


Human PRO polypeptide sequence 
#250. 


436 


70 


1053 


AAB88325 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC0020. 


912 


99 


1053 


AAB53257 


Homo sapiens 


Human colon cancer antigen protein 
sequence SEQ ID NO:797. 


859 


99 


1053 


AAY87264 


Homo sapiens 


Human signal peptide containing 
protein HSPP-41 SEQ IDNO:41. 


315 


63 


1054 


AAU12201 


Homo sapiens 


Human PRO 1779 polypeptide 


1819 


54 
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sequence. 






1054 


AAB25594 


Homo sapiens 


Protein encoded by human secreted 
protein gene#l. 


1 ni n 

1819 


54 


1054 


gil234787 


Xenopus laevis 


up-regulated by thyroid hormone in 

11 1 . /v 11 • 

tadpoles; expressed specifically m 
the tail and only at metamorphosis; 
membrane bound or extracellular 
protein; C-termmal basic region 


1799 


53 


1055 


AAW55035 


Homo sapiens 


HPURR amino acid sequence. 


2014 


100 


1055 


AAW47066 


Homo sapiens 


Human brain P2X-1 receptor 
polypeptide. 


2014 


100 


1055 


gi4099139 


Homo sapiens 


P2X4 purinoreceptor 


2014 


1 AA 

100 


1056 


AAE03560 


Homo sapiens 


Human differentially expressed 
kidney cDNA 22360 encoded 
protein. 


1255 


O C 
OJ 


1056 


AAM42468 


Homo sapiens 


Human kidney related polypeptide 
SEQ ID NO 337. 


1 A C 

145 




1056 


AAM99653 


Homo sapiens 


Human excretory related polypeptide 
SEQ ID NO 390. 


1 /I c 

145 




1057 


ABB53267 


Homo sapiens 


Human polypeptide #7. 


3372 


AO 


1057 


AAO 14449 


T T * 

Homo sapiens 


Protein of human Zona Pellucida 1 
(Zzpl). 


330/ 


no 


1057 


gi972946 


Mus museums 


ZP1 precursor 




o / 


1058 


gil5779156 


Homo sapiens 


Similar to RIKEN cDNA 
1 8 10073N04 gene 


1858 


100 


1058 


gi 13 097045 


Mus museums 


fl • * 1 j T» TT7"n\T T\\T A 

Similar to RIKEN cDNA 
1810073N04 gene 


i /iy 


y i 


1058 


AAM79693 


T T * 

Homo sapiens 


Human protein SEQ ID JNU 333y. 


1 I JO 


1 on 

1UU 


1059 


AAM79993 


Homo sapiens 


Human protein SEQ JD NU 3o3y. 


1 / JO 


oy 


1059 


AAM79009 


Homo sapiens 


TT / _ : „ o T? /~\ TT"\ \Tr\ 1 /C7 1 

Human protein SEQ ID NU 16 / 1. 


1 /oo 


oy 


1059 


ABB 12000 


Homo sapiens 


Human prostaglandin DP receptor 
homologue, bbQ ID NU:23 /U, 


1736 


89 


1060 


AAU79946 


Homo sapiens 


Human transporter protein sequence. 




QQ 

yy 


1060 


AAE21181 


T T * 

Homo sapiens 


Human 1 KlCri-zj protein. 


ZD /Z 


Q1 

y l 


1060 


gi2811122 


Xenopus laevis 


NaDC-2 


1742 


54 


1061 


AAM79483 


Homo sapiens 


TT j • npA TT^\ \TA Ollfi 

Human protein SEQ ID NO 3129. 




OO 


1061 


AAM78499 


Homo sapiens 


Human protein SEQ ID NO 1 161 . 


1698 


88 


1061 


ABB11938 


Homo sapiens 


Human cystinosm homologue, SEQ 
ID NO:2308. 


1698 


o o 
OO 


1062 


gil2656590 


Danio reno 


P2x punnoceptor subumt 4 


72 


A A 
4U 


1063 


AAG68335 


Homo sapiens 


Human CSP2 protein SEQ ID NO:4. 


1354 


99 


1063 


gil9525540 


Homo sapiens 


lymphocyte effector toxicity 
activation hgand 


1330 


98 


1063 


AAY36071 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 456. 


1252 


92 


1064 


gil5277509 


Homo sapiens 


Similar to transmembrane 7 
superfamily member 2 


1548 


97 


1064 


ABP41089 


riomo sapiens 


Human ovarian anxigen rioiAjooo, 
SEQ ID NO:2221. 




1 00 

A \J\J 


1064 


gil8138238 


Bos taurus 


C-14 sterol reductase 


1376 


87 


1065 


AAM93346 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
2891. 


5017 


98 


1065 


AAM93761 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3754. 


4620 


99 


1065 


AAB92756 


Homo sapiens 


Human protein sequence SEQ ID 


2856 


99 
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NO:l l2lo. 






1066 


AAE23544 


Homo sapiens 


Human FAIL protein. 


1730 


99 


1066 


AAE23554 


Homo sapiens 


TT T~> A TT a_ * JJ.O 

Human FAIL protein #3. 


1 /ZO 


QQ 

yy 


, 1066 


AAE23556 


Homo sapiens 


Human FAIL protein #5. 


1726 


99 


1067 


gil8480772 


Mus mus cuius 


olfactory receptor MORlOl-2 


izo / 


oZ 


1067 


gil8479346 


Mus musculus 


olfactory receptor MORlOl-l 


1193 


79 


1067 


AAG72119 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1800. 


1043 


ol 


1068 


ABB89454 


Homo sapiens 


Human polypeptide SEQ ID NO 
1830. 






1068 


ABP41764 


Homo sapiens 


Human ovarian antigen HE6CR19, 
SEQ ID NO:2896. 


530 


95 


1068 


AAU04352 


Homo sapiens 


Mammalian toxicological response 
marker protein #4. 




yD 


1069 


AAY33300 


Homo sapiens 


Human hALK-2 clone HP53 protein. 


1572 


100 


1069 


AAR85206 


Homo sapiens 


Human ALK.-2. 


1572 


1 


1069 


gil381584 


Bos taurus 


activin receptor type I 


1572 


100 


1070 


gil6359163 


Homo sapiens 


Similar to RIKEN cDNA 
2310014B08 gene 


1332 


C\A 

94 


1070 


gi 18043464 


Mus musculus ! 


RIKEN cDNA 2310014B08 gene 


1226 


77 


1070 


AAB64401 


Homo sapiens 


Amino acid sequence of human 
intracellular signalling molecule 
INTRA33. 


212 


35 


1071 


AAW60043 


Homo sapiens 


Human MHC class I cham-related 
gene A (MICA) polypetide. 






1071 


gil405893 


Homo sapiens 


MHC class I chain-related protein A 


Io94 




1071 


gil6877353 


Homo sapiens 


MHC class I polypeptide-related 
sequence A 


1838 


90 


1072 


gil5292437 


Drosophila 
melanogaster 


LP10272p 




3y 


1072 


AAB80378 


Homo sapiens 


Secreted protein encoded by gene #8. 


210 


28 


1072 


AAY87336 


Homo sapiens 


Human signal peptide containing 
protein HSPP-1 13 SEQ ID NO: 1 13. 


Tin 

210 


ZO 


1073 


AAB58289 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 627. 


ZJUJ 


CO 


1073 


ABB55767 


Homo sapiens 


Human polypeptide SEQ ID NO 140. 




OU 


1073 


AAU39058 


Homo sapiens 


Human secreted protein pe584_2. 


2163 


86 


1074 


AAB58289 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 627. 


2303 


OO 


1074 


ABB55767 


Homo sapiens 


Human polypeptide SEQ ID NO 140. 


2163 


86 


1074 


AAU39058 


Homo sapiens 


Human secreted protein pe584_2. 


2163 


86 


1075 


AAM93703 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3632. 


1061 


93 


[1075 


AAE04780 


Homo sapiens 


Human vesicle trafficking protein-23 
(VETRP-23) protein. 


864 


100 


1075 


ABB08160 


Homo sapiens 


Human cytoskeleton-associated 
protein (CSAP)-4 (ID: 

"7/1 H1T)AC^T\ 1 


758 


87 


1076 


ABB06255 


Homo sapiens 


Human G protein-coupled receptor 
TGR17-6 protein SEQ ID NO: 15. 


1511 


100 


1076 


ABB06254 


Homo sapiens 


Human G protein-coupled receptor 
TGR17-5 protein SEQ ID NO:13. 


1511 


100 


1076 


ABB06252 


Homo sapiens 


Human G protein-coupled receptor 
TGR17-3 protein SEQ ID NO:7. 


1511 


100 


1 1077 


AAE16786 


Homo sapiens 


Human transporter and ion channel- 


2679 


99 
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23 (TRICH-23) protein. 






1077 


ABB08218 


Homo sapiens 


Human membrane transporter protein 
57256. 


1759 


68 


1077 


AAM38697 


Homo sapiens 


Human polypeptide SEQ ID NO 
1842. 


1249 


53 


1078 


AAB85029 


Homo sapiens 


Protein encoded by BAP28 cDNA 
consisting of exons 1 to 45. 


3444 


81 


1078 


AAW54099 


Homo sapiens 


Homo sapiens BAP28 sequence. 


2219 


88 


1078 


AAB92729 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11 159. 


1588 


92 


1079 


ABB07526 


Homo sapiens 


Human drug metabolizing enzyme 
(DME) (ID: 1962105CD1). 


2580 


99 


1079 


AAM93720 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3669. 


2331 


92 


1079 


gi57806 


Rattus sp. 


gamma-glutamyltranspeptidase (AA 
1-568) 


236 


28 


1080 


AAM78536 


Homo sapiens 


Human protein SEQ ID NO 1 198. 


6839 


100 


1080 


AAM79520 


Homo sapiens 


Human protein SEQ ID NO 3166. 


6820 


99 


1080 


ABG40303 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
29968. 


2092 


100 


1081 


gi20809440 


Homo sapiens 


sterol O-acyltransferase (acyl- 
Coenzyme A: cholesterol 
acyltransferase) 1 


1783 


65 


1081 


gi4878022 


Homo sapiens 


acyl-coenzyme A: cholesterol 
acyltransferase 


1779 


65 


1081 


AAW38416 


Homo sapiens 


Human acyl-coenzyme A: cholesterol 
acyltransferase I. 


1774 


65 


1082 


gi22002433 


Homo sapiens 


pi 50 target of rapamycin (TOR)- 
scaffold protein containing WD- 
repeats 


7014 


100 


1082 


gi2 1979456 


Homo sapiens 


raptor 


7014 


100 


1082 


gi22002435 


Mus museums 


pi 50 target of rapamycin (TOR)- 
scaffold protein containing WD- 
repeats 


6819 


96 


1083 


AAY33301 


Homo sapiens 


Human hALK-3 clone ONF5 protein. 


2647 


89 


1083 


AAR85207 


Homo sapiens 


Human ALK-3. 


2647 


89 


1083 


AAR55368 


Homo sapiens 


Human Activin receptor-like kinase 3 
(hALK-3). 


2647 


89 


1084 


AAY33301 


Homo sapiens 


Human hALK-3 clone ONF5 protein. 


1829 


92 


1084 


AAR85207 


Homo sapiens 


Human ALK-3. 


1829 


92 


1084 


AAR55368 


Homo sapiens 


Human Activin receptor-like kinase 3 
(hALK-3). 


1829 


92 


1085 


AAW90873 


Homo sapiens 


Human brain- specific dysferlin 
protein. 


1329 


53 


1085 


AAW90868 


Homo sapiens 


Human dysferlin protein. 


1329 


53 


1085 


AAY82643 


Homo sapiens 


Human dysferlin protein sequence 
SEQ ID NO:2. 


1329 


53 


1086 


gil9343765 


Mus musculus 


Similar to dysferlin 


1860 


47 


1086 


ABB89615 


Homo sapiens 


Human polypeptide SEQ ID NO 
1991. 


1853 


47 


1086 


gi6731235 


Homo sapiens 


myoferlin 


1853 


47 


1087 


AAY92321 


Homo sapiens 


Human alpha-2-delta-D calcium 
channel subunit. 


5881 


99 


1087 


AAB62262 


Homo sapiens 


Human calcium channel alpha2delta 


5742 


99 
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subunit. 






1087 


AAU01038 


Homo sapiens 


Human secreted soluble alpha2delta 
calcium channel subunit #18 protein. 


5742 


99 


1088 


gil 8676422 


Homo sapiens 


FLJ00088 protein 


3492 


96 


1088 


ABP51380 


Homo sapiens 


Human MDDT SEQ ID NO 402. 


2083 


99 


1088 


gi2 104689 


Mus mus cuius 


alpha glucosidase II, alpha subunit 


1987 


53 


1089 


AAY01143 


Homo sapiens 


Secreted protein encoded by gene 9 
clone HSIDY06. 


238 


100 


1090 


AAY81261 


Homo sapiens 


Human DNA structure-specific 
recognition protein 1 (SSRP1). 


3683 


100 


1090 


AAW39212 


Homo sapiens 


Human SSRP1 protein. 


3683 


100 


1090 


AAR38744 


Homo sapiens 


Human SSRP. 


3683 


100 


1091 


gil77814 


Homo sapiens 


alpha- 1 -antitrypsin-r elated protein 


1922 


90 


1091 


gil5990507 


Homo sapiens 


Similar to serine (or cysteine) 
proteinase inhibitor, clade A (alpha- 1 
antiproteinase, antitrypsin), member 
1 


1408 


66 


1091 


AAY26925 


Homo sapiens 


Human alpha 1-anti-trypsin type Ml 
protein. 


1407 


66 


1092 


gil 6877 139 


Homo sapiens 


Similar to RIKEN cDNA 
13 000 19N 10 gene 


2273 


100 


1092 


AAB56819 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1397. 


1054 


100 


1092 


AAM95577 


Homo sapiens 


Human reproductive system related 
antigen SEQ ID NO: 4235. 


627 


91 


1093 


gil 86055 12 


Homo sapiens 


Similar to CAP-binding protein 
complex interacting protein 2 


1523 


100 


1093 


AAO06814 


Homo sapiens 


Human polypeptide SEQ ID NO 
20706. 


1314 


100 


1093 


AAY57946 


Homo sapiens 


Human transmembrane protein 
HTMPN-70. 


1128 


100 


1094 


AAM93603 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3418. 


2973 


99 


1094 


gil9571657 


Caenorhabditis 
elegans 


similar to Yeast YEH4 like protein 


964 


41 


1094 


ABB 89291 


Homo sapiens 


Human polypeptide SEQ ID NO 
1667. 


590 


85 


1095 


gi4959568 


Homo sapiens 


nuclear pore complex interacting 
protein NPIP 


1650 


94 


1095 


AAO 17206 


Homo sapiens 


Human secreted protein SEQ ID NO: 
105. 


1336 


79 


1095 


ABB90262 


Homo sapiens 


Human polypeptide SEQ ID NO 
2638. 


872 


69 


1096 


gil8031730 


Homo sapiens 


GK006 


1405 


98 


1096 


AAB92609 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 10874. 


1074 


99 


1096 


AAM40309 


Homo sapiens 


Human polypeptide SEQ ID NO 
3454. 


1074 


99 


1097 


AAG03767 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7848. 


612 


90 


1097 


AAB43694 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1 1 39. 


612 


90 


1097 


gi285910 


Homo sapiens 


ATP synthase subunit c precursor 


612 


90 


1098 


gi897827 


Homo sapiens 


iron-responsive element-binding 
protein/iron regulatory protein 2 


4968 


99 
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1098 


AAE 19851 


Homo sapiens 


Human wild-type IRP-2 protein. 


4909 


99 


1098 


gi897581 


Homo sapiens 


iron-regulatory protein 2 


4909 


99 


1099 


gi3551150 


Canine 
herpesvirus 


immediate-early protein 


77 


28 


1099 


gi437051 


Acipenser 
transmontanus 


vitellogenin 


76 


23 


1099 


gi21539886 


Arabidopsis 
thaliana 


transcription activator 


75 


26 


1100 


AAM84273 


Homo sapiens 


Human immune/haematopoietic 
antigen SEQ ID NO: 11 866. 


328 


92 


1100 


gi3551150 


Canine 
herpesvirus 


immediate-early protein 


77 


28 


1100 


gi535260 


Plasmodium 
reichenowi 


STARP antigen 


76 


21 


1101 


gi3551150 


Canine 
herpesvirus 


immediate-early protein 


77 


28 


1101 


gi437051 


Acipenser 
transmontanus 


vitellogenin 


76 


23 


1101 


gi535260 


Plasmodium 
reichenowi 


STARP antigen 


75 


21 


1102 


AAU12296 


Homo sapiens 


Human PR07171 polypeptide 
sequence. 


168 


67 


1102 


ABB85001 


Homo sapiens 


Human PR028631 protein sequence 
SEO ID NO:370. 


90 


40 


1102 


ABB95607 


Homo sapiens 


Human angiogenesis related protein 
PR028631 SEQ ID NO: 370. 


90 


40 


1103 


gil 1558264 


Homo sapiens 


sphingosine- 1 -phosphatase 


2032 


89 


1103 


gil3447199 


Homo sapiens 


sphingosine- 1 -phosphate 
phosphatase 


1994 


87 


1103 


gil 5778670 


Mus musculus 


sphingosine- 1 -phosphate 
phosphatase 


1721 


76 


1104 


ABB72215 


Homo sapiens 


Human protein isolated from skin 
cells SEO ID NO: 331. 


*t r* A A 

1544 


100 


1104 


ABB72150 


Homo sapiens 


Human protein isolated from skm 
cells SEO ID NO: 189. 


1 C A A 

1544 


1 A A 

1U0 


1104 


ABB 84843 


Homo sapiens 


Human PRO301 protein sequence 
SEQ ID NO:54. 


1 C/l/l 

1544 


1 nn 


1105 


AAG03757 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7838. 


506 


1 An 


1105 


gil78836 


Homo sapiens 


apolipoprotem C-II 


506 


1 An 


1105 


gi757915 


Homo sapiens 


apoCII protein 


506 


100 


1106 


AAU97773 


Homo sapiens 


Human Fortihn polypeptide. 


794 


97 


1106 


AAR55698 


Homo sapiens 


Tumor protein p2 1 . 


794 


97 


1106 


gil 8482460 


Sus scrofa 


translationally controlled tumor 
protein 


794 


97 


1107 


gil2082725 


Mus musculus 


B cell phosphoinositide 3-kinase 
adaptor 


3519 


84 


1107 


gil2082723 


Gallus gallus 


B cell phosphoinositide 3-kinase 
adaptor 


ZovO 




1107 


gi20987486 


Homo sapiens 


similar to B cell phosphoinositide 3- 
kinase adaptor 


1829 


97 


1108 


gi21708117 


Homo sapiens 


similar to hepatocellular carcinoma- 
associated antigen HCA557b 


318 


37 


1108 


gil8252514 


Homo sapiens 


hepatocellular carcinoma-associated 
antigen HCA557b 


318 


37 
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1108 


gi20071260 


Mus museums 


RIKEN cDNA 2310038H17 gene 


306 


37 


1109 


gi 150765 11 


* 

Homo sapiens 


nonmuscle myosin light chain 2 


001 

881 


99 


1109 


gi 13436446 


Homo sapiens 


myosin regulatory light chain 


881 


99 


1109 


gi22137716 


Mus museums 


myosin regulatory light chain 


n n 1 

881 


99 


1110 


gil7391357 


Homo sapiens 


FXYD domam-contaming ion 
transport regulator 7 


390 


96 


1110 


gil9354238 


Mus musculus 


FXYD domain- containing ion 
transport regulator 1 


153 


45 1 


1110 


gi4206711 


Mus museums 


phospholemman precursor 


153 


45 


1111 


AAM49040 


Homo sapiens 


Human testicular development- 
specific protein 10 (NYD-SP10). 


1486 


85 


1111 


gil 3272522 


Homo sapiens 


transcription factor NYD-splO 


1 A O/Z 

14oo 




1111 


gi2 1040409 


Homo sapiens 


regulatory factor X, 4 (influences 
HLA class II expression) 


1486 


85 


1112 


gi20269720 


Homo sapiens 


neuropilin and tolloid like-1 


1894 


99 


1112 


gi20269724 


Mus musculus 


neuropilin and tolloid like-1 


lo39 


90 


1112 


ABB55774 


Homo sapiens 


Human polypeptide SEQ ID NO 154. 


1057 


58 


1113 


AAB61150 


Homo sapiens 


Human NOV19 protein. 


758 


98 


1113 


AAB61149 


Homo sapiens 


Human NOV18 protein. 


758 


98 


1113 


AAY33297 


Homo sapiens 


Human membrane spanning protein 
MSP-4. 


758 


98 


1114 


ABB90021 


Homo sapiens 


Human polypeptide SEQ ID NO 
2397. 


476 


89 


1114 


ABB 11 874 


Homo sapiens 


Human secreted protein homologue, 
SEQ ID NO:2244. 


476 


89 


1114 


AAY94914 


Homo sapiens 


Human secreted protein clone 
pw337_6 protein sequence SEQ ID 
NO:34. 


476 


89 


1115 


AAG72407 


Homo sapiens 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2088. 


1281 


100 


1115 


AAG72267 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1948. 


1281 


100 


1115 


gi21928991 


Homo sapiens 


seven transmembrane helix receptor 


1274 


99 


1116 


AAU80496 


Homo sapiens 


Human G-coupled receptor 
(GCREC) protein, Seq ID No 4. 


1905 


98 


1116 


ABP51568 


Homo sapiens 


Human G protein coupled receptor 
SEQ ID NO: 18. 


1543 


98 


1116 


AAU85 147 


Homo sapiens 


G-coupied olfactory receptor #8. 


1538 


100 


1117 


gi5802817 


Homo sapiens 


envelope protein 


479 


77 


1117 


gi3 150438 


Human 
endogenous 
retrovirus K 


pol-env 


466 


77 


1117 


gi9558705 


Homo sapiens 


envelope 


466 


77 


1118 


AAE15241 


Homo sapiens 


Human RNA metabolism protem-4 
(RMEP-4). 


514 


100 


1118 


ABB89053 


Homo sapiens 


Human polypeptide SEQ ID NO 
1429. 


514 


100 


1118 


AAG89341 


tr *■* ■ 

Homo sapiens 


Human secreted protein, SEQ ID 
NO: 461. 


DUX 


OQ 

yy 


1119 


AAE23979 


Homo sapiens 


Human LP217 secreted protein. 


4563 


50 


1119 


AAB20155 


Homo sapiens 


Secreted protein SECP1 . 


4522 


50 


1119 


AAM39295 


Homo sapiens 


Human polypeptide SEQ ID NO 
2440. 


4518 


50 


1120 


AAB28199 


Homo sapiens 


Human HMG-17 non histone 
chromosomal protein. 


429 


94 
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1120 


gi32329 


Homo sapiens 


put. HMG-17 protein 


429 


94 


1120 


gi306864 


Homo sapiens 


high mobility group protein 17 


429 


94 


1121 


ABP43105 


Homo sapiens 


Human ovarian antigen HVCBB19, 
SEQ ID NO:4237. 


456 


69 


1121 


AAE 13797 


Homo sapiens 


Human lung tumour-specific protein 
SALT-T8. 


456 


69 


1121 


AAB44456 


Homo sapiens 


Human lung tumour- specific antigen 
encoded by cDNA #71. 


456 


69 


1122 


AAM93711 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3650. 


2974 


99 


1122 


ABB89767 


Homo sapiens 


Human polypeptide SEQ ID NO 
2143. 


2214 


97 


1122 


gi7303971 


Drosophila 
melanogaster 


CG8230-PA 


1154 


41 


1123 


AAU76036 


Homo sapiens 


Human sugar transporter- 1 (HST-1) 
protein sequence. 


1055 


99 


1123 


AAB60112 


Homo sapiens 


Human transport protein TPPT-32. 


775 


100 


1123 


AAB61903 


Homo sapiens 


Atherosclerosis-associated 
polypeptide. 


380 


48 


1124 


AAR28120 


Homo sapiens 


NKG2 transmembrane protein-D. 


725 


95 


1124 


ABB 11 846 


Homo sapiens 


Human integral membrane protein 
homologue, SEQ ID NO:2216. 


722 


94 


1124 


gi35063 


Homo sapiens 


Type II integral membrane protein 


722 


94 


1125 


AAM78418 


Homo sapiens 


Human protein SEQ ID NO 1080. 


1878 


94 


1125 


gi215 18639 


Homo sapiens 


TSLCl-like2 


1870 


97 


1125 


gil9068139 


Mus musculus 


membrane glycoprotein 


1849 


96 


1126 


AAB94738 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15776. 


3079 


99 


1126 


AAM41695 


Homo sapiens 


Human polypeptide SEQ ID NO 
6626. 


2456 


99 


1126 


AAM39909 


Homo sapiens 


Human polypeptide SEQ ID NO 
3054. 


1272 


100 


1127 


AAB75594 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 37 SEQ ID 
NO:148. 


678 


99 


1127 


AAB80437 


Homo sapiens 


Gene #20 associated peptide #1. 


381 


98 


1127 


AAM78175 


Homo sapiens 


Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
38481. 


365 


100 


1128 


gi291529 


Mouse 

cytomegalovirus 
1 


tegument protein 


89 


24 


1128 


gil4573798 


Caenorhabditis 
elegans 


C. elegans SRD-60 protein 
(corresponding sequence C13B7.3) 


85 


28 


1128 


gil91992 


Mus musculus 


APC 


79 


20 


1129 


gi20987535 


Mus musculus 


RIKEN cDNA 3300002C04 gene 


741 


67 


1129 


gi20799661 


Mus musculus 


mucolipin-2 


741 


67 


1129 


AAB93412 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12616. 


632 


55 


1130 


gil9354289 


Mus musculus 


RIKEN cDNA 2010107G23 gene 


111 


42 


1130 


ABB79328 


Homo sapiens 


Human ovary specific protein SEQ 
ID NO: 125. 


107 


42 


1130 


gil5488920 


Homo sapiens 


Similar to RIKEN cDNA 
2010107G23 gene 


107 


42 


1131 


AAU70675 


Homo sapiens 


Human otoferlin. 


2432 


42 
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1131 


gi4588470 


Homo sapiens 


otoferlin 


2432 


42 


1131 


AAU70673 


Homo sapiens 


Human otoferlin #2. 


2420 


42 


1132 


gi20071179 


Homo sapiens 


monocyte to macrophage 
differential on-associated 


1032 


77 


1132 


gi 1006665 


Homo sapiens 


expression associated with monocyte 
to macrophage differentiation 


1028 


77 


1132 


gil 83 14462 


Mus musculus 


monocyte to macrophage 
differentiation-associated 


1028 


77 


1133 


AAU95752 


Homo sapiens 


Human olfactory and pheromone G 
protein-coupled receptor #239. 


1522 


92 


1133 


AAU85278 


Homo sapiens 


G-coupled olfactory receptor #139. 


1499 


91 


1133 


AAU24658 


Homo sapiens 


Human olfactory receptor 
AOLFR156. 


1499 


91 


1134 


AAE13275 


Homo sapiens 


Human transporters and ion channels 
(TRICH)-2. 


1472 


80 


1134 


gil7384411 


Homo sapiens 


bA251017.3 (similar to aquaporin 7) 


1469 


80 


1134 


AAY70455 


Homo sapiens 


Human membrane channel protein-5 
(MECHP-5). 


1295 


73 


1135 


ABG42409 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
32074. 


317 


100 


1135 


AAM32826 


Homo sapiens 


Peptide #6863 encoded by probe for 
measuring placental gene expression. 


317 


100 


1135 


AAM19638 


Homo sapiens 


Peptide #6072 encoded by probe for 
measuring cervical gene expression. 


317 


100 


1136 


AAB47977 


Homo sapiens 


BCY5. 


2663 


99 


1136 


gi2463632 


Homo sapiens 


monocarboxylate transporter 
homologue MCT6 


2574 


97 


1136 


gi21265165 


Homo sapiens 


solute carrier family 16 
(monocarboxylic acid transporters), 
member 7 


602 


31 


1137 


ABB08456 


Homo sapiens 


Human tumour specific antigenic 
peptide #2. 


705 


94 


1137 


AAU08592 


Homo sapiens 


Human V-ATPase 16kDa subunit. 


705 


94 


1137 


gil 4424534 


Homo sapiens 


ATPase, H+ transporting, lysosomal 
(vacuolar proton pump) 16kD 


705 


94 


1138 


gil6741167 


Mus musculus 


RIKEN cDNA 0610010D20 gene 


1506 


87 


1138 


gil5080314 


Homo sapiens 


Similar to RIKEN cDNA 
0610010D20 gene 


514 


100 


1138 


gil 05 80053 


Halobacterium 
sp. NRC-1 


dihydrodipicolinate synthase; DapA 


375 


33 


1139 


AA014199 


Homo sapiens 


Human transporter and ion channel 
TRICH-16. 


1425 


85 


1139 


ABB80588 


Homo sapiens 


Human sbgl020829SGLT protein. 


1425 


85 


1139 


AAE06614 


Homo sapiens 


Human protein having hydrophobic 
domain, HP 03 974. 


1425 


85 


1140 


ABB90752 


Homo sapiens 


Human Tumour Endothelial Marker 
polypeptide SEQ ID NO 236. 


1890 


76 


1140 


gi6708478 


Mus musculus 


formin-like protein 


1559 


65 


1140 


gi4101720 


Mus musculus 


lymphocyte specific formin related 
protein 


1532 


65 


1141 


AAB94131 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 14389. 


996 


96 


1141 


gi497984 


Bos taurus 


Ac45 


317 


39 


1141 


AAM93671 


Homo sapiens 


Human polypeptide, SEQ ID NO: 


307 


36 
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3556. 






1142 


AAG67137 


Homo sapiens 


Amino acid sequence of a human 
enzyme. 


2701 


96 


1142 


gi21707691 


Mus musculus 


RIKEN cDNA 1500002020 gene 


2645 


93 


1142 


AAB94358 


Homo sapiens 


Human protein sequence SEQ ID 
NO:14883. 


2579 


96 


1143 


AAW54370 


Homo sapiens 


G-protein coupled receptor 
HLTEX11. 


1815 


100 


1143 


AAB64854 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 36 SEQ ID 
NO: 140. 


1792 


100 


1143 


AAB64853 


Homo sapiens 


Gene 36 human secreted protein 
homologous amino acid sequence 
#139. 


1792 


100 


1144 


ABB90324 


Homo sapiens 


Human polypeptide SEQ ID NO 
2700. 


1321 


100 


1144 


AAU82004 


Homo sapiens 


Human secreted protein SECP30. 


989 


78 


1144 


AAM95005 


Homo sapiens 


Human reproductive system related 
antigen SEQ ID NO: 3663. 


548 


85 


1145 


AAY13458 


Homo sapiens 


Amino acid sequence of human 
Fe65. 


3759 


100 


1145 


gi2734083 


Homo sapiens 


stat-like protein 


3759 


100 


1145 


gi3924936 


Homo sapiens 


Fe65 protein 


3759 


100 


1146 


AAB08900 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 10 SEQ ID NO:57. 


845 


90 


1146 


AAY27071 


Homo sapiens 


Human JWA protein. 


845 


90 


1146 


AAW75110 


Homo sapiens 


Human secreted protein encoded by 
gene 54 clone HETGL41. 


845 


90 


1147 


AAM93733 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3697. 


764 


87 


1147 


gi2970431 


Florometra 
serratissima 


NADH dehydrogenase subunit 4 


91 


31 


1147 


gi 15 128604 


Inversidens 
japanensis 


NADH dehydrogenase subunit 4 


77 


29 


1148 


AAB93562 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12957. 


2402 


100 


1148 


gi2 1626993 


Drosophila 
melanogaster 


CG15078-PA 


990 


39 


1148 


gil 7945442 


Drosophila 
melanogaster 


RE18318p 


990 


39 


1149 


ABB89832 


Homo sapiens 


Human polypeptide SEQ ID NO 
2208. 


1640 


99 


1149 


ABB89833 


Homo sapiens 


Human polypeptide SEQ ID NO 
2209. 


838 


97 


1149 


gil6359249 


Mus musculus 


RIKEN cDNA 1300010M03 gene 


630 


36 


1150 


ABB 89832 


Homo sapiens 


Human polypeptide SEQ ID NO 
2208. 


1640 


99 


1150 


ABB89833 


Homo sapiens 


Human polypeptide SEQ ID NO 
2209. 


838 


97 


1150 


gil6359249 


Mus musculus 


RIKEN cDNA 1300010M03 gene 


564 


35 


1151 


ABB78999 


Homo sapiens 


Human 9-27 protein sequence SEQ 
ID NO:4483. 


569 


93 


1151 


gil 177476 


Homo sapiens 


interferon-inducible protein 


569 


93 


1151 


gil 2654 159 


Homo sapiens 


interferon induced transmembrane 
protein 1 (9-27) 


569 


93 
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1152 


AAU85153 


Homo sapiens 


G-coupled olfactory receptor #14. 


1730 


100 


1152 


AAU24529 


Homo sapiens 


Human olfactory receptor AOLFR14. 


1730 


100 


1152 


gi21928719 


Homo sapiens 


seven transmembrane helix receptor 


1630 


100 


1153 


gi784997 


Homo sapiens 


homologue to Drosophila tumour 
supressor gene 


5300 


95 


1153 


g i 1944491 


Homo sapiens 


homologue of the murine Llglh gene 


4891 


90 


1153 


gi22347352 


Rattus 
norvegicus 


RGL1 


4878 


88 


1154 


AAM79219 


Homo sapiens 


Human protein SEQ ID NO 1881. 


325 


64 


1155 


AAG63804 


Homo sapiens . 


Amino acid sequence of a human 
amino acid transporter. 


2487 


100 


1155 


gi9309293 


Homo sapiens 


asc-type amino acid transporter 1 


2487 


100 


1155 


gil5277644 


Homo sapiens 


amino acid transporter 


2487 


100 


1156 


gi6760373 


Homo sapiens 


ODZ3 


2323 


100 


1156 


gi4760780 


Mus musculus 


Ten-m3 


2248 


96 


1156 


AAU09891 


Homo sapiens 


Human heregulin, gamma-HRG. 


1202 


52 


1157 


AAY51559 


Homo sapiens 


Human RGDI protein. 


724 


81 


1157 


AAW74804 


Homo sapiens 


Human secreted protein encoded by 
gene 75 clone HBIAB39. 


724 


81 


1157 


gi4877285 


Homo sapiens 


prenylated Rab acceptor 1 (PRA1) 


724 


81 


1158 


gil780976 


Human 
endogenous 
retrovirus K 


protease 


912 


59 


1158 


gi5 802824 


Homo sapiens 


Gag-Pro-Pol protein 


909 


59 


1158 


gi5802814 


Homo sapiens 


Gag-Pro-Pol-Env protein 


906 


58 


1159 


AAY73339 


Homo sapiens 


HTRM clone 2056042 protein 
sequence. 


867 


80 


1159 


gil3111941 


Homo sapiens 


vesicle-associated soluble NSF 
attachment protein receptor (v- 
SNARE; homolog of S. cerevisiae 
VTI1) 


867 


80 


1159 


gi3861488 


Homo sapiens 


vesicle soluble NSF attachment 
protein receptor VTI2 


867 


80 


1160 


gi 19228 91 


Mus musculus 


alpha 3B chain of laminin-5 


10336 


75 


1160 


gi5777581 


Homo sapiens 


alpha 3B chain of laminin-5 


9398 


99 


1160 


AAB48459 


Homo sapiens 


Human laminin 5 polypeptide, SEQ 
ID NO: 8. 


8690 


100 


1161 


ABP43534 


Homo sapiens 


Human secreted protein (SCEP) 58. 


1984 


91 


1161 


AAG67516 


Homo sapiens 


Amino acid sequence of a human 
secreted polypeptide. 


1984 


91 


1161 


AAE01167 


Homo sapiens 


Human gene 4 encoded secreted 
protein HKAAV61, SEQ ID NO:68. 


1984 


91 


1162 


AAM42034 


Homo sapiens 


Human polypeptide SEQ ID NO 
6965. 


900 


99 


1162 


AAM40248 


Homo sapiens 


Human polypeptide SEQ ID NO 
3393. 


821 


100 


1162 


AAP60254 


Homo sapiens 


Interferon-pseudo-omega-2. 


800 


98 


1163 


AAB 19924 


Homo sapiens 


Human interleukin-1 Hy2 (long 
version). 


818 


100 


1163 


AAB 19923 


Homo sapiens 


Human interleukin-1 Hy2 (extended 
form, partial sequence). 


818 


100 


1163 


AAB 19922 


Homo sapiens 


Human interleukin-1 Hy2 (short 
version). 


818 


100 


1164 


AAB37512 


Homo sapiens 


Human VEGF C subunit SEQ ID 
NO: 49. 


1061 


90 
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1164 


AAY57035 


Homo sapiens 


Human A215 amino acid sequence. 


1061 


90 


1164 


AAY94803 


TT ' 

Homo sapiens 


TT \ rn/^p 

Human VEGF. 


1061 


90 


1165 


ABB84974 


Homo sapiens 


Human PR04342 protein sequence 
SEQ ID NO:316. 


170 


99 


1165 


ABB95580 


Homo sapiens 


TT ' * 1 J. J _ i ■ 

Human angiogenesis related protein 
rK04342 bbQ ID JNU: 310. 


770 


99 


1165 


AAB66664 


IT _ • 

Homo sapiens 


Protein encoded by extended 

DOT TT7T _T\\T A 1 ; U,. r , ,. T r 

HZnrL&zvw cDJNA library 
sequence #2, 


770 


99 


1166 


gil321816 


Gorilla gorilla 


interleukin-8 receptor type B 


602 


90 


1 166 


A A T T1 f\C C~l 

AAU10557 


Homo sapiens 


Human mterleulan 8 receptor beta 
(l-LaKB) polypeptide. 




no 


1166 


AAU80482 


Homo sapiens 


Human CXCR2 receptor #1. 


599 


88 


1167 


gi 1160967 


T T _ • 

Homo sapiens 


palmitoyl-protein thioesterase 


1 /I /^O 

1463 


92 


1167 


gil3 14355 


Homo sapiens 


palmitoyl protein thioesterase 


1463 


92 


1167 


gil4250054 


Homo sapiens 


palmitoyl-protem thioesterase 1 
(ceroid-lipofuscinosis, neuronal 1, 
infantile) 


1463 


92 


1168 


gil77814 


Homo sapiens 


alpha- 1-antitrypsin-related protein 


1952 


90 


1168 


AAY78890 


Homo sapiens 


Human alphal -antitrypsin ammo 
acid sequence. 


1450 


69 


1168 


AAW56709 


T T • 

Homo sapiens 


Ammo acid sequence of the alpha- 1- 
antitrypsin. 


1450 


69 


1169 


AAO 12931 


Homo sapiens 


TT 1 i * J OT" 1 /"V TT^V 'X.Ty^W 

Human polypeptide SEQ ID NO 
26823. 


346 


100 


1169 


AAO02697 


T T * 

Homo sapiens 


Human polypeptide SEQ ID NO 
16589. 


143 


66 


1169 


AAO08307 


Homo sapiens 


Human polj^peptiae SEQ ID NO 
22199. 


137 




1170 


AAR15222 


T T * 

Homo sapiens 


Chronic myelogenous leukaemia- 
derived myeloid-related protein. 


635 


1 AA 

100 


1170 


• o i /i aa 

gi32402 


Homo sapiens 


TTT> 1 f A A 1 r\ /I A 

HP-1 (AA 1-94) 


493 


1 AA 
100 


t 1 ta 

H70 


gil81529 


TT * 

Homo sapiens 


defensm 1 




1 AA 
1UU 


1171 


A AT Trt O 1 1 

AAU98513 


T T • 

Homo sapiens 


Transmembrane receptor-like protein 
pTB2185. 


71 


A C 


1171 


AAU98512 


Homo sapiens 


Transmembrane receptor-like protein 
pTB2184. 


71 


46 


1171 


gi 100 1697 


Synechocystis 
sp. PCC 6803 


sensory transduction histidme kinase 


67 


37 


1172 


A A 1 — \ *t A A e* A 

A AE 14454 


Homo sapiens 


T T . ■ 1 1 , 

Human protein phosphatase-4. 


2131 


A A 

99 


1172 


gi2665458 


Mus musculus 


protein-tyrosine-phosphatase 


1361 


63 


1172 


gi22328117 


Homo sapiens 


similar to protem-tyrosine- 
phosphatase homolog 
DKFZp566K0524.l - human 
(fragment) 


729 


96 


1173 


AAM25683 


Homo sapiens 


Human protein sequence SEQ ID 

X Ty~\ 1 1 AO 

NO:ll98. 


956 


a^t i 

96 


1 1 TO 


A A QOA/C 

AAY4o22o 


Homo sapiens 


Human prostate cancer-associatea 
protein 12. 




A£T 


1173 


ABP41699 


Homo sapiens 


Human ovarian antigen HFTCG52, 
SEQ ID NO:2831. 


950 


94 


1174 


gi20379895 


Homo sapiens 


Similar to small inducible cytokine 
A4 


124 


77 


1174 


AAM52447 


Homo sapiens 


HIV Nefl fusion protein #14. 


122 


100 


1174 


AAM52446 


Homo sapiens 


HIV Nefl fusion protein #13. 


122 


100 
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1175 


gi20379895 


Homo sapiens 


Similar to small inducible cytokine 

1 A 

A4 


305 


100 


1175 


AAM52447 


Homo sapiens 


HIV Nefl fusion protein #14. 


123 


92 


1175 


AAM52446 


Homo sapiens 


HIV Nefl fusion protein #13. 


123 


92 


1176 


AAG03315 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7396. 


314 


100 


1176 


gil6415877 


Octopus salutii 


cytochrome oxidase subunit III 


70 


28 


1176 


gi602047 


Octopus 
albescens 


cytochrome oxidase subunit III 


69 


28 


1177 


AAG03757 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7838. 


453 


89 


1177 


gil78836 


Homo sapiens 


apolipoprotein C-II 


453 


89 


1177 


gi757915 


Homo sapiens 


apoCII protein 


453 


89 


1178 


AAO07986 


Homo sapiens 


Human polypeptide SEQ ID NO 
21878. 


73 


31 


1178 


gi4884689 


Neisseria 
meningitidis 


lactoferrin-binding protein precursor 


70 


34 


1178 


gil72002 


Saccharomyces 
cerevisiae 


homologue of bacterial MutS protein 


69 


24 


1179 


AAB60502 


Homo sapiens 


Human cell cycle and proliferation 
protein CCYPR-50, SEQ ID NO:50. 


1656 


92 


1179 


AAB12144 


Homo sapiens 


Hydrophobic domain protein isolated 
from WERI-RB cells. 


1649 


92 


1179 


gi 19880267 


Homo sapiens 


metallo phosphoesterase 


1649 


92 


1180 


AAU91402 


Homo sapiens 


Human secreted protein sequence 
#55. 


435 


100 


1180 


AAU91376 


Homo sapiens 


Human secreted protein sequence 
#29. 


435 


100 


1180 


AAU91352 


Homo sapiens 


Human secreted protein sequence #5. 


435 


100 


1181 


AAG01183 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5264. 


275 


94 


1181 


AAO04719 


Homo sapiens 


Human polypeptide SEQ ID NO 
18611. 


99 


36 


1181 


AAO03100 


Homo sapiens 


Human polypeptide SEQ ID NO 
16992. 


99 


42 


1182 


ABB 12063 


Homo sapiens 


■WW , 1 J * 1 1 

Human secreted protein homologue, 
SEQ ID NO:2433. 


326 


1 a r\ 
100 


1182 


AAE06730 


Homo sapiens 


Human CASB765 protein. 


200 


100 


1182 


AAU81960 


Homo sapiens 


Human PR0536. 


141 


81 


1183 


AAB75349 


Homo sapiens 


Human secreted protein #8. 


748 


9b 


1183 


AAG74502 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:5266. 


748 


98 


1183 


AAU27661 


Homo sapiens 


Human protein AFP485790. 


748 


98 


1184 


ABB84887 


Homo sapiens 


Human PR0791 protein sequence 
SEQ ID NO: 142. 


1025 


90 


1184 


ABB90305 


Homo sapiens 


Human polypeptide SEQ ID NO 
2681. 


1025 


90 


1184 


ABB95493 


Homo sapiens 


Human angiogenesis related protein 
PR0791 SEQ ID NO: 142. 




on 


1185 


gi2745733 


Homo sapiens 


S2P 


75 


36 


1185 


gi4164134 


Homo sapiens 


SP2 metalloprotease 


75 


36 


1185 


gil5865970 


Varanus 
spenceri 


NADH dehydrogenase subunit 2 


73 


27 


1186 


AAU69480 


Homo sapiens 


Human purified secretory 
polypeptide #49. 


283 


92 
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1186 


gi7770223 


Homo sapiens 


PR02714 


28 1 


84 


1187 


ABB09144 


Homo sapiens 


Human phospholipase A2 protein 
cPLA2 SEQ ID NO:l. 


4446 


99 


1187 


ABB07494 


Homo sapiens 


Human lipid metabolism molecule 
(LMM) polypeptide (ID: 
1281946CD1). 


4407 


99 


1187 


AAE05958 


Homo sapiens 


Human phospholipase-like protein 
#3. 


4215 


99 


1188 


gil47 17809 


Mus musculus 


capsule seleno-protein 


96 


30 


1188 


gil4717800 


Mus musculus 


seleno-protein 


96 


30 


1188 


gi2992470 


Mus sp. 


mitochondrial capsule selenoprotein; 
MCS 


96 


30 


1189 


AAE05095 


Homo sapiens 


Human inter-alpha trypsin inhibitor 
(ITI) light chain. 


1775 


94 


1189 


AAM79366 


Homo sapiens 


Human protein SEQ ID NO 3012. 


1775 


94 


1189 


AAM78382 


Homo sapiens 


Human protein SEQ ID NO 1044. 


1775 


94 


1190 


gi673422 


Homo sapiens 


T-cell receptor alpha-chain 


1323 


93 


1190 


ABB95403 


Homo sapiens 


Human P501S specific T cell clone 
4E5 Va chain protein SEQ ID NO 
906. 


1095 


77 


1190 


AAM01298 


Homo sapiens 


P501S-specific T cell clone 4E5 Va 
chain T cell receptor amino acid. 


1095 


77 


1191 


gi673422 


Homo sapiens 


T-cell receptor alpha-chain 


859 


95 


1191 


gi623119 


Macaca mulatta 


T-cell receptor alpha 


603 


86 


1191 


ABB95403 


Homo sapiens 


Human P501S specific T cell clone 
4E5 Va chain protein SEQ ID NO 
906. 


593 


65 


1192 


gi53861 


Mus musculus 


Q300 protein (AA 1-77) 


69 


41 


1193 


ABB89241 


Homo sapiens 


Human polypeptide SEQ ID NO 
1617. 


247 


69 


1193 


AAB08894 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 4 SEQ ID NO: 5 1. 


208 


57 


1193 


gi21070180 


Danio rerio 


envelope protein 


102 


40 


1194 


AAG03963 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 8044. 


417 


80 


1194 


ABB10412 


Homo sapiens 


Human cDNA SEQ ID NO: 720. 


289 


100 


1194 


ABB10168 


Homo sapiens 


Human cDNA SEQ ID NO: 476. 


289 


100 


1196 


AAM41429 


Homo sapiens 


Human polypeptide SEQ ID NO 
6360. 


328 


100 


1196 


AAM39643 


Homo sapiens 


Human polypeptide SEQ ID NO 
2788. 


328 


100 


1196 


AAM79610 


Homo sapiens 


Human protein SEQ ID NO 3256. 


328 


100 


1197 


AAE19830 


Homo sapiens 


Human patched (Ptch) protein. 


213 


63 


1197 


AAB67163 


Homo sapiens 


Human patched protein. 


213 


63 


1197 


AAW47157 


Homo sapiens 


Nevoid basal cell carcinoma 
syndrome (NBCCS) (PTC) protein. 


213 


63 


1198 


AAB25674 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 10 SEQ ID NO:oi. 


643 


84 


1198 


AAB36613 


Homo sapiens 


Human FLEXHT-35 protein 
sequence SEQ ID NO:35. 


626 


80 


1198 


gil4603247 


Homo sapiens 


Similar to RIKEN cDNA 
5730409G15 gene 


626 


80 


1199 


AAO04718 


Homo sapiens 


Human polypeptide SEQ ID NO 
18610. 


160 


75 


1199 


AAO04681 


Homo sapiens 


Human polypeptide SEQ ID NO 


160 


75 
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18573. 






1199 


AAO02320 


Homo sapiens 


Human polypeptide SEQ ID NO 
16212. 


154 


73 


1200 


AAU 12292 


Homo sapiens 


Human PRO6027 polypeptide 
sequence. 


990 


98 


1200 


AAU27673 


Homo sapiens 


Human protein AFP235412. 


987 


99 


1200 


gi7303340 


Drosophila 
melanogaster 


CG4676-PA 


189 


30 


1201 


ABG60015 


Homo sapiens 


Human DITHP polypeptide #73. 


28 1 


66 


1201 


ABG60059 


Homo sapiens 


Human DITHP polypeptide #117. 


249 


60 


1201 


gi2 1928245 


Homo sapiens 


seven transmembrane helix receptor 


218 


60 


1202 


ABG60015 


Homo sapiens 


Human DITHP polypeptide #73. 


276 


68 


1202 


ABG60059 


Homo sapiens 


Human DITHP polypeptide #117. 


257 


47 


1202 


AAB43928 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1373. 


223 


55 


1203 


AAE04368 


Homo sapiens 


Human kinase (PKIN)-9. 


120 


85 


1203 


AAM79153 


Homo sapiens 


Human protein SEQ ID NO 1815. 


120 


85 


1203 


AAY68784 


Homo sapiens 


Amino acid sequence of a human 
phosphorylation effector PHSP-16. 


120 


85 


1204 


AAU74760 


Homo sapiens 


Human protease PRTS-20 protein 
sequence. 


1042 


98 


1204 


AAE04733 


Homo sapiens 


Human protease homologue #1. 


1037 


98 


1204 


AAE04734 


Homo sapiens 


Human protease homologue #2. 


1019 


97 


1205 


AAW67842 


Homo sapiens 


Human secreted protein encoded by 
gene 36 clone HODCL36. 


45 1 


76 


1205 


gi7717366 


Homo sapiens 


tryptophan rich protein, congenital 
heart disease 5 protein CHD5 


451 


76 


1205 


gi 1946205 


Homo sapiens 


congenital heart disease 5 protein 


445 


75 


1206 


AAG01971 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6052. 


314 


100 


1206 


gi3 183989 


Lycopersicon 
esculentum 


P69E protein 


78 


41 


1206 


gi4200340 


Lycopersicon 
esculentum 


P69D protein 


77 


41 


1207 


gi 140432 11 


Homo sapiens 


Similar to RIKEN cDNA 
4931428F04 gene 


889 


83 


1207 


gil 1276027 


Rattus 
norvegicus 


LSC 


92 


30 


1207 


gil389756 


Mus musculus 


Lsc 


92 


30 


1208 


AAY91653 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 62 SEQ ID 
NO:326. 


813 


79 


1208 


AAY91512 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 62 SEQ ID 
NO: 185. 


813 


79 


1208 


AAY44720 


Homo sapiens 


Human immune system molecule, 
ISMO-1. 


747 


78 


1209 


AAY2764S 


• 

Homo sapiens 


Human secreted protein encoded by 
gene No. 82. 


dZZ 




1209 


gi7959897 


Homo sapiens 


PR02379 


68 


39 


1211 


AAU19622 


Homo sapiens 


Human diagnostic and therapeutic 
polypeptide (DITHP) #208. 


339 


62 


1211 


AAB08765 


Homo sapiens 


A human leukocyte and blood related 
protein (LBAP). 


339 


62 


1211 


AAB74718 


Homo sapiens 


Human membrane associated protein 


314 


66 
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MEMAP-24. 






1212 


AAG65893 


Homo sapiens 


Amino acid sequence of GSK gene 
Id 90060. 


1901 


71 


1212 


AAB80264 


Homo sapiens 


Human PR0332 protein. 


928 


41 


1212 


AAU12356 


Homo sapiens 


Human PR0332 polypeptide 
sequence. 


928 


41 


1213 


AAE24240 


Homo sapiens 


Human 23566 (carboxypeptidase) 
protein. 


1551 


99 


1213 


AAG66547 


Homo sapiens 


Human secreted 

metallocarboxypeptidase-like 

polypeptide. 


1551 


99 


1213 


AAG66565 


Homo sapiens 


Human secreted 

metall ocarboxypepti dase- like van ant 
polypeptide. 


1548 


98 


1214 


AAB82317 


Homo sapiens 


Human immunoglobulin receptor 
IRTA4 protein. 


528 


100 


1214 


AAB85464 


Homo sapiens 


Human immunoglobulin domain- 
containing polypeptide. 


528 


100 


1214 


gil5528833 


Homo sapiens 


Fc receptor-like protein 2 


528 


1 /-V /"V 

100 


1215 


ABP43492 


Homo sapiens 


Human secreted protein (SCEP) 16. 


1033 


100 


1215 


gi20380668 


Homo sapiens 


similar to MANNOSE-P- 
DOLICHOL UTILIZATION 
DEFECT 1 PROTEIN HOMOLOG 


1033 


100 


1215 


AAU27663 


Homo sapiens 


Human protein AFP285042. 


905 


90 


1216 


AAU97104 


Homo sapiens 


Human MK61 protein, HMK61T4. 


979 


99 


1216 


AAU97106 


Homo sapiens 


Human MK61 protein, hMK61T6. 


846 


88 


1216 


AAU97101 


Homo sapiens 


Human MK61 protein, hMK61Tl. 


683 


83 


1217 


AAU97106 


Homo sapiens 


Human MK61 protein, hMK61T6. 


833 


94 


1217 


AAU97104 


Homo sapiens 


Human MK61 protein, hMK61T4. 


805 


85 


1217 


AAU97101 


Homo sapiens 


Human MK61 protein, hMK61Tl. 


529 


92 


1218 


AAB92697 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11091. 


71 


44 


1218 


AAB92547 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 10726. 


71 


44 


1218 


gi386638 


Mus sp. 


mesenchyme fork head l 


70 


34 


1220 


gill71589 


Plasmodium 
falciparum 


frameshift 


74 


33 


1220 


gi45 12010 


Escherichia coli 


OrfY 


66 


50 


1221 


AAO 15424 


Homo sapiens 


Human genset metabolic gene 
(GMG-10) protein. 


1462 


94 


1221 


AAB50371 


Homo sapiens 


Human ZACRP7. 


1462 


94 


1221 


AAE09444 


Homo sapiens 


Human SBhACRP30a protein #2. 


1462 


94 


1222 


ABG40131 


Homo sapiens 


Human peptide encoded by genome- 
derived single exori probe SEQ ID 
29796. 


71 


34 


1222 


AAM05935 


Homo sapiens 


Peptide #4617 encoded by probe for 
measuring breast gene expression. 


71 


34 


1222 


AAM30815 


Homo sapiens 


Peptide #4852 encoded by probe for 
measuring placental gene expression. 


71 


34 ! 


1223 


gi8 850245 


Homo sapiens 


activated p21cdc42Hs kinase 


5605 


100 


1223 


gi2921447 


Mus musculus 


non-receptor protein tyrosine kinase 
Ack 


5164 


92 


1223 


gi2078388 


Bos taurus 


Cdc42-associated tyrosine kinase 
ACK-2 


3503 


91 


1224 


AAB 84696 


Homo sapiens 


Amino acid sequence of a human 


358 


33 
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~ vj 

zkunlO polypeptide. 






1224 


gi211622 


Gallus gallus 


alpha-3 collagen type VI 


300 


32 


1224 


ABP41277 


XT * 

Homo sapiens 


Human ovarian antigen HOCQH66, 
SEQ ID NU:24Uy. 


Zo 1 




1225 


AAB66065 


Homo sapiens 


r f ^ rp k TvT/~t/~v TO/1 

Human IAJnuU 2y4. 


J.L ID 


QQ 


1225 


AAE11931 


Homo sapiens 


Human COioz (or Cj9j lipase 
protein #2. 


0 1 1 7. 




1225 


AAB66067 


Homo sapiens 


Human TANGO 294 mature protein. 


2015 


99 


1226 


AAM06483 


T T * 

Homo sapiens 


Human toetal protein, oEQ ID JN(J: 
214. 


Zoo 




1227 


AAR60521 


Homo sapiens 


Human tetranectin. 






1227 


gi37409 


Homo sapiens 


Tetranectin 


869 


83 


1227 


gi825722 


Homo sapiens 


tetranectin i 


869 




1228 


gi5790207 


Taenia saginata 


ATPase subunit 6 


68 


32 


1228 


gi8778323 


Arabidopsis 
thaliana 


F14J16.24 


66 


52 


1229 


AAE01790 


Homo sapiens 


Human gene 21 encoded secreted 
protein HDPTW65, SEQ ID NO: 1 1 1 . 


142 


59 


1229 


AAE01838 


Homo sapiens 


Human gene 21 encoded secreted 
protein HDPTW65, SEQ ID NO: 159. 


140 


5 / 


1229 


ABB 11479 


Homo sapiens 


Human reverse transcriptase 
homologue, SEQ ID NO: 1849. 


91 


55 


1230 


ABB50466 


Homo sapiens 


Human secreted protein encoded by 
gene 166 SEQ ID NO:414. 


333 


100 


1230 


AAW88699 


Homo sapiens 


Secreted protein encoded by gene 
166 clone HCEQA68. 


333 


100 


1230 


AAU83594 


Homo sapiens 


Human PRO protein, Seq ID No 6. 


327 


100 


1231 


AAG00381 


Homo sapiens 


Human secreted protem, SEQ ID 
NO: 4462. 


O /TO 

263 


91 


1231 


AAU19357 


Homo sapiens 


Human G protein-coupled receptor 
nGPCR-2290. 


1 1 A 

119 


CO 

52 


1231 


AA013251 


Homo sapiens 


Human polypeptide SEQ ID NO 
27143. 


105 


39 


1232 


AAM06558 


Homo sapiens 


Human foetal protein, SEQ ID NO: 
289. 


301 


98 


1233 


AAM06562 


Homo sapiens 


Human foetal protein, SEQ ID NO: 
293. 


383 


100 


1233 


AAU03519 


Homo sapiens 


TT , • t " im n 

Human protein kinase #19. 


lb 


J4 


1234 


gil3561474 


Mertensiella 
luschani atifi 


NADH dehydrogenase subunit 2 




ZD 


1234 


gil3561462 


Mertensiella 
luschani biiiae 


NADH aenyarogenase suounit z 


/U 


ZD 


1234 


gil3561454 


Salamandra 
infraimmacul at a 


NADH dehydrogenase subunit 2 


69 


32 


1236 


gi992917 


Glycine max 


acetyl CoA carboxylase 


67 


53 


1236 


gil066857 


Glycine max 


acetyl-CoA carboxylase 


65 


53 


1237 


AAE20197 


Homo sapiens 


Human TIE ligand, FLS139 protein. 






1237 


AAiiiyo2o 


Homo sapiens 


numan pl/ij lj¥ nga.no piutciii. 


1947 


94 


1237 


ABB80020 


Homo sapiens 


TIE lifiand FLS139 amino acid. 


1947 


94 


1238 


AAM06568 


Homo sapiens 


Human foetal protein, SEQ ID NO: 
299. 


142 


57 


1245 


gi3 873768 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01461 (7TM chemoreceptor), 
Score=28.9, E-value=1.5e-16, N=2 


69 


28 


1245 


AAU83682 


Homo sapiens 


Human PRO protein, Seq ID No 182. 


67 


39 
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1245 


AAU291 65 


Homo sapiens 


Human PRO polypeptide sequence 
#142. 


67 


39 


1247 


ABB89102 


Homo sapiens 


Human polypeptide SEQ ID NO 
1478. 


517 


100 


1247 


AAM38788 


Homo sapiens 


Human polypeptide SEQ ID NO 
1933. 


83 


21 


1247 


gi2l 109777 


Xanthomonas 

* * 

axonopodis pv. 
citri str. 306 


transcriptional regulator 


83 


28 


1248 


gil5292277 


Drosophila 
melanogaster 


LD45324p 


70 


26 


1249 


gi298848 


Equine 
herpesvirus 1 


glycoprotein C homolog 


67 


A 1 

41 


1249 


gi330808 


Equine 
herpesvirus l 


membrane glycoprotein C 


67 


41 


1249 


gi330899 


Equine 
herpesvirus 1 


glycoprotein gpl3 precursor 


67 


41 


1251 


AAOl 1677 


Homo sapiens 


Human polypeptide SEQ ID NO 
25569. 


74 


54 


1255 


gil6551105 


Crotalus 
adamanteus 


NADH dehydrogenase subumt 5 


66 


28 


1257 


ABB53264 


Homo sapiens 


Human polypeptide #4. 


3044 


100 


1257 


gi 18565270 


Homo sapiens 


Lib 


3028 


99 


1257 


gi 18565266 


Rattus 
norvegicus 


Lib 


2551 


83 


1259 


gi335876 


Vesicular 
stomatitis virus 


matrix (M) protein 


71 


26 


1259 


gi61842 


Vesicular 
stomatitis virus 


M protein (aa 1-237) 


71 


26 


1259 


gi336034 | 


Vesicular 
stomatitis virus 


M-protein 


71 


26 


1261 


AAB95686 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18490. 


983 


99 


1261 


gi 18446901 


Drosophila 
melanogaster 


AT07234p 


80 


32 


1262 


gi9858058 


Tortula ruralis 


rehydrin 


85 


29 


1262 


gi 19882261 


Gallus gallus 


• 

paranemm 


80 


27 


1262 


gi9837383 


Bos taurus 


retinitis pigmentosa GTPase 
regulator 


76 


29 


1263 


AAU98893 


T *¥" * 

Homo sapiens 


Human protease PRTS 1 1 . 


1600 


80 


1263 


gi965014 


•mm- 1 

Mus mus cuius 


ADAM 4 protein precursor 


1297 


51 


1263 


gil061159 


Macaca 
fasciculans 


testicular Metalloprotease-like, 
Dismtegrm-like, Cysteme-rich 

j * Tt r 

protein IVa 


1253 


38 


1264 


AAE21462 


Homo sapiens 


Human gene 1 1 encoded secreted 
protein HWBBT49, SEQ ID NO:78. 


2107 


99 


1264 


AAM80033 


Homo sapiens 


Human protein SEQ ID NO 3679. 


1895 


98 


1264 


AAM79049 


T T * 

Homo sapiens 


Human protein SEQ ID NO 171 1. 


1895 


9o 


1265 


AAB44605 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 10 SEQ ID NO:70. 


93 


70 


1266 


AAOl 5039 


Homo sapiens 


Human clusterin protein sequence 1 . 


1981 


87 


1266 


gi30251 


Homo sapiens 


SP-40,40 prepropetide (AA -22 to 
427) 


1981 


87 


1266 


gi!4714741 


Homo sapiens 


clusterin (complement lysis inhibitor, 
SP-40,40, sulfated glycoprotein 2 3 


1981 


87 
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testosterone-repressed prostate 
message 2, apolipoprotein J) 






1267 


AAB37963 


Homo sapiens 


Human antithrombm III ammo acid 
sequence. 


2136 


91 : 


1267 


AAY92224 


Homo sapiens 


Human antithrombin III. 


2136 


91 


1267 


AAR42895 


Homo sapiens 


Human antithrombin III (wild-type). 


2136 


91 


1268 


AAB37963 


Homo sapiens 


Human antithrombin III amino acid 
sequence. 


2276 


96 


1268 


AAY92224 


Homo sapiens 


Human antithrombin III. ; 


2276 


96 


1268 


AAR42895 


Homo sapiens 


Human antithrombin III (wild-type). 


2276 


96 


1269 


gi203519 


Rattus 
norvegicus 


cytochrome c oxidase subunit Vic 


247 


67 


1269 


gi 193 54073 


Mus musculus 


cytochrome c oxidase, subunit VIc 


244 


67 


1269 


AAB56523 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1 101. 


229 


61 


1270 


gi7981304 


Homo sapiens 


dJ551D2.1.2 (cadherin-like protein 
VR20, isoform 2) 


575 


84 


1270 


AAM96033 


Homo sapiens 


Human reproductive system related 
antigen SEQ ID NO: 4691. 


512 


84 


1270 


gi9622236 


Homo sapiens 


cadherin-like protein VR20 


272 


100 


1271 


gi6692690 


Arabidopsis 
thaliana 


F12K11.14 


105 


26 


1271 


gil58696 


Drosophila 
melanogaster 


tropomyosin isoform 9D 


100 


25 


1271 


gil58695 


Drosophila 
melanogaster 


tropomyosin isoform 33 (9C) 


100 


25 


1272 


gi6692690 


Arabidopsis 
thaliana 


F12K11.14 


106 


24 


1272 


gil 1935051 


Oryctolagus 
cuniculus 


sarcolemmal associated protein 1 


99 


27 


1272 


gil850913 


Entamoeba 
histolytica 


myosin heavy chain 


98 


19 


1273 


AAM79617 


Homo sapiens 


Human protein SEQ ID NO 3263. 


1481 


94 


1273 


AAM78633 


Homo sapiens 


Human protein SEQ ID NO 1295. 


1481 


94 


1273 


gil6876913 


Homo sapiens 


mercaptopyruvate sulfurtransferase 


1481 


94 


1274 


AA021899 


Homo sapiens 


Protein of human protease. 


772 


49 


1274 


AAE 18722 


Homo sapiens 


Human DESCl-like serine protease. 


767 


48 


1274 


AAB85039 


Homo sapiens 


Human SER5 protein sequence. 


754 


47 


1275 


gil2584839 


Homo sapiens 


HT036-ISO 


1003 


94 


1275 


gil 25 84841 


Homo sapiens 


HT036 


826 


93 


1275 


gil7985315 


Brucella 
melitensis 


HYDROXYPYRUVATE 
ISOMERASE 


512 


43 


1277 


AAB92449 


Homo sapiens 


Human protein sequence SEQ ID 
NO:10478. 


26 1 


100 


1277 


AAB83299 


Homo sapiens 


Human homoglutamine-rich factor 
56. 


26 1 


100 


1277 


AAB82342 


Homo sapiens 


Winged helix/zinc finger 
transcription factor FOXP1 variant. 


26 1 


100 


1278 


AAM25840 


Homo sapiens 


Human protein sequence SEQ ID 
NO:1355. 


208 


88 


1279 


AAU72886 


Homo sapiens 


Human aspartyl protease partial 
protein sequence #1 1. 


82 1 


98 


1279 


AAU98884 


Homo sapiens 


Human protease PRTS2. 


82 1 


98 


1279 


gi7023943 


Homo sapiens 


down-regulated in gastric cancer 


350 


35 


1280 


AAG77975 


Homo sapiens 


Protein of a human soluble adenylyl 


265 


37 
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cyclase. 






1280 


AAE21187 


Homo sapiens 


Human soluble adenylyl cyclase 
(SAC) protein. 


265 


37 


1280 


AAB81929 


Homo sapiens 


Human soluble adenylyl cyclase. 


265 


37 '■ 


1282 


gi4079809 


Homo sapiens 


HERC2 


68 


42 


1283 


ABG45272 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
34937. 


153 


67 


1283 


AAM35920 


Homo sapiens 


Peptide #9957 encoded by probe for 
measuring placental gene expression. 


153 


67 


1283 


AAM20697 


Homo sapiens 


Peptide #7131 encoded by probe for 
measuring cervical gene expression. 


153 


67 


1284 


AAG81367 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:252. 


849 


90 


1284 


gil8088345 


Homo sapiens 


Similar to RIKEN cDNA 
1110066C01 gene 


832 


90 


1284 


gi20381141 


Mus musculus 


RIKEN cDNA 1110066C01 gene 


642 


68 


1285 


AAG81367 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:252. 


939 


91 


1285 


gil 8088345 


Homo sapiens 


Similar to RIKEN cDNA 
1110066C01 gene 


827 


89 


1285 


gi20381141 


Mus musculus 


RIKEN cDNA 1110066C01 gene 


724 


70 


1286 


AAM79782 


Homo sapiens 


Human protein SEQ ID NO 3428. 


95 


36 


1286 


AAM7879S 


Homo sapiens 


Human protein SEQ ID NO 1460. 


93 


41 


1286 


gil77179 


Homo sapiens 


alpha- 2 type VIII collagen 


93 


41 


1287 


gi5689766 


Homo sapiens 


zinc ringer 2.2 


2092 


99 


1287 


AAM39130 


Homo sapiens 


Human polypeptide SEQ ID NO 
2275. 


1048 


46 


1287 


gi3 135968 


Homo sapiens 


b34I8.1 (zinc finger protein 184 
(Kruppel-like)) 


1048 


46 


1288 


ABP43144 


Homo sapiens 


Human ovarian antigen HWCD65, 
SEQ ID NO:4276. 


281 


86 


1288 


AAB44228 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1673. 


281 


86 


1288 


ABB06807 


Homo sapiens 


Human nGPCR-Seql019 protein 
sequence SEQ ID NO:76. 


69 


34 


1290 


AAG03150 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7231. 


307 


98 


1290 


AAW48931 


Homo sapiens 


Schwannomin-binding protein C- 
terminal fragment. 


286 


100 


1291 


AAB60098 


Homo sapiens 


Human transport protein TPPT-18. 


2360 


92 


1291 


gil537070 


Rattus 
norvegicus 


nucleoporin p54 


2292 


88 


1291 


gi7688695 


Homo sapiens 


nucleoporin p54 protein 


2246 


89 


1292 


AAY94621 


Homo sapiens 


Epidermal growth factor-like variant 
in skin-2 amino acid sequence. 


422 


79 


1292 


ABB72266 


Homo sapiens 


Human protein isolated from skin 
cells SEQ ID NO: 417. 


415 


78 


1292 


AAB56066 


Homo sapiens 


Skin cell protein, SEQ ID NO: 417. 


415 


78 


1293 


AAM79352 


Homo sapiens 


Human protein SEQ ID NO 2998. 


1018 


98 


1293 


ABB 11835 


Homo sapiens 


Human secreted protein homologue, 
SEQ ID NO:2205. 


1018 


98 


1293 


AAW78245 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 19. 


1018 


98 


1294 


AAM99920 


Homo sapiens 


Human polypeptide SEQ ID NO 36. 


669 


85 
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1294 


AAM99933 


Homo sapiens 


Human polypeptide SEQ ID NO 49. 


629 


82 


1294 


AAG75526 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO: 6290. 


188 


38 


1295 


gi2598167 


Homo sapiens 


zinc finger protein 


2772 


99 


1295 


gi5640019 


Mus musculus 


zinc finger protein ZFP235 


1707 


65 


1295 


gil 3277768 


Mus musculus 


zinc finger protein 93 


1434 


54 


1296 


gi4567180 


Homo sapiens 


BC37295 2 (partial) 


2995 


100 


1296 


ABB50238 


Homo sapiens 


Human transcription factor TRFX- 
89. 


2734 


100 


1296 


gi9502202 


Homo sapiens 


endothelial zinc ringer protein 
induced by tumor necrosis factor 
alpha 


2734 


100 


1297 


ABG38718 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
28383. 


97 


55 


1297 


AAM04653 


Homo sapiens 


Peptide #3335 encoded by probe for 
measuring breast gene expression. 


97 


55 


1297 


AAM29443 


Homo sapiens 


Peptide #3480 encoded by probe for 
measuring placental gene expression. 


97 


55 


1298 


AAE14459 


Homo sapiens 


Human protein phosphatase-9. 


601 


-t /""V 

100 


1298 


AAM79176 


Homo sapiens 


Human protein SEQ ID NO 1838. 


601 


100 


1298 


AAE 14251 


Homo sapiens 


Human 16051b protein. 


563 


100 


1299 


AAB74690 


Homo sapiens 


Human protease and protease 
inhibitor PPIM-23. 


2420 


96 


1299 


AAB 85427 


Homo sapiens 


Human MPROT45 polypeptide. 


2420 


96 


1299 


AAU12243 


Homo sapiens 


Human PR04339 polypeptide 
sequence. 


2420 


96 


1300 


AAE 16953 


Homo sapiens 


Human precursor interleukin-18 
(Pro-IL-18) protein. 


138 


92 


1300 


AAG63830 


Homo sapiens 


Amino acid sequence of human 
mterleukin 18 (IL-18). 


138 


92 


1300 


AAB30541 


Homo sapiens 


A human IL-18 with a caspase-8 
cleavage site. 


138 


92 


1301 


AAE05302 


Homo sapiens 


Human TANGO 457 protein. 


901 


78 


1301 


AAE05303 


Homo sapiens 


Human mature TANGO 457 protein. 


889 


79 


1301 


AAE05305 


Homo sapiens 


Human TANGO 457 protein 
cytoplasmic domain. 


883 


78 


1302 


gi2202 


Canis sp. 


Clox 


68 


73 


1302 


gi6066468 


Leishmania 
major 


probable DNA polymerase zeta 
catalytic component 


67 


A -r 

41 


1303 


gi20068312 


Atropa 
belladonna 


maturase 


65 


34 


1304 


gi297146 


Homo sapiens 


retinoic acid receptor gamma 2 


274 


100 


1304 


gil 803 1837 


Mesocricetus 
auratus 


retinoic acid receptor gamma-2 


256 


94 


1304 


gi200672 


Mus musculus 


retinoic acid receptor gamma 


252 


92 


1305 


AAM39737 


Homo sapiens 


Human polypeptide SEQ ID NO 
2882. 


992 


99 


1305 


ABG34053 


Homo sapiens 


Human Pro peptide #24. 


875 


100 


1305 


AAM41523 


Homo sapiens 


Human polypeptide SEQ ID NO 
6454. 


875 


100 


1307 


gi212485 


Gallus gallus 


ovoinhibitor 


133 


47 


1307 


ABB72111 


Homo sapiens 


Human protein isolated from skin 
cells SEQ ID NO: 150. 


132 


42 


1307 


AAB55911 


Homo sapiens 


Skin cell protein, SEQ ID NO: 150. 


132 


42 



WO 03/054152 



PCT/US02/39555 



Table 2B 
261 



SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


% 
Identity 


1308 


AAU27671 


Homo sapiens 


Human protein AFP35547 1 . 


4oo 


1 Art 
1UU 


1308 


gil256888 


Saccharomyces 
cerevisiae 


Ylr086wp 

. 


Id 




1308 


gi21280444 


Homo sapiens 


AF15ql4 


n 1 


Z / ! 


1309 


AAR15222 


Homo sapiens 


Chronic myelogenous leukaemia- 
derived myeloid-related protein. 




1 f\f\ 


1309 


gi32402 


Homo sapiens 


HP-1 (AA 1-94) 




1UU 


1309 


gilS1529 


Homo sapiens 


defensm 1 




1 AA 


1310 


AAB39114 


Homo sapiens 


-»--r , 1 , ■ XI 

Human secreted protein #22. 


i ah 


1 AA 

1UU 


1310 


gi23 16086 


Pisum sativum 


gibberellin 3B-hydroxylase 


67 


43 


1310 


gi2316018 


Pisum sativum 


gibbereilm 3 beta-hydroxylase 


67 


A1 

4J 


1311 


ABP47852 


Homo sapiens 


Human polypeptide SEQ ID NO 282. 


one 

205 


A A 

4U 


1311 


ABB90743 


Homo sapiens 


Human Tumour Endothelial Marker 
polypeptide SEQ ID NO 218. 


loo 


/lO 

4z 


1311 


gi3 127926 


Homo sapiens 


collagen type VI, alpha 3 chain 


1 Q< 

loo ! 


/JO 

4z 


1312 


gil6555334 


Homo sapiens 


Rig protein 


1 f\ 1 o 

1U1Z 


1 AA 


1312 


gil6508176 


Homo sapiens 


small GTP-binding tumor suppressor 
1 


1012 


100 


1312 


gi21040535 


Homo sapiens 


similar to Rig protein 


1012 


1UU 


1313 


gi21741597 


Oryza sativa 


OS JNBa0052Pl 6.6 


84 


Af\ 

40 


1313 


gi21741539 


Oryza sativa 


OSJNBa0052P16.7 


o4 


A C\ 

4U 


1313 


gi21743203 


Oryza sativa 


OSJNBa0085C10.10 


81 


3 / 


1314 


gi2689446 


Homo sapiens 


R27945 1 


loo9 


91 


1314 


AAM71801 


Homo sapiens 


Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
32107. 


1872 


1 AA 


1314 


AAM79549 


Homo sapiens 


Human protein SEQ ID NO 3 195. 


1 1 /f C 

114d 


rr 
DJ 


1315 


AAM79404 


Homo sapiens 


Human protein SEQ ID NO 3050. 




/y 


1315 


gi3228237 


Homo sapiens 


ultra high sulfer keratin 


791 


oy 


1315 


gi32472 


Homo sapiens 


high-sulpher keratin 


TOO 

/OD 


/o 


1316 


gi 12655446 


Homo sapiens 


keratin associated protein 4.4 


768 


78 


1316 


gil3278909 


Homo sapiens 


Similar to RIKEN cDNA 
lll0054Pl9gene 


738 


75 


1316 


gil2655460 


Homo sapiens 


keratin associated protein 4.1/ 


738 


75 


1317 


gil2655462 


Homo sapiens 


keratin associated protein 4.14 


1 102 


bo 


1317 


gil2655452 


Homo sapiens 


keratin associated protein 4.7 


lOoo 


QA 

o4 


1317 


gil2655456 


Homo sapiens 


keratin associated protein 4.9 


1002 


82 


1318 


AAM39466 


Homo sapiens 


Human polypeptide SEQ ID NO 
2611. 


893 


78 


1318 


AAM41252 


Homo sapiens 


Human polypeptide SEQ ID NO 
6183. 


885 


78 


1318 


gi3228237 


Homo sapiens 


ultra high sulfer keratin 


872 




1319 


ABB79480 


Homo sapiens 


Human zinc finger protein 75.68. 


800 


65 


1319 


gi6467200 


Homo sapiens 


gonadotropin inducible transcription 
repress or- 1 


773 


63 


1319 


gi20306351 


Homo sapiens 


similar to gonadotropin inducible 
transcription repressor- 1 


773 


63 


1320 


gi3036963 


Ciona savignyi 




163 


60 


1320 


gi2 1667044 


Ustilago maydis 


GTP binding protein Cdc42 


162 


60 


1320 


gil5072535 


Schizophyllum 
commune 


small GTPase CDC42 


162 


60 


1321 


AAE02058 


Homo sapiens 


Human four disulfide core domain 
(FDCD)-containing protein. 


5ll 


43 


1321 


gil2655452 


Homo sapiens 


keratin associated protein 4.7 


492 


43 


1321 


gi200964 


Mus musculus 


serine 2 ultra high sulfur protein 


48 1 


42 
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S 
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% 
lueniivy 


1322 


ABB 12490 


Homo sapiens 


Human bone marrow expressed 
protein oJbQ lu JNU. 3 zy. 


169 


72 


1323 


AAM39144 


Homo sapiens 


Human polypeptide SEQ ID NO 

ZZ&y. 


378 


92 


1323 


gi5921473 


Homo sapiens 


G8 protein 


378 


92 


1323 


AAM40930 


T T * 

Homo sapiens 


Human polypeptide kEQ ID NO 
5861. 


3 /U 


Q 1 


1324 


gi 18446967 


Drosophua 
melanogaster 


AT14419p 
— — 


/V 


on 
Z9 


1325 


gi 15277229 


Homo sapiens 


Homologue to Drosophila 
photoreceptor protein calphotin 


/O 


34 


1325 


gil 136400 


T T * 

Homo sapiens 


similar to Drosophila photoreceptor 
cell-specific protein, calphotin. 


/O 


34 


1327 


AAU07343 


Homo sapiens 


1-aminocyclopropane carboxylate 
(AO" syntnase tttz. 


O^O 

ZOZ 


3o 


1327 


AAU91279 


Homo sapiens 


Human NOV3a protein. 


259 


33 


1327 


gi3 13720 


Ovis aries 


JvAfj.4 keratin protein 


1 in 

i / / 


ZD 


1328 


AAY08325 


Homo sapiens 


Human granulysin P520 active 
fragment. 


316 


75 


1328 


AAW59874 


T T * 

Homo sapiens 


Amino acid sequence of the cDNA 
clone CAT-1 (HTXET53). 


^ 1 




1328 


AAR23732 


Homo sapiens 


Gene 5 19 cDNA derived peptide. 


1 

3 iO 


/J 


1329 


AAM41252 


Homo sapiens 


Human polypeptide bbQ ID JNO 
6183. 


34j 


43 


1329 


AAM39466 


Homo sapiens 


Human polypeptide oJbv,/ ID JNU 
/oil. 




4j 


1329 


gi200964 


Mus musculus 


serine 2 ultra high sulfur protein 


342 


49 


1330 


gi 13937769 


Homo sapiens 


oimiiar to Kiivh/JN cl^jn a 
1200013F24 gene 


1 


1UU 


1330 


gt75 82294 


Homo sapiens 


DM A 1 1 

rsM-Ui l 


/ol 


OS 


1330 


AAG67014 


Homo sapiens 


Human sperm-specific protein EMI, 

"OA A /Z A O 

bMo-4o, 


249 


30 


1331 


gil4718451 


Homo sapiens 


sialic acid-binding lectin 11 


793 


70 


1331 


gi 197 16086 


Homo sapiens 


Sialic acid-binding Ig-like lectin 
Siglec-12 


793 




1331 


AAU29082 


Homo sapiens 


Human PRO polypeptide sequence 
#59. 


551 


K A 


1332 


ABP41951 


T T * 

Homo sapiens 


Human ovarian antigen HDABR73, 
SEQ ID NO:3083. 


Q A O 

o4o 


yo 


1332 


AAB43821 


TT 

Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1266. 


QytQ 


no 


1332 


AAB 14201 


Homo sapiens 


Human placental bikunin protein 
rragment # lo. 


O4o 


ys 


1333 


AAB 93 1 64 


TT . • 

Homo sapiens 


Human protein sequence SEQ ID 

JNU. LZvy 1. 


ZlOD 


yo 


1333 


AAM93693 


Homo sapiens 


Human polypeptide, SEQ ID NO: 


2159 


100 


ijjj 


ATVR50904 




Human trflTmr.rintion faofor TTRT-OT- 

55. 


1206 


57 


1334 


gil 2804907 


Homo sapiens 


Similar to metaxin 1 


1512 


100 


1334 


gi25649l3 


Homo sapiens 


metaxin 


1470 


90 


1334 


gil 8606009 


Mus musculus 


metaxin 


1333 


81 


1335 


ABB89371 


Homo sapiens 


Human polypeptide SEQ ID NO 
1747. 


487 


100 


1335 


ABB90319 


Homo sapiens 


Human polypeptide SEQ ID NO 


381 


83 
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2695. 






1335 


ABP61852 


Homo sapiens 


Human polypeptide SEQ ID NO 206. 


381 


83 


1336 


AAB 18456 


Homo sapiens 


A human TANGO 216 polypeptide 
clone. 


2257 


99 


1336 


AAB 18447 


Homo sapiens 


Amino acid sequence of human 
TANGO 216 polypeptide. 


2257 


99 


1336 


AAB18457 


Homo sapiens 


A human TANGO 216 polypeptide 
clone. 


2254 


99 


1337 


AAB18456 


Homo sapiens 


A human TANGO 216 polypeptide 
clone. 


2257 


99 


1337 


AAB 18447 


Homo sapiens 


Amino acid sequence of human 
TANGO 216 polypeptide. 


2257 


99 


1337 


AAB 18457 


Homo sapiens 


A human TANGO 216 polypeptide 
clone. 


2254 


99 


1338 


AAY86303 


Homo sapiens 


Human secreted protein HOGCK20, 
SEQ ID NO:218. 


2382 


88 


1338 


AAY86333 


Homo sapiens 


Human secreted protein HOGCK20, 
SEQ ID NO:248. 


2215 


87 


1338 


AAB65254 


Homo sapiens 


Human PR01379 (UNQ716) protein 
sequence SEQ ID NO:340. 


2117 


96 


1339 


gi20072551 


Mus musculus 


RIKEN cDNA 49305 1 1 Jl 1 gene 


412 


43 


1339 


gil2836893 


Gallus gallus 


IPR328-like protein 


150 


29 


1339 


gil 7974542 


Homo sapiens 


voltage-dependent calcium channel 
gamma- 8 subunit 


149 


25 


1340 


gi20072551 


Mus musculus 


RIKEN cDNA 49305 1 1 Jl 1 gene 


420 


45 


1340 


gil 7974542 


Homo sapiens 


voltage-dependent calcium channel 
gamma- 8 subunit 


147 


25 


1340 


gil2836893 


Gallus gallus 


IPR328-like protein 


147 


29 


1341 


AAG89353 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 473. 


692 


100 


1341 


ABB11882 


Homo sapiens 


Human transmembrane protein 
homologue, SEQ ID NO:2252. 


692 


100 


1341 


AAW85737 


Homo sapiens 


Polypeptide with transmembrane 
domain. 


692 


100 


1342 


ABB 1203 2 


Homo sapiens 


Human SIGP 2328134 homologue, 
SEQ ID NO:2402. 


1202 


82 


1342 


AAY21851 


Homo sapiens 


Human signal peptide-contianing 
protein (SIGP) (clone ID 2328134). 


1202 


82 


1342 


gi4101574 


Homo sapiens 


54TMp 


1196 


81 


1343 


gi3002925 


Homo sapiens 


T cell receptor beta chain 


1658 


100 


1343 


AAE13850 


Homo sapiens 


Human lung tumour-specific N- 
terminal protein 14F10. 


1526 


94 


1343 


AAE13S48 


Homo sapiens 


Human lung tumour-specific T cell 
receptor beta chain. 


1526 


94 


1344 


gil4973269 


Streptococcus 

pneumoniae 

TIGR4 


cell wall surface anchor family 
protein 


413 


19 


1344 


gi560649 


Neocallimastix 
patriciarum 


Xylanase B; XYLB 


243 


19 


1344 


gil3094677 


Mus musculus 


ribosome receptor isoform mRRp61 


240 


22 


1345 


AAY07751 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 8. 


293 


100 


1345 


gil 142588 


Trypanosoma 
brucei 


CR3 


79 


42 


1345 


gi3037018 


Bodo saltans 


NADH dehydrogenase subunit 5 


78 


35 



WO 03/054152 PCT/US02/39555 



Table 2B 
264 



SEQ 
ID 


Hit ID 


Species 


Description 
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1346 


gi 18857903 


T X • 

Homo sapiens 


TCBA1 


867 


100 


1346 


AAG78000 


~~T X * 

Homo sapiens 


Human actin 14. 


663 


100 


1346 


ABB89045 


T" X • 

Homo sapiens 


Human polypeptide SEQ ID NO 
1421. 


644 


98 


1347 


gi9837433 


Homo sapiens 


sialic acid binding immunoglobulm- 
like lectin 8 long splice variant 


2206 


88 


1347 


AAW94995 


Homo sapiens 


SAF-2 polypeptide. 


2031 


93 


1347 


gi6980022 


Homo sapiens 


siglec SAF2 


2031 


93 


1348 


gi 1545 1469 


Homo sapiens 


siglec-like protein splice variant- 1 


2689 


99 


1348 


gil5217166 


Homo sapiens 


sialic acid-binding Ig-like lectin 10 


2682 


99 


1348 


gil4164613 


Homo sapiens 


sialic acid binding immunoglobulin- 
like lectin 1 0 


2356 


98 


1349 


AAU76036 


Homo sapiens 


Human sugar transporter- 1 (HST-1) 
protein sequence. 


1496 


89 


1349 


AAB60112 


Homo sapiens 


Human transport protein TPPT-32. 


775 


100 


1349 


AAB61903 


Homo sapiens 


Atherosclerosis-associated 
polypeptide. 


445 


38 


1350 


ABB06115 


Homo sapiens 


Human NS protein sequence SEQ ID 
NO:207. 


357 


97 


1350 


AAY76219 


Homo sapiens 


Human secreted protein encoded by 
gene 96. 


336 


94 


1350 


gi2906006 


Homo sapiens 


WASP interacting protein 


125 


30 


1351 


AAB08767 


Homo sapiens 


A human leukocyte and blood related 
protein (LBAP). 


87 


27 


1351 


ABB 8 93 84 


Homo sapiens 


Human polypeptide SEQ ID NO 
1760. 


86 


28 


1351 


gi5 76631 


Torpedo 
marmorata 


14 kDa transmembrane protein 


86 


32 


1352 


AAE 16765 


Homo sapiens 


Human transporter and ion channel-2 
(TRICH-2) protein. 


312 


96 


1352 


gi7576452 


Homo sapiens 


potent brain type organic ion 
transporter 


159 


37 


1352 


AAY53009 


Homo sapiens 


Human secreted protein clone 
fhl49 12 protein sequence SEQ ID 
NO:24. 


153 


36 


1353 


AAU83670 


Homo sapiens 


Human PRO protein, Seq ID No 158. 


2566 


99 


1353 


ABB84896 


Homo sapiens 


Human PRO 1309 protein sequence 
SEQ ID NO: 160. 


2566 


99 


1353 


ABB95502 


Homo sapiens 


Human angiogenesis related protein 
PRO1309 SEQ ID NO: 160. 


2566 


99 


1354 


AAM93665 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3544. 


498 


40 


1354 


AAU29109 


Homo sapiens 


Human PRO polypeptide sequence 
#86. 


498 


40 


1354 


AAU27785 


Homo sapiens 


Human full-length polypeptide 
sequence #110. 


498 


40 


1355 


AAB87570 


Homo sapiens 


Human PR01268. 


603 


100 


1355 


A A T TTA 1 TO 

AAU29173 


Homo sapiens 


Human PRO polypeptide sequence 
#150. 


603 


100 


1355 


AAY78808 


Homo sapiens 


Hydrophobic domain containing 
protein clone HP 10537 protein 
sequence. 


603 


100 


1356 


gi2 1518639 


Homo sapiens 


TSLCl-like 2 


1991 


97 


1356 


gil9068139 


Mus museums 


membrane glycoprotein 


1970 


96 


1356 


AAM78418 


Homo sapiens 


Human protein SEQ ID NO 1080. 


1905 


97 
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TTJ4, in 

Hit ID 


Species 


tiescnpuon 


a cure 


/o 

lueiiu ty 


1357 


AAY49573 


Homo sapiens 


Human CLA-1 protein sequence. 


2503 


94 


1357 


gi397607 


Homo sapiens 








1357 


ABB 120 12 


Homo sapiens 


Human oK-ril class jd scavenger 
nomoiogue, u>iiv^ id rskj.zjoz. 




i?4 


1358 


gi854065 


Human 
herpesvirus 6 


U88 


340 


31 


1 o c o 


gl2192o439 


Homo sapiens 


seven xransmemorane nenx recepior 


9QQ 




135b 


A AB 95 124 


Homo sapiens 


Human protein sequence LMiv<? lu 


977 




1359 


AAE05302 


Homo sapiens 


Human TANGO 457 protein. 


1518 


96 


1359 


AAb05303 


Homo sapiens 


Human mature r ain i proxem. 






1359 


AAE05305 


Homo sapiens 


Human TANGO 457 protein 
cytoplasmic domain . 


1260 


100 


1360 


gi20799661 


Mus musculus 


mucolipin-2 


2020 


76 


1360 


gi209S7535 


Mus musculus 


RIKEN cDNA 33000U2U04 gene 


ZU1 / 


Id 


1360 


gi 19072754 


Homo sapiens 


mucolipin-3 


1406 


53 


1361 


AAE04122 


Homo sapiens 


Human gene 23 encoded secreted 
protem HboUK/3, oxsi^ ID jnu. lUo. 


210 


61 


1361 


A A "f 1 f\ A 1 /"A 

AAE04169 


TT _ • 

Homo sapiens 


Human gene encoaecl secreted 
protein HE80K73, SEQ ID NO: 158. 


9m 


ou 


1361 


AAG00392 


Homo sapiens 


Human secreted protein, oiii^ 1JJ 

JNIJ. 44 /J. 


1 1Z 


4J 


1362 


A A VT70C5 

AAY27853 


Homo sapiens 


Human secreted protein encooeo oy 
gene No. 101. 


974 


04 


1362 


gi2 17405 60 


Oryza sativa 


UoJiNrSaUUojVjrlO. iu 




47 


1362 


gi403081 


Culex pipiens 


reverse transcriptase 


66 


32 


1363 


AAM39421 


Homo sapiens 


Human polypeptide bri^i 11J JNU 
2566. 


477^ 
4 / / J 


oo 


1363 


gi 170 16967 


TT " ^ 

Homo sapiens 


jnuancb 


4/ /D 


oO 


1363 


gil7861384 


Homo sapiens 


nesprin-2 gamma 


4 / / J 


oO 


1364 


AAB37381 


Homo sapiens 


Human secreted protein wiv encoded 

Dy CD IN A if 


^47 
04 / 




1 O C A 

1364 


A A "D OCA 

AAB9D8D4 


Homo sapiens 


Human protein sequence oiiy il> 
invj. icy iz. 


70 


^7 


13od 


a DDon i cn 


Homo sapiens 


tiuman polypeptide ocy llj inu 

Zjjj. 








AAWj£yoj4 


riomo sapiens 


T-Tiimon oc»f»rpfpr1 WAfpin \ /T /t 1 

in Hindu becLeicu. plulclii i-zivit-uu i« 




53 


1365 


gi22328096 


Homo sapiens 


LOC 145053 


136 


48 


1366 


AAU76873 


Homo sapiens 


Human CRF-like protein LP231. 


1021 


91 


1366 


AB 13:5.3290 


Homo sapiens 


Human polypeptide ??ju. 


yjj 


0J 


1366 


gil0566471 


Mus musculus 


Cjliacolin 


QAA 
i744 


DO 


1367 


gi2585988 


Homo sapiens 


ribonuclease k6 precursor 


700 


87 


1367 


gil 8088595 


T T * 

Homo sapiens 


ribonuclease, RNase A family, k6 


/uu 


O / 


1367 


gi2745750 


Pan troglodytes 


ribonuclease k6 precursor 


695 


86 


1368 


AAE09651 


Homo sapiens 


Human gene 13 encoded lipid 
metabolism protein HTJNI73, SEQ 
ID NO:45. 


/I O /l 

484 


AO 

98 


1300 




riomo sapiens 


xiuman lainoaa crystanin. 


400 


07 


1368 


ABB89956 


Homo sapiens 


Human polypeptide SEQ ID NO 
2332. 


381 


98 


1369 


AAM25241 


Homo sapiens 


Human protein sequence SEQ ID 
NO:756. 


484 


95 


1369 


AAE09651 


Homo sapiens 


Human gene 13 encoded lipid 
metabolism protein HTJNI73, SEQ 
ID NO:45. 


352 


100 
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266 



SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


% 
Identity 


1369 


A A f A 1 C ST 

AAG64355 


TT — . • 

Homo sapiens 


. _ 

Human lambda crystallm. 




no 


1371 


gi2 1685525 


Homo sapiens 


similar to chloride channel protein 
oWrCICH TORMA 


4250 


94 


1371 


gi6653659 


Oryctolagus 
cuniculus 


chloride channel CLC-6 


4155 


91 


1371 


ABB 11826 


Homo sapiens 


Human CI channel homologue, SEQ 
IDNO.2196. 


4100 


91 


1372 


gi21928599 


T T * 

Homo sapiens 


seven transmembrane helix receptor 


749 


1 00 


1372 


AAG71967 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1648. 


725 


97 


1372 


AAG71962 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1643. 


714 


95 


1373 


ABG45325 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
34990. 


278 


100 


1373 


AAM35993 


T T * 

Homo sapiens 


Peptide #10030 encoded by probe for 
measuring placental gene expression. 


278 


1 r\r\ 

100 


1373 


AAM20725 


TT „ „ • 

Homo sapiens 


Peptide #7159 encoded by probe tor 
measuring cervical gene expression. 


27o 


1 Art 

1UU 


1374 


A ATT11 AH 1 

AAU 12071 


Homo sapiens 


Human PHT1 variant protein from 
Caco-2 cells. 




LUU 


1374 


A A TI1 1 f\^7i~i 

AAU 12070 


Homo sapiens 


Human PHT1 variant protein from 
BeWo cells. 






1374 


AAU 12069 


tt : — 

Homo sapiens 


Human PHT1 protein splice variant. 


9zU 


1 AA 


1375 


ABB77396 


Homo sapiens 


Human cathepsin L. 


1597 


87 


1375 


A A 11 7"/l 1AO 1 

AAW47031 


Homo sapiens 


Human procathepsin L. 


1597 


87 


1375 


gi29715 


TT „ ■ 

Homo sapiens 


pro-(cathepsin L) 




O / 


1376 


AAU 11 764 


Homo sapiens 


Human alpha la adrenergic receptor- 
like GPCR. 


2554 


98 


1376 


AAG64126 


Homo sapiens 


Human G protein-coupled receptor 
GPRv72, 


2554 


98 


1376 


A i TTAA1/"f\ 

AAU 043 69 


T T * 

Homo sapiens 


Human G-protein coupled receptor, 
hRUP15. 


2554 


98 


1377 


gil 8676524 


TT ■ _ 

Homo sapiens 


FLJ00 159 protein 


154 


52 


1377 


gi2 13 92066 


Drosophila 
melanogaster 


RE04357p 


139 


34 


1377 


AAB94312 


Homo sapiens 


Human protein sequence SEQ ID 

*K7f~\. 1 A TOO 

NO:147o3. 


82 


38 


1378 


ABB08011 


Homo sapiens 


Human secretin receptor-like GPCR. 


2759 


100 


1378 


AAU79494 


T T 

Homo sapiens 


TT y*l j * 1 J j. . ^ 

Human G protem-coupled receptor 
TGR18-3. 


2582 


97 


1378 


A A T" 1 1 D ^ C f\ 

AAE 18652 


Homo sapiens 


Human G-protein coupled receptor 

(C_rCKrsC-13). 




96 


1379 


AAY30735 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


280 


100 


1379 


AAY40092 


Homo sapiens 


Peptide sequence derived from a 
human secreted protein. 


55 


100 


13 /y 


gll/D 11709 


riomo sapiens 


aown-reguiatea oy LtnnDi, a 


oz> 


ZD 


1380 


gi2459682 


Homo sapiens 


MAGE-B4 


883 


54 


1380 


gi3687196 


Homo sapiens 


MAGE-B1 


864 


54 


1380 


gi4033512 


Homo sapiens 


DAM10=DSS-AHC critical interval 
MAGE superfamily protein 


863 


54 


1381 


AAE21804 


Homo sapiens 


Human TREK2 protein. 


2791 


100 


1381 


gil 97 16290 


Homo sapiens 


potassium channel TREK2 splice 
variant b 


2784 


99 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 


% 


1381 


AA014193 


Homo sapiens 


Human transporter and ion channel 


2747 


98 


1382 


ABB 11297 


Homo sapiens 


Human Coxsackie adenovirus 
receptor nomoiogue, oxivc J-U 
NO: 1667. 


699 


97 


1382 


gi 14279421 


Damo reno 


coxsacKie virus ana adenovirus 
recepior-iiKe protein 






1382 


ABB 84956 


Homo sapiens 


Human PR05723 protein sequence 

olil^ Li-J l\U.zoU. 


355 


33 


1383 


AAU 99321 


Homo sapiens 


Human protooncogene ivvj- 1 5? 
protein. 




L\J\J 


1383 


gilo4ooo0o 


Homo sapiens 


cervical cancer i proio-ont/Ogenc- 
Dinomg proiem ivu 1 y 




luU 


1383 


gi21961229 


Homo sapiens 


BRI3 binding protein 


753 


100 


1 1 O A 

1384 


AAA /Tfi/CQ/C<£ 

AAMUoooo 


Homo sapiens 


Human ioeidi protein, ojc-v^ ilj l\kj. 
1074. 


11^1 




1 O O A 

1384 


gl 151)99951 


K J i i « -aaa ~» f-t x"» ■» till j— ■ 

JVlus muscuius 


uiacyigiyceroi acyitransier«ise & 




w» I. 


1384 


ABB/5o7/ 


Homo sapiens 


ureas t protem-euKaryoxiu t/onocrvcu 




so 




AAUOOjjj 


liOTTio Scipiens 


Human PW? nrntein SFO TD NO*4 


1354 


99 


1385 


gil9525540 


Homo sapiens 


lymphocyte effector toxicity 

d-OLl VdLlL/11 llgdllU 


1330 


98 


lOOf 

I Jo J 


AAYjOU / 1 


nomo sapiens 


T7vf<=*Tirlf»r^ In iim an d^^rffpH TYrofFMn 
JZ/ALCI1U.CU. IlUIIlCtll oCWCLCU piv^ut-ni 

sequence, SEQ ID NO. 456. 


1252 


92 


1 tlQ^ 

looo 


A A Y y^DZ / 


nomo sapiens 


li Lllllu.il oLclLIlCIlll JJUJtClll. 


260 


83 


1386 


gi338611 


Homo sapiens 


statherin precurser 


260 


83 


1 O O ZT 

1386 


gl33o50o 


Homo sapiens 


starnenn protein 




OJ 


1387 


AAB94721 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15739. 


1011 


100 


1387 


gi 1738425 6 


Homo sapiens 


mucin j 


i n^ 


9S 


1387 


gil334899 


Human 
herpesvirus 4 


r>iJL,r J late reaaing rrame yxiNyvi/o 




9S 


1388 


ABP41842 


Homo sapiens 


Human ovarian antigen nirivi.Lob', 
SEQ ID NO:2974. 


01 R 




1 o o o 

1388 


AAB73302 


Homo sapiens 


Human cyciopmiin r> {y,ypt>) L.- 
terminal deletion mutant, CypB- 

A.T ATTTJ 
Al/VlSJi. 


y i o 




1388 


AAB73301 


Homo sapiens 


Human cyclophilin B (CypB). 


918 


98 


1389 


AAEU71 12 


Homo sapiens 


Human gene o encouea secretea 
protein rragment, oiiv^ iu invj. izy. 


ZH- / U 


0Q 


1389 


AAM93449 


Homo sapiens 


Human polypeptide, SEQ ID NO: 

JU70. 


2378 


99 


1 o on 

1 Joy 


A A AyTO"! Oil 


Homo sapiens 


ouiiidii puiypeptiue, ±u i\u. 
3881. 




99 


13VU 


gl lODoyUDO 


Homo sapiens 


type IX gUilaUULIUpin-iClt/Ct£>lllg 

hormone receptor 


i j / \j 




1390 


giiyoy/oyo 


Homo sapiens 


onivH receptor 11 j i ivi 


1 1AS 


yo 


1 ^QO 

1 J2?U 


A ATT10R1Q 


r±<Jinu oapiciio 


Human Tvoe II GnRH-R snlice 
variant 1 protein #2. 


1363 


93 


1391 


gil6359249 


Mus muscuius 


RIKEN cDNA 1300010M03 gene 


2226 


91 


1391 


AAM93450 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3100. 


587 


34 


1391 


ABB89832 


Homo sapiens 


Human polypeptide SEQ ID NO 
2208. 


522 


39 


1392 


AAE04896 


Homo sapiens 


Human transporter and ion channel-9 


825 


100 
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SEQ 
ID 


Hit ID 


species 




c 

score 


/o 

Identity 








^ i K-iori-y j protein. 






1392 


AAE04899 


Homo sapiens 


Human transporter and ion channel- 

17 /"TPTfT-T 1 ON nrntpin 

iz t -i xviori- 1 proLcin. 


688 


52 


1392 


AAE06616 


Homo sapiens 


Human protein having hydrophobic 

QOniain, JulJr 1U / jj . 


688 


52 


1393 


gi21928487 


Homo sapiens 


seven transmembrane helix receptor 


1558 


100 


1393 


AAG72603 


Homo sapiens 


Human UK-nice poiypeptiae query 
sequence, oni^ ru r>u. 


1 All 




1393 


AAG71515 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1 196. 


1433 


89 


1394 


ABB 89241 


Homo sapiens 


rluman poiypeptiae iu jno 
1617. 




D / 


1394 


AAB08894 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 4 SEQ ID NO:5 1. 


1 


jy 


i on/I 

1394 


gi210701SO 


Danio rerio 


envelope protein 


1 1 1 
ill 


ZlS 


1395 


gil5157307 


Agrobacter i urn 
tumefaciens str. 
Cdo (Cereonj 


AGR_C_3928p 


85 


21 


1395 


gi9295309 


frog adenovirus 
1 


pllla protein 


82 


41 


1395 


ABB08204 


Homo sapiens 


Human lipid metabolism enzyme-4 

/T \/fT7 /I^ 

(J_,iVlri-4J. 


76 


27 


1397 


AAG65914 


Homo sapiens 


Amino acid sequence of GSK gene 
la Z / I4Z. 


5760 


97 


1397 


gil9570398 


Homo sapiens 


hDDM36 


5760 


97 


1397 


A AT TT7/tAf 

AAU77405 


Homo sapiens 


rluman jnuv i protein, nomoiogue oi 

XTr^PP/PT FNTf^ Tct nrntpin e 


do iy 




1398 


gi 15292481 


Drosophila 
melanogaster 


SD03655p 


1165 


59 


1398 


A A TTi O O O TO 

AABS8372 


Homo sapiens 


Human memorane or secretory 
protein clone PSEC0108. 


9521 


O / 


1399 


ABB07553 


Homo sapiens 


1 r-(_ AD rragm e ut determmea 
from I.M.A.G.E clone 52071. 


01 

y i 


70 


i i r\ r\ 

1399 


ABB07552 


Homo sapiens 


rluman rSo ir-LvAU poiypepxiae. 


y i 


7Q 


1399 


gi2 1959343 


Yersinia pestis 
KIM 


basal-body MS (membrane and 
supramembrane j-ring ana couar 
protein 


69 


35 


1401 


a a d nc 1 O /I 


Homo sapiens 


Human protein sequence oxsi^i iu 
NO:17122. 


1 AS 




i a rv 1 

1401 


'OCA n/rc 

gi854065 


Human 
herpesvirus 6 


T TO C? 






1401 


AAO09309 


Homo sapiens 


Human polypeptide bxiv^ iu rN<J 
23201. 


1 in 
1j / 


7^ 


1402 


AAE01332 


Homo sapiens 


Human gene lo encoded secreted 
protein fragment, SEQ ID NO: 197. 




1UU 


1402 


AAE01299 


Homo sapiens 

i 


Human gene 18 encoded secreted 
protein HFIIN69, SEQ ID NO: 162. 




1UO 


1 A07 


jc\i\i-i\J 1 ZH-y 


riomo Sapicnb 


Truman crfTif* 1 R f^r\rr^r\p*f\ cf*c % VF i i'f*n 

protein HFIIN69, SEQ ID NO:l 1 1. 


222 


100 


1403 


AAM06589 


Homo sapiens 


Human foetal protein, SEQ ID NO: 
320. 


237 


100 


1404 


AAU10510 


Homo sapiens 


Human leukocyte immunoglobulin 
receptor-like (LIR-Iike) protein #8. 


1411 


100 


1404 


AAU10511 


Homo sapiens 


Human leukocyte immunoglobulin 
receptor-like (LIR-like) protein #11. 


595 


89 
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SEQ 
i n 

AJLF 


Hit ID 


Species 


Description 


S 

score 


% 
iu entity 


1404 


AAO09486 


Homo sapiens 


Human polypeptide SEQ ID NO 

Zjj /o. 


486 


98 


1406 


gi20380511 


Mus musculus 


RIKEN cDNA 3110007F17 gene 


293 


33 


1406 


gil3377867 


Gallus gallus 


claudin-3 


107 


24 


1 Ar\tz 


gllDDDii /j 


Danio reno 


claudm n 


106 


23 


1407 


ABB90122 


Homo sapiens 


Human polypeptide SEQ ID NO 
24yo. 


1029 


84 


1407 


AAY32204 


Homo sapiens 


Human receptor molecule (REC) 
encodea by mcyte clone 2132179. 


1029 


84 


1407 


AAY78801 


Homo sapiens 


Hydrophobic domain containing 
protein clone HP0063 1 amino acid 
sequence. 


1029 


84 


14Uo 


gll Jw>43y4U 


Homo sapiens 


•Similar to KLKJiJN cJJJNA 
zoiuui /ouy gene 


222y 


96 


i aar 


AAd 1Z i jo 


riomo sapiens 


riyaropnoDic domain protein isoiatea 
from HT-1080 cells. 


22 lo 


yo 




A A RQCAAX 
AAdoo4O0 


riomo sapiens 


riuman memorane or secretory 
proiem cione x oiiuuzou. 


2212 




1 40Q 




riomo Sapiens 


Human «r»lt7t-*<=t-»+-f rli=» CRA JV\ TsJO 

riuman poiypepnue ocy iu inw 
1788. 


D lOO 


yy 


1 ACiQ 


A ATTS1 0^1 


riomo sapiens 


riuman r xvvjy^-f j> . 


9/1 vt 1 
Zh-4 1 


yo 


1 40Q 


A AT T777QA 


nomo sapiens 


Uiimon PPAO/tl r-\f-/^+c»ir-» 

riuman x rv^jy^f j proiem. 




yo 


1H IU 




riomo sapiens 


riuman lnxerieuKin zz receptor ^ii--- 
22R) protein sequence. 




1 AA 
1UU 


H IU 




riomo sapiens 


numan Zi-cj/ror-n protein. 


1 /CQQ 


1 An 
1UU 


1 A1 A 


A A TiS7AA7 


riomo sapiens 


riuman rKUZUzo3. 




1 AA 

1UU 


1411 


a A non^o 


riomo sapiens 


rrotein ot tne numan 1 r M-z gene 
sequence. 


inn 

ill 


1 AA 
1UU 


1411 


AAE21184 


Homo sapiens 


Human TRICH-2S protein. 


111 


100 


i/iii 

It-I 1 


cri 1 8^40047 
gl 1 oO^tUU^f / 


riomo sapiens 


aromatic ammo acid transporter 


inn 
ill 


1 AA 
1UU 


IH 1Z 


a Atji loan 


riomo sapiens 


riuman transporters ana ion cnanneis 
(TRICHV7. 


1117 


7Q 
lo 


1 J.1 9 


A AT347971 


riorno sapiens 


llW/\ 1 1 . 


£97 
OZ / 


Hi 


1412 


AAY44278 


Homo sapiens 


Human organic anion transporter. 


627 


41 


i/iii 

14 1 D 


A A "\yl70077 

/v/\ivi /yy / / 


riomo sapiens 


Human protein blil^ 1JJ iNU 3oZJ. 


1 AA 
1UU 


j 1 


1J.1 3 


m*97Aid^A7 
glZ /04DU / 


i^ocusta 
migratoria 


nicotinic acetyicnoline receptor, 
alpha 1 subunit 


1 AA 
1UU 


DO 


1 A1 1 
1413 


rri 1 7AAA77 

gll /oou / / 


vjaiius gaiius 


wmgea nenx protein LWn-J 


1 AA 
1UU 


51 


1414 


ABP41590 


Homo sapiens 


Human ovarian antigen HAZAR95, 
oliv^ 1JJ JNO:2/zZ. 


1522 


100 


1414 


AAB43682 


Homo sapiens 


Human cancer associated protein 
sequence oiiv.j iu JNU: 1 12 /. 


1522 


100 


1414 


gi5931730 


Homo sapiens 


phosphate carrier 


1522 


100 


14iD 


• -I <n/ZA A1 <T 

gll /04U15 


Ciona 
mtestinaiis 




314 


A O 

42 


1415 


AAM42167 


Homo sapiens 


Human polypeptide SEQ ID NO 
7098. 


272 


30 




A APn^4S4 
/\J\£}UJ)4o4 


riomo sapiens 


numan gene iz encoded secreted 
protein HETHW90, SEQ ID NO: 167. 


9^A 
ZDU 


9Q 

2y 


1416 


AAB82315 


Homo sapiens 


Human immunoglobulin receptor 
isoform IRTA2c. 


1130 


88 


1416 


gil 5277746 


Homo sapiens 


Fc receptor-like protein 5 


1130 


88 


1416 


gil3591714 


Homo sapiens 


immunoglobulin superfamily 
receptor translocation associated 
protein 2c 


1130 


88 
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SEQ 
TD 


Hit ID 

i 


Species 


Description 


S 

score 


% 
identity 


1417 


AAY27182 

x\__/ v. x / x uZv 


T-Trvm r\ csiTvipric 


ri.uiiia.il XVLNJ7 J puiypcpuuc 


o / / 


on 


1417 


AAW69601 


T-TotTi t~\ c fa I'M pnc 
1IU1J1U oajJldlo 


lxUllld.Il ZJIIC UlllUlllg, piUldll ZjJj-Z,. 


0 / / 


on 
yu 


1417 


gi 13366064 


Homo sapiens 


HsRmal 


877 


90 


141 R 


A AF1 74Q3 


nuino t>d.pi ens 


riuman secretion ana Train citing 
protein-2 (SAT-2). 


1440 


1 A/i 

1UU 


141 8 






riuman xviiisugummzy-iiKe protein 

IN \J VJd, 


1 ^C/l 
1304 


y / 


1418 


AAU96848 


Homo sapiens 


Human NOV3a protein variant. 


1378 


97 


141 Q 




jvius museums 


ciauuin 14 


I 371 


40 


1419 


AAU77764 


Homo sapiens 


Tumour associated antigenic target 
poJ3^peptiae (lAlj Ijj. 


370 


40 


1419 


AAU29200 


Homo sapiens 


Human PRO polypeptide sequence 

#1 HI 

ifl 1 /, 


370 


40 






nomo sapiens 


riuman rro peptiae ff3y. 


! /Cm 


o7 


1490 


cri9 1 9fKRA£ 
glx. 1 ZUJOOO 


nomo sapiens 


i-ceii activation leucine repeat-ricn 
protein; TA-LRRP 


69/ 


87 


1490 


A AF171 ^9 


riuiiiu odpienb 


numan aoenyiyi ana guan^'iyi 
cyclase (ADGUQ-4. 


DUD 


o4 


1471 


A ATT7S0Q1 


rxunio &dpiens 


riuman nonerytnroia rvn glycoprotein 
RhBG protein sequence. 


toy i 


no 
OO 


1491 


cri 1 S71 52471 
gl 1 J / 1 o*t / 1 


riomo sapiens 


Kn type r> glycoprotein 


lsyi 


o o 


1421 


gil4346006 


Pan troglodytes 


Rh type B glycoprotein 


1886 


88 


1 499 




riomo sapiens 


Human bone marrow protein, SEQ 


215 


46 


1422 


gil4209836 


Mus musculus 


ATP-binding cassette transporter 
suo-iamiiy a memoer / 


175 


40 


1422 


AAO14210 


Homo sapiens 


Human transporter and ion channel 


174 


40 


1423 


ABB84987 


Homo sapiens 


Human PRO6006 protein sequence , 


718 


99 






nonio sapiens 


numan rro peptide ffju. | 


*71 O 


yy 


1423 


ABB95593 


Homo sapiens 


Human angiogenesis related protein 1 


718 


99 


1424 


gi 18539467 


Homo sapiens 


nucleolar RNA-associated protein 1 
beta | 


3851 


98 


1424 


gi 18539465 


Homo sapiens 


nucleolar RNA-associated protein 
alpha j 


3851 


98 


1494 


0x90088377 


nomo sapiens 


nucleolar jvln A-associatea protein 


Jo2j 


98 


1425 


AAE 15635 


Homo sapiens 


Human G-protein coupled receptor-5 
{^ouKiiC-j j protein. 


499 


92 


1425 


AAB66272 


Homo sapiens 


Human TANGO 378 SEQ ID NO: 

OQ 

Zy. j 


499 


92 


1 49 S 


A A F 03406 


riomo sapiens 


riuman secretea protein tragment, 

cda in XTO-£4 

odI^ 1U 1NLJ.UH. | 


499 


92 


1426 


gi 18676704 


Homo sapiens 


FLJ0025 1 protein 


4114 


99 


1 496 




i npneustes 
gratilla 


aynem neavy cnam isotype 5A i 


406 


24 


1426 


gi6706264 


Leishmania 
major 


dvnBin heavv chain 1 






1427 


AAE23980 


Homo sapiens 


Human LP220 secreted protein. | 


2964 


99 


1427 


AAB70072 


Homo sapiens 


Human secreted protein #11. | 


2475 


99 


1427 


AAE 17484 


Homo sapiens 


Human leucine-rich repeat- 8 1 
(ZLRR8) protein #2. 


2407 


100 


1428 


ABB97996 


Homo sapiens 


Human severing stimulating factor 
Kda subunit 86.35. 


415 


39 
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ID 


Hi* rn 

till LU 




ues crip tion 


c 

score 


0/ 

/o 

Identity 




cri9 1 A1 1 /I 
glZ lOl OJ 1H- 


nomo adpicnb 


oimiiar to oxiuation resistance i 


A1 *\ 
413 




1428 


gi 13540300 


Mus musculus 


nucleolar protein C7B 


408 


34 


i4zy 




nomo sapiens 


numan poiypeptiae ibri^ lu jn\J 
2758. 


DJZ 


03 


1 AOQ 

i4zy 


A T5D/i 1 OCC 

Aor4i /oj 


nomo sapiens 


Human ovarian antigen rlLrLo4U, 

oJl^ 1-L' rNVJ.zyi /. 




oi 


147Q 


AAlVi'+U.j y 1 


riorrio Sapiens 


numan poi5'peptiue orii^ iu rsu 
3536. 


Id 1 


OZ 




A AVn77<1 
/V A ill / i J 1 


nomo sapiens 


numan secretea protein fragment 
encoded from gene 8. 




1UU 


14JU 


m '1 1/lOCQQ 
gll 14ZDOO 


i rypanosoma 
brucei 




/O 


4o 




glJUJ / vlO 


doqo sanans 


iN/\iyn oenyarogenase suoumt d 


/O 




1431 


ABB04471 


Homo sapiens 


Human endoprotease 3 1 . 


940 


100 


1 /IT 1 

14 J 1 




Homo sapiens 


Human polypeptide, k-bv,) 1U JNU: 
3259. 


243 


36 




Ar>r>oyo44 


rlomo sapiens 


numan poiypeptiae oiiv^ il) jn(J 
1720. 


Tin 

ziy 




1 /I OO 
14JZ 




Homo sapiens 


Human membrane or secretory 
protein clone PSEC0131. 


306 


43 


14.3Z 


glZUo !U4y3 


rlomo sapiens 


oimilar to KlKJiiN cUiN A 
2810417M05 gene 


O A CZ 

30o 


43 


1 /t "20 

14JZ 


A A "DO 1 ooo 

AAiSZlZ /Z 


Homo sapiens 


Human gene 16 encoded secreted 
protein fragment, SEQ ID NO: 138. 


OAT 

207 


55 


1 All 
14jj 


A A XJCQQOC 

AAdoojocS 


rlomo sapiens 


numan memorane or secretory 
protein clone PSEC013L 


3Uo 


/to 
4z 


14:5.3 


<-riOr\0 1 f\A (hi 

gizuo iu4yj 


Homo sapiens 


oimilar to KIKiiiN cJDNA 
2810417M05 gene 


t r\ o 


>i 0 
42 


1 All 

143:5 


A A T70 1 0*70 

AAb2 1 z/z 


Homo sapiens 


Human gene 16 encoded secreted 
protein iragment, oiii^ id jnu:1jo. 


209 


54 


1434 


ABP43139 


Homo sapiens 


Human ovarian antigen HWBT60, 

oHV^ ID i\\J.HrZ/ 1. 


235 


77 




A RPA9 2 1 ^ 
At>.r4Zo 13 


nomo sapiens 


numan ovarian antigen nrOUijyD, 


ZJD 


/ / 


1434 


ABP42409 


Homo sapiens 


Human ovarian antigen HOCOC38, 


235 


77 


1 tJJ 


oil 0171 16? 
gl 1 1 / 1 1 oz 


nomo sapiens 


ventrnoiu iransmeniurane protein 


1 

1o/j 


QQ 


14^ 


A AF1 4^71 


nonio bdpienb 


riuindn rnoiiiDOiLi reiciiea proxeiii 5 

RRP3. 


1 SAO 


QQ 

yy 




oil 01 71 1 60 
gll ~1 / i ion 


iviub m us cuius 


venLrnoiu ird.nsrneriiDr<iiie proicm 


1 son 


Q-4 

y4 


1436 


gi7106866 


Homo sapiens 


HSPC238 


237 


56 




A A A/fGOfkCO 
1VJL yZVD Z 


nomo sapiens 


numan Digestive system antigen 
SEQ ID NO: 1401. 


ZJD 


*70 

/Z 


14.50 


gll334Z /KJ 1 


mus muscuius 


Kiis±sN cjjjna 200(^01^/1^14 gene 


1 QC 

iyo 


4/ 


i A'ln 
L*+j / 


A A T71 CO 1 9 
AAlil OZ1Z 


nomo sapiens 


numan a / ilji^4 protein. 


014U 


yy 


14.3 1 


A A r^soo'* 
AAUrOoZy D 


nomo sapiens 


numan semapnonn vj-UKe jnhx 
protein SEQ ID NO: 10. 


A. 1 "3/1 

01 j4 


QQ 

yy 


1437 


AA068294 

-T\. JV VJ YJ O 4->.S 1 


"HVvm r\ conifluc 
Xx(JIIl(J actpiCIlo 


T— fiiTncin c pm n t~w\ ~v\ (~\-\ \ \/~c* "MT-TP 
XlLlIIlctll tSdllciJJIKJilll Vj~11JvC 

protein SEQ ID NO: 12. 






1438 


AAU95381 


Homo sapiens 


Human calcium transport protein 
CaTrF2Ell. 


4607 


99 


1438 


AAY96479 


Homo sapiens 


Human vanilloid receptor 3. 


4607 


99 


1438 


gi2 1690507 


unidentified 


CaTrF2Ell 


4607 


99 


1439 


gi 11055322 


Homo sapiens 


vanilloid receptor-related osmotically 
activated channel 


3324 


100 
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Table 2B 
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1 

SEQ 
ID 


TI*i, TT\ 

Hit ID 


Species 


Ues crip tion 




0/ 

/o 

Tdf^ntirv 


1439 


AAU74935 


Homo sapiens 


Amino acid sequence of human 
vamiioiu receptor- iiice proiem ^.d. 
(VRL~2a). 


3318 


99 


1439 


A PD7Q1 01 


riomo sapiens 


"Human VP4 i^rr»tp>in QFH TD 7\TfY9 

nuniaii vi\. 4 r proieixi oj-jV<j ij^ inu.z. 




99 


1440 


gil4574118 


Caenorhabditis 
elegans 


C. elegans DPY-19 protein 
^corresponomg sequence rzzr> / . 10^ 


76 


24 


1440 


AAU83621 


Homo sapiens 


Human PRO protein, Seq ID No 60. 


74 


24 


1 yl A f~\ 

1440 


AAO05826 


Homo sapiens 


rluman polypeptide bxii^i id inu 
19718. 


f\j 


zo 


1 A A 1 

1441 


gi4235228 


Mus musculus 


leucine zipper-tir-nana containing 
transmembrane protein 1 


AO! 

Hz? / 




1 A A 1 

1441 


gil 5680275 


Homo sapiens 


bimiiar to leucine zipper-iir-nanu 
containing transmembrane protein 1 


AQA 


00 


•t A A 1 

1441 


gi 182045 89 


Jtiomo sapiens 


leucine zipper- E/Jr-nanu coniaining 
transmembrane protein 1 






1442 


AAbl /4oZ 


riomo sapiens 


riuman leucme-ncn repeat- / 
(ZLRR7) protein. 


1 1 07 


L\J\J 


1442 


ABB 11242 


riomo sapiens 


riuman oi^ii-z nomoiogue, csrit^i iu 
NO:1612. 


UJJ 


QQ 


1 yl yl O 

1442 


1 AAB07469 


Homo sapiens 


A numan leucme-ncn repeat protein 
designated Zlrr3. 






1443 


A A T Ti^T> 1 O 

AAU97218 


Homo sapiens 


riuman Lx protem-coupieo receptor, 
TGR25. 


iO^J 


R7 


1443 


AAE23415 


Homo sapiens 


Human Lr-protem coupled receptor- / 
(GCREC-7). 


lO'f^ 


<27 
0 / 


1 A A A 

1444 


gi 15487341 


Escherichia coli 


macroiide-speciiic Ai5^-type eniux 
carrier 


oy / 


01 


1 A yl yl 

1444 


gl4U624oJ 


rsscnencnia con 


Ar>o transporter prooaDie a i r- 
binding subunit homolog 


Di? / 


Q1 
y 1 


1 yf yf *C 
1443 


gil /9 /oyo3 


Araoidopsis 
thaliana 


At i g^o oy u/ r zj o^^f 


/J 


7? 
zz 


^ A A C 

1445 


gl4239789 


lreponema 
maltophilum 


major sneatn protein 


7zt 


Zj 


1 yl /i £T 

1446 


giz 1426922 


Homo sapiens 


TJT7T T>1 

rbLr 1 






1446 


AAW31185 


Homo sapiens 


Human p 160 polypeptide 160.L 


3176 


87 


1446 


gi3 168604 


Homo sapiens 


proline and glutamic acid rich 
nuclear protein isoiorm 


1641 


99 


1 A A >~t 

1447 


A A I~»rt/f A f\C 

A AB 94495 


. 

Homo sapiens 


riuman protein sequence oxiv xjj 
NO:15188. 


Oo 


jO 


1447 


AAG67254 


Homo sapiens 


Ammo acid sequence oi a numan 
uver-aSoociaLeti gene. 


Oo 


jO 


1447 


gi2 14894 12 


Bombus 

nypocrita 

sapporoensis 


cytochrome oxidase 1 


66 


35 


1 A A © 

144o 


AAr3oU41o 


riomo sapiens 


vjene 7?*+ associdieu pepiiQe tri. 


1770 


100 


1448 


AAB65162 


Homo sapiens 


Human PRO290 (UNQ253) protein 
sequence oii^ ij-j inw.oo. 


1972 


100 






T— T/~vtv^ r\ coniptic 

fioino sapiciib 


A/T^tnhrfinF'-tioii'nH "nrntRin T^T? O'? QO 

J.VXW11KJ1 tint UWU-llVJ. [JX VJH-/1H J. Ivvy//7v, 


1972 


100 


1449 


gtl61 17372 


Macropodid 
herpesvirus 1 


ICP4 


95 


27 


1449 


gi4063766 


Aspergillus 
nidulans 


chitinase 


92 


24 


1449 


gil 5277229 


Homo sapiens 


Homologue to Drosophila 
photoreceptor protein calphotin 


89 


31 


1450 


ABB09760 


Homo sapiens 


Amino acid sequence of human 


1029 


100 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


% 
Identity 








hearing delect related protein iis.iy. 






1450 


AAU27660 


Homo sapiens 


Human protem Arro / iudz. 






1450 


AAG74151 


Homo sapiens 


Human colon cancer antigen protein 

CI T? /~\ T"T\ "KT/~\ ./iniC 

SEQ ID NO:4915. 


748 


100 


1452 


AAE21175 


Homo sapiens 


Human TRICH-19 protein. 


1041 


93 


1452 


gil5559050 


Ethmostigmus 
rubripes 


ADP-ATP translocator 


733 


OO 


1452 


gi7542476 


Xenopus laevis 


adenine nucleotide translocase 


702 


67 


1454 


gi4929597 


Homo sapiens 


CGI-64 protein 


1879 


96 


1454 


gi6995987 


Homo sapiens 


mitochondnal carrier homolog 1 
isoform a 


1818 


99 


1454 


gi6995989 


Homo sapiens 


mitochondrial carrier homolog 1 
isoform b 


1790 


95 


1455 


gil7225210 


Podospora 
anserina 


beta transducm-hke protein HET- 
D2Y 


247 


25 


1455 


gi886024 


Thermomonospo 
ra curvata 


PkwA 


239 


29 


1455 


gil7131893 


Nostoc sp. PCC 
7120 


WD-repeat protein 


238 


26 


1456 


AAB36840 


Homo sapiens 


Human insulin receptor-related 
receptor protein with signal peptide. 


6736 


98 


1456 


ABB 11702 


Homo sapiens 


Human insulin receptor-related 
protein homologue, SEQ ID 
NO:2072. 


6731 


99 


1456 


AAB36836 


Homo sapiens 


Human insulin receptor-related 
receptor protein. 


6731 


99 


1457 


ABB 11803 


Homo sapiens 


Human GPI-122 homologue, SEQ ID 
NO:2173. 


6420 


99 


1457 


AAY50125 


Homo sapiens 


Human glycophosphatidyhnositol- 
anchored protein GPI-122. 


6323 


100 


1457 


AAB94751 


Homo sapiens 


Human protem sequence SEQ ID 
NO: 15805. 


5872 


99 


1458 


AAU00023 


Homo sapiens 


Human activated T-lymphocyte 
associated sequence 2, ATLAS-2. 


3623 


99 


145S 


AAE04546 


Homo sapiens 


Human G-protein coupled receptor-2 
(GCREC-2) protem. 


3325 


89 


1458 


gil9387136 


Homo sapiens 


P\ RIN-containmg APAFl-like 
protein 5 


3316 


89 


1459 


AAE14719 


Homo sapiens 


Human carbohydrate-associated 
protein (CARBAP)-l. 


3132 


99 


1459 


AAB92508 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 10631. 


2742 


100 


1459 


AAB95542 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18155. 


2687 


99 


1460 


gi2960194 


Human T-cell 
lymphotropic 
virus type 2b 


rex 


81 


24 


1460 


gi404041 


Human 1- 
lymphotropic 
virus 2 


rex protein 






1460 


gi348140 


Human T- 
lymphotropic 
virus 2 


rex 


81 


24 


1461 


gil 103953 


Homo sapiens 


MHC class I HLA-B* 1523 


71 


28 


1461 


gi!815616 


Homo sapiens 


HLA-B-1521 


71 


28 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


% 
Identity 


1461 


gi 17467060 


Homo sapiens 


MHC class I 


71 


28 


1462 


gil 1066090 


Homo sapiens 


matrix metalloprotease MMP-27 


1188 


85 


1462 


AAU00469 


Homo sapiens 


Human TANGO 210 protein. 


H85 


85 


1462 


AAU12267 


Homo sapiens 


Human PR05992 polypeptide 
sequence. 


1185 


85 


1463 


gi601948 


Drosophila 
melanogaster 


Inscuteable 


132 


23 


1463 


gi7291194 


Drosophila 
melanogaster 


CG11312-PA 


132 


23 


1463 


gil657962 


Drosophila 
melanogaster 


Nem 


123 


24 


1464 


AAB45378 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 38 SEQ ID 
NO:l30. 


433 


96 


1464 


gi406058 


Mus museums 


protein kinase 


216 


53 


1464 


AAE 16277 


Homo sapiens 


Human kinase PKIN-23 protein. 


213 


53 


1465 


AAG00866 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4947. 


123 


C A 

54 


1466 


AAY97293 


Homo sapiens 


Lipid associated protein (LIPAP) 
3335404CDL 


2178 


68 


1466 


AAM39997 


Homo sapiens 


Human polypeptide SEQ ID NO 
3142. 


1214 


43 


1466 


AAB24231 


Homo sapiens 


Human vesicle associated protein 10 
SEQ ID NO: 10. 


1214 


43 


1467 


ABP41928 


Homo sapiens 


Human ovarian antigen H6EEO05, 
SEQ ID NO: 3 060. 


922 


96 


1467 


AAB94535 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15273. 


922 


96 


1467 


AAM79530 


Homo sapiens 


Human protein SEQ ID NO 3176. 


922 


96 


1468 


AAB94336 


Homo sapiens 


Human protein sequence SEQ ID 
NO:14836. 


694 


11 


1468 


AAB97250 


Homo sapiens 


HOMO RNA cyclase 41 protein. 


694 


77 


1468 


gil 2654401 


Homo sapiens 


Similar to RNA cyclase homoiog 


694 


77 


1469 


ABB05645 


Homo sapiens 


Human thyroglobuhn 38 protein SEQ 
ID NO: 2. 


273 


98 


1469 


AAM89857 


Homo sapiens 


Human immune/haematopoietic 
antigen SEQ ID NO:17450. 


81 


43 


1470 


AAO04234 


Homo sapiens 


Human polypeptide SEQ ID NO 
18126. 


185 


76 


1470 


AAM92673 


Homo sapiens 


Human digestive system antigen 
SEQ ID NO: 2022. 


184 


97 


1470 


gi65265 


Xenopus laevis 


a xenopus upstream binding factor 


130 


30 


1471 


ABB12219 


Homo sapiens 


Human secreted protein homologue, 
SEQ ID NO:2589. 


182 


97 


1471 


ABG60080 


Homo sapiens 


Human DITHP polypeptide #138. 


153 


82 


1471 


ABB90307 


Homo sapiens 


Human polypeptide SEQ ID NO 
2683. 


148 


96 


1472 


A AO 12550 


Homo sapiens 


riuman poiypepuoe oHQ 1JJ jnjj 
26442. 


ZjJ 




1472 


AAG03600 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7681. 


245 


86 


1472 


gi7770239 


Homo sapiens 


PR02831 


136 


71 


1473 


AAW49698 


Homo sapiens 


Human Notch3 protein. 


99 


55 


1473 


gi3065951 


Homo sapiens 


Notch3 


99 


55 


1473 


AAR67242 


Homo sapiens 


Huaman glial cell growth factor 


96 


42 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


% 
Identity 








HClCgUllH oCglllCllL l-> . 






1474 


gi22002433 


Homo sapiens 


pl50 target of rapamycin (TOR)- 
scaffold protein containing WD- 
repeats 


323 


54 


1474 


gi2iy/y4Do 


riomo sapiens 


raptor 




54 


1474 


gi22002435 


Mus musculus 


pi 50 target of rapamycin (TOR)- 
scaiioici protein conuumiig vv u~ 
repeats 


318 


53 


1475 


AAB38280 


Homo sapiens 


Human secreted protein sequence 
cncoucu oy gene ^i/ or-yv^ xiv 


289 


96 


IH- /O 


gl / 10U7/J 


riUlTlU sapient? 


VISING nrnt pin 


2254 


87 


1476 


AAM40410 


Homo sapiens 


Human polypeptide SEQ ID NO 

^ • 


2249 


87 


1 Alf\ 
L4/0 


cnfxl 09QQ6 
glO I UZ7!7 0 




V Alllil J 


1857 


71 


1477 


gil6877231 


Homo sapiens 


Similar to RIKEN cDNA 

97000 10D07 gene 


997 


100 


1478 


gi340201 


Homo sapiens 


voltage-dependent anion channel 


479 


74 


i /no 
14/o 




riomo sapiens 


t -f o nrp_ /^J P>r^^»n p>tn dni ATI f n TTH y 
VOltilge-UCpcIlUClll dlllUil C'lldllllC'l a« 


479 


74 


1478 


gi5 114261 


Homo sapiens 


voltage-dependent anion channel 


479 


74 


1 A 7Q 


A DUHW 1 1 


riomo sapiens 


T-TiTmon trr/it'^in T?T-T'7~1 ^ 

nuiiidii pi\jLciii rri£."Lj. 


192 


48 


i /i -7 a 


A A <T) 

AAr>y4oZZ 


riomo sapiens 


nmilali |JI ULClll aCt ; [UCllC'C kJ J— xi^f 

xro-1 5476 


131 


69 


14 ly 


AoJUj 1 




Hi lm an nvflrv ^npcific nroteiTi SEO 
ID NO* 183 


128 


37 


1480 


AAB93446 


Homo sapiens 


Human protein sequence SEQ ID 
NO- 12690 


204 


100 


1480 


AAM80227 


Homo sapiens 


Human protein SEQ ID NO 3873. 


204 


100 


i a on 


A A A/f700/1 "X 


nomo sapiens 


T-Tiiman Arnfpin S1FO TD NO 1 Q05 


204 


100 


14oz 


a atat/ioaa^ 
AAW 4ZUiO 


riomo sapiens i 


riU.Ifla.il IvdU JJ1ULC111 v^l llvilijry y . 


192 


80 


1482 


gi 1457954 


Homo sapiens 


Rab22b 


192 


80 


1 A OO 


mini TOf^^Q 

gllUl /yo fy 


riomo sapiens 


onid.il vjr rdisc rvrVJ->^ZfJL) 


192 


80 


1483 


ABG62142 


Homo sapiens 


Human prostate specific polypeptide 
#75. 


170 


65 


1483 


A A"C1yfA/CA 


riomo sapiens 


xiuman prostaie specmt protein 
(PSP)#3. 


170 


65 


1483 


gi7959 / /o 


riomo sapiens 


rrvUlD40 


1 UJ 




1484 


A A T l/Zt\A AT 

AAU69407 


riomo sapiens 


L,ung sman ceu carcinoma dmigen 

trl . 


J ou 


S5 


1484 


ABB03602 


Homo sapiens 


Human musculoskeletal system 
reiaieu puiypepLicjc oJ3"s<j xs~j inw 
1549. 


215 


69 


1 /I O A 

1464 


m^OIQOO 1 

gioy/yyzi 


urosopmia 
melanogaster 


XvliUVJ X 1 doC 


159 


32 


1 vl 

14oj 


A A T TOO AO Q 


riomo sapiens 


nunici.li giycobyi iranbicidijc *+ / iu/. 


391 


67 


1485 


gil8676612 


Homo sapiens 


FLJ00205 protein 


391 


67 


Iho -J 


<xi1 41 S04^0 


JXCtLLLtO 

norvegicus 


I IDP-GalNAc nnlvnentide N- 

acetylgalactosaminyltransferase T9 


343 


60 


1486 


AAO11830 


Homo sapiens 


Human polypeptide SEQ ID NO 
25722. 


283 


82 


1486 


AA012132 


Homo sapiens 


Human polypeptide SEQ ID NO 
26024. 


280 


81 


1487 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID NO:121. 


1249 


85 
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SEQ 
ID 


Hit ID 


Species 


Description 


Q 

score 


o/ 
/o 

Identity 


1488 


AAY27572 


Homo sapiens 


Human secreted protein encoaea oy 
gene ino. o. 






1488 


IOC CCSTICSTl 

gi 135 60707 


Homo sapiens 


group Alii secreieu piiospnonpa&c 
A2 


JO? 




1488 


gil269S926 


Mus musculus 


group -A^in secreiea pnospnoiipd.se 
A2 


^1 1 ' 
DLL 




1489 


gi2995442 


Homo sapiens 


UL>rVjal.LjJCiNAC Dl,*r 

galactosyltransferase 


1 Q90 




1489 


gi4520136 


Homo sapiens 


beta- 1 ,4-gaiacLOsynxansierase n 


1 090 




1489 


gi3 132896 


Homo sapiens 


beta- 1 ,4-galactosyltransferase 


1877 


88 


1490 


AAOH284 


T T • 

Homo sapiens 


Human polypeptide orsl^ JLU jnu 
25176. 


10J 




1490 


AAO06074 


T T * 

Homo sapiens 


Human polypeptide o-bk«) AAJ JNU 
19966. 


1 




1490 


AAO02002 


Homo sapiens 


Human polypeptiae oiii^ liJ JNv-' 
15894. 


IJ / 




1491 


AAB93371 


T T * 

Homo sapiens 


Human protein sequence SEQ ID 
NO:12521. 


/1C7 
4o / 


QQ 


1491 


AAM41334 


Homo sapiens 


Human polypeptide oxil^ 1JJ JNiJ 
6265. 


a sin 




1491 


AAM41333 


T T ' 

Homo sapiens 


Human polypeptide JSJiC^ 1JD JN U 

Ozo4. 


^fo / 




1492 


AAE19435 


Homo sapiens 


Human z/oio (^a memoer 01 numan 
adenylate kinase family). 


j't 1 


OJ 


1492 


AAU29224 


Homo sapiens 


Human r tsXJ polypeptide sequence 
#201. 


1 07 


^7 


1493 


• O 1/11 AC OT 

gi21410587 


Homo sapiens 


similar to ixiivc/iN cuin/v 
2310041H06 


OlO 


100 


1493 


gi6855513 


Gallus gallus 


syndesmos 


DD 1 


60 
Du 


1493 


gi!8034388 


Mus musculus 


syndesmos 


537 


58 


1494 


gi20987486 


Homo sapiens 


similar to B cell phosphoinositide 3- 
kinase adaptor 


805 


92 


1494 


gil2082725 


Mus musculus 


B cell phosphoinositide 3-kinase 
adaptor 


331 


62 


1494 


gil2082811 


Gallus gallus 


B cell phosphoinositide 3-kinase 
adaptor 


205 


48 


1495 


AAB64482 


Homo sapiens 


Human secreted protein sequence 
encoded by gene id on*\i lu 
NO: 120. 


273 


100 


1495 


AAB65921 


TT ' 

Homo sapiens 


Human secreted protein hliv^ 11J ino. 
01. 




1 oo 


1495 


AAB95728 


Homo sapiens 


Human protein sequence SEQ ID 
iNU:i oouo. 


273 


100 


1496 


gi56463 


~u'~~\ J I 

Rattus 

4 

norvegicus 


„0 1 / A A 1 ICCA^ 

gpZlU (AA 1-looOJ 


71 79 

/ 1 / 


79 


1496 


gi6650678 


Mus musculus 


nuclear pore memorane giycoproLem 

JrWlVlZ.lv/ 


71 14 


71 
/ 1 


1496 


HI 8676550 


Tfnm r» ^at^ifttis 


FLJ00172 protein 


5038 


95 


1497 


AAY40432 


Homo sapiens 


Amino acid sequence of vk65.15, a 
human VK gene fragment. 


607 


100 


1497 


AAW62185 


Homo sapiens 


Human DNA vkappa65.15 fragment. 


607 


100 


1497 


AAW41147 


Homo sapiens 


Human vkappa65.15 fragment. 


607 


100 


1498 


gi431857 


Homo sapiens 


delta 4-3-oxosteroid 5 beta-reductase 


460 


76 


1498 


gil 1640835 


Homo sapiens 


5-beta steroid reductase 


460 


76 


1498 


gi5689216 


Oryctolagus 


delta4-3 -oxosteroid 5beta-reductase 


442 


70 
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Table 2B 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


% 
identity 






cuniculus 








1500 


gi2 1748478 


TT 

Homo sapiens 


FL JO 0264 protein 


o o on 


AA 

yy 


1500 


AAB93159 


T T 1 

Homo sapiens 


Human protein sequence SEQ ID 

\T/-v . 1 OAO 1 

NO: 12081. 


22Uy 


OLA 

04 


1500 


AAB58796 


Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence SbQ ID 
504. 


1456 


64 


1502 


AAB58202 


T T ' 

Homo sapiens 


Lung cancer associated polypeptide 

______ ^ _ _ ftT? A T T™\ iZ A {\ 

sequence JSlsQ ID z> 40. 


427 


1 C\C\ 

100 


1502 


AAU79340 


Homo sapiens 


Human caveolin-1 (Cav~l). 


345 


100 


1502 


AAE22092 


TT * 

Homo sapiens 


TX 1 • i „ v. „ • „ 

Human caveolm-1 protein. 


1 A G 

i4D 


1 AA 

1UU 


1503 


gi3 115996 


Homo sapiens 


dJ79C4.L2 (Homeobox protein 
r MX- 1 (rnUAl) isoiorm Z) 


410 


97 


1503 


gi3 115995 


Homo sapiens 


dJ79C4.1.1 (Homeobox protein 
r MX- 1 (rriUXlj isoiorm 1) 


410 


97 


1503 


gi460125 


Mus musculus 


homeobox protein 


405 


96 


1504 


gi8 163762 


Homo sapiens 


membrane cofactor protein CD46 
variant 


639 


63 


1504 


AAG75528 


Homo sapiens 


Human colon cancer antigen protein 
SEQ IDNO:6292. 


635 


82 


1504 


AAB58394 


T T * 

Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 732. 


635 


82 


1505 


AAO21802 


Homo sapiens 


Lung-specific ammo acid sequence 

nF/*V TT* \T 1 1 O 

SEQ ID No 1 13. 


301 


61 


1505 


AAO07848 


Homo sapiens 


Human polypeptide SEQ ID NO 
21740. 


161 


89 


1 505 


AAO09394 


T T ' _ 

Homo sapiens 


Human polypeptide SEQ ID NO 

2325D. 


i i r 

1 ID 


D 1 


1506 


AAB94891 


Homo sapiens 


Human protein sequence SEQ ID 

JNU. lOZj 1. 




to 


1506 


gil335205 


Homo sapiens 


ORFII 


209 


69 


1507 


ABB44566 


Homo sapiens 


Human wound healing related 
polypeptide SEQ ID NO 23. 


1 AJ\ Q 
lUDo 


1 AA 


1507 


- ii/41/i o azT 

gil 1414896 


Homo sapiens 


nucieoporm 




1UU 


1507 


gill84173 


Homo sapiens 


nucleoporin 98 




1 AO 


1508 


\ A TV JHAC 1 O 

AAM795 1 o 


Homo sapiens 


Human protein oiiV^ lu invJ 3 io4. 


1 UDZ 


yo 


150S 


AAM78534 


Homo sapiens 


iiuman protein orsv^ 1JJ i iyo. 


1 C\fsD 




1508 


ABB 11951 


Homo sapiens 


Human K/Cl cotransporter 
nomologue, oxiO J-J-J ND.zdZl. 


1062 


96 


1509 


AAB21033 


Homo sapiens 


Human nucleic acid-binding protein, 

NUAor-J /. 


293 


81 


1509 


gi6691968 


Homo sapiens 


dJ148M19.1 (zinc finger protein) 


293 


81 


1509 


gi4096339 


Homo sapiens 


zmc finger protein 


O AO 


Q 1 


1511 


gi8809808 


Mus musculus 


KRAB zmc finger protein 


329 


3y 


1511 


A 11-^/4 j*\ ji ^ 

AAM40475 


Homo sapiens 


Human polypeptide SEQ ID NO 
5406. 


TOO 

322 


fy 


1511 


AAM38689 


Homo sapiens 


Human polypeptide SEQ ID NO 


322 


^7A 

79 


1512 


gi 15029737 


Mus musculus 


complement component 2 (within H- 
2S) 


89 


26 


1512 


gi3 986766 


Mus musculus 


C2 


89 


26 


1512 


gil92435 


Mus musculus 


complement component C2 


86 


26 


1513 


AAB60489 


Homo sapiens 


Human cell cycle and proliferation 
protein CCYPR-37, SEQ ID NO:37. 


1039 


100 


1513 


AAB38417 


Homo sapiens 


Fragment of human secreted protein 


1039 


100 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


% 
Identity 








encoded by gene 5 clone HCGMF16. 






1513 


gi4337460 


Homo sapiens 


neuroblastoma-amplified protein 


1039 


100 


1514 


gil621611 


Homo sapiens 


TRAF family member-associated 
NF-kB activator TANK 


2026 


87 


1514 


AAY88237 


Homo sapiens 


Human I-TRAF protein. 


1996 


86 


1514 


AAW27163 


Homo sapiens 


Human TRAF inhibitor protein I- 
TRAF. 


1996 


86 


1515 


ABP47963 


Homo sapiens 


Human polypeptide SEQ ID NO 393. 


239 


100 


1515 


AAG67216 


Homo sapiens 


Amino acid sequence of human 
Parkin- Associated Protein 1 (PAP1). 


239 


100 


1515 


gi 18490728 


Mus mus cuius 


synaptotagmin-like 3 


142 


60 


1516 


AAB93432 


Homo sapiens 


Human protein sequence SEQ ID 
NO:12661. 


1322 


86 


1516 


AAW88438 


Homo sapiens 


Disease associated protein kinase 
DAPK-7. 


1322 


86 


1516 


gi5931569 


Homo sapiens 


H91620p 


1322 


86 


1517 


gi5931821 


Homo sapiens 


dJ228H13.3 (zinc finger protein) 


2360 


100 


1517 


ABP51457 


Homo sapiens 


Human MDDT SEQ ID NO 479. 


1497 


100 


1517 


ABP51362 


Homo sapiens 


Human MDDT SEQ ID NO 384. 


1497 


100 


1518 


AAU80035 


Homo sapiens 


Beta 2 microglobulin (beta2M)/HFE 
monochain. 


602 


100 


1518 


ABP43154 


Homo sapiens 


Human ovarian antigen HWCG93, 
SEQ ID NO:4286. 


602 


100 


1518 


AAY44412 


Homo sapiens 


Wild type human beta-2 
microglobulin. 


602 


100 


1520 


AAO07974 


Homo sapiens 


Human polypeptide SEQ ID NO 
21866. 


158 


68 


1520 


AAO07413 


Homo sapiens 


Human polypeptide SEQ ID NO 
21305. 


149 


65 


1520 


AAO05994 


Homo sapiens 


Human polypeptide SEQ ID NO 
19886. 


121 


56 


1521 


gil 13861 13 


Homo sapiens 


FKSG25 


2457 


94 


1521 


gil3548673 


Homo sapiens 


SCOT-t 


2451 


93 


1521 


gi20988313 


Homo sapiens 


3-oxoacid CoA transferase 2 


2448 


94 


1522 


AAM42400 


Homo sapiens 


Human polypeptide SEQ ID NO 133. 


215 


56 


1522 


AAM92851 


Homo sapiens 


Human digestive system antigen 
SEQ ID NO: 2200. 


215 


56 


1522 


gil 7862572 


Drosophila 
melanogaster 


LD38503p 


179 


31 


1523 


gi3419880 


Homo sapiens 


MDC/ADAM11 


3883 


91 


1523 


AAR75352 


Homo sapiens 


Human fetal brain MDC protein. 


3878 


91 


1523 


gi5736634 


Mus musculus 


ADAM 11 


3658 


87 


1524 


gil 109782 


Homo sapiens 


protein-tyrosine phosphatase 


2517 


98 


1524 


gil781037 


Mus musculus 


neuronal tyrosine threonine 
phosphatase 1 


2083 


79 


1524 


AAM25744 


Homo sapiens 


Human protein sequence SEQ ID 
NO:1259. 


987 


45 


1525 


AAG75396 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6160. 


442 


100 


1525 


AAB53529 


Homo sapiens 


Human colon cancer antigen protein 
sequence SEQ IDNO:1069. 


442 


100 


1525 


AAR84194 


Homo sapiens 


Human A2b adenosine receptor. 


442 


100 


1526 


ABB44587 


Homo sapiens 


Human wound healing related 
polypeptide SEQ ID NO 44. 


336 


75 


1526 


gil 84448 


Homo sapiens 


transcription factor 


336 


75 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


% 
Identity 


1526 


gil 83930 


Homo sapiens 


helix-loop-hehx protein 


336 


75 


1527 


AAB43940 


-w- • 

Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1385. 


1241 


78 


1527 


gi2 198557 


Homo sapiens 


spermidine aminopropyltransferase 


1190 


78 


1527 


gil4602778 


Homo sapiens 


spermine synthase 


1190 


78 


1528 


ABP41860 


Homo sapiens 


Human ovarian antigen HTPH072, 
SEQ ID NO:2992. 


1189 


62 


1528 


AAW21949 


Homo sapiens 


E6-binding protein E6-BPSD7. 


1189 


62 


1528 


gi469885 


Homo sapiens 


EF-hand protein 


1189 


62 


1529 


AAU83711 


Homo sapiens 


Human PRO protein, Seq ID No 240. 


606 


100 


1529 


AAB84327 


Homo sapiens 


Amino acid sequence of a human 
lyase and associated protein HLYAP- 
2. 


606 


100 


1529 


AAU12301 


Homo sapiens 


Human PRO6079 polypeptide 
sequence. 


606 


100 


1530 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID NO: 121 . 


3094 


92 


1530 


AAB38012 


Homo sapiens 


Human secreted protein encoded by 
gene 3 clone HNHCT15. 


3094 


92 


1530 


ABJ03747 


Homo sapiens 


Human ovary specific protein SEQ 
ID NO: 189. 


3085 


91 


1531 


gil3477353 


Homo sapiens 


Similar to rabphilin 3A-like (without 
C2 domains) 


428 


51 


1531 


ABB06134 


Homo sapiens 


Human NS protein sequence SEQ ID 
NO:226. 


427 


51 


1531 


gi5596433 


Homo sapiens 


candidate tumor suppressor protein 
NOC2 


330 


87 


1532 


AAU81224 


Homo sapiens 


Human lung cancer protein, Seq ID 
No 57. 


2576 


86 


1532 


gil531645 


Rattus 
norvegicus 


C2-HC type zinc finger protein r- 
MyT3 


1913 


66 


1532 


gi29 14751 


Rattus 
norvegicus 


neural zinc finger factor 3; NZF-3 


1908 


66 


1533 


AAE23388 


Homo sapiens 


Human intracellular signalling 
(INTSIG- 15) protein. 


643 


39 


1533 


ABB78745 


Homo sapiens 


Human Tigger 1 transposase protein 
sequence. 


438 


71 


1533 


AAO01214 


Homo sapiens 


Human polypeptide SEQ ID NO 
15106. 


366 


65 


1534 


AAU27733 


Homo sapiens 


Human full-length polypeptide 
sequence #58. 


3947 


90 


1534 


AAM78731 


Homo sapiens 


Human protein SEQ ID NO 1393. 


3943 


90 


1534 


AAB95860 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18924. 


3934 


89 


1535 


gi 15928572 


Mus musculus 


Similar to leucine rich repeat (in 
FLU) interacting protein 2 


103 


66 


1535 


gil7389307 


Homo sapiens 


Similar to leucine rich repeat (in 
FLU) interacting protein 2 


97 


63 


1535 


gi5257201 


Homo sapiens 


LRR FLI-I interacting protein 2 


97 


63 


1536 


AAG02639 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6720. 


160 


71 


1536 


AAG02753 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6834. 


141 


60 


1536 


gi7959778 


Homo sapiens 


PR01546 


140 


60 


1537 


ABG47195 


Homo sapiens 


Human peptide encoded by genome- 


507 


100 
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SEQ 
ID 


-W- "M" • J 'g ■ V 

Hit ID 


Species 


description 


c 
O 

score 


0/ 

/o 

Identity 








derived single exon prooe id 
36860. 






1537 


AAM38436 


Homo sapiens 


reptide #lz4/J encoaea oy prooe tor 
measuring placental gene expression. 


DV 1 




1537 


AAM22055 


TT _, 

Homo sapiens 


Peptide #o4oy encoded by prooe tor 
measuring cervical gene expression. 


jU / 


1UU 


1538 


gi3 133291 


Homo sapiens 


mitogen activated protein kinase 
activated protein kinase 


853 


85 


1538 


gi 175 12453 


Mus musculus 


MAP kinase-activated protein kinase 
5 


850 


84 


1538 


gi2911813 


Mus musculus 


mitogen-activated protein kinase- 
activated protein Kinase 


850 


84 


1539 


ABG38678 


Homo sapiens 


Human peptide encoded oy genome- 
derived single exon probe SEQ ID 




L\J\J 


1539 


AAM04612 


Homo sapiens 


Peptide #3294 encoded by probe for 
measuring breast gene expression. 


339 


100 


1539 


AAM29402 


Homo sapiens 


Peptide r¥o43y encoded oy prooe ror 
measuring placental gene expression. 




1 on 


1540 


gil3 162677 


Homo sapiens 


CjLU 14 enhancer ractor 


1 ylQO 




1540 


gi 14627 122 


Homo sapiens 


dJ583P 15.5.1 (Glut4 enhancer factor 
(isoform 3) ) 


1466 


65 


1540 


AAB58934 


T T * 

Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence bJbv^ 1JJ 
642. 






1541 


ABB 90747 


XT * 

Homo sapiens 


Human Tumour Endothelial Marker 
polypeptide SEQ ID NO 226. 


10U0 




1541 


ABB50291 


T T * 

Homo sapiens 


Collagen type III alpha- 1 ovarian 
tumour marker protein, SEQ ID 

XT/~V7'"> 

JN\J: /z. 


louo 


QQ 


1541 


ABG60248 


Homo sapiens 


Human ovarian antigen #10. 


1606 


98 


1542 


AAE01436 


TT • „ 

Homo sapiens 


Human gene L encoded secreteu 
protein riWLrJ lu, oiiVi il' iN^.vi. 


1 £01 
10 A Z 


QQ 


1542 


AAE01515 


Homo sapiens 


Human gene i encoaea secreteo 
protein iragmeni, oj2v<? i-L* inu, i /z. 


1^1 8 
It) 1 o 




1542 


AAE01464 


Homo sapiens 


Human gene 1 encoded secreted 

uu/r pT1 n CPA TPi TvIO* 11Q 
proTein ri vv J-fir j iu, oiiVi inv_/. x iy. 


1618 


98 


1543 


gi 186043 


Homo sapiens 


immunoglobulin light chain variable 

• 

region 


546 


89 


1543 


gi219886 


Homo sapiens 


Ig kappa light chain 


543 


89 


-1 .C A 1 

1543 


AAYy0z9o 


Homo sapiens 


Human iotaivl-iu lmmunogioounn. 






1544 


AAE19183 


Homo sapiens 


Human protease, PRTS-20 protein. 


3530 


100 


i e A A 

1544 


A A T T'*7 A TZ" 1 

AA U 7476 I 


TT __ J _ 

Homo sapiens 


Human protease FKlo-zi protein 
sequence. 


00 1 Q 

zziy 


Oj 


■t A A 

1544 


AAO20514 


Homo sapiens 


xTotem or Arr related numan 
nomoiogue nurzuijoo. 


ZU J / 


OR 


^ C A C 

1545 




Kattus 

♦ 

LlKJL VCgll/UD 


oJNlr-D 




/ 0 


1545 


gi6693834 


Rattus 
norvegicus 


SNIP-a 


4410 


81 


1545 


gi3098418 


Mus musculus 


P140 


4130 


79 


1546 


gi 179433 


Homo sapiens 


biglycan 


2043 


95 


1546 


AAB85043 


Homo sapiens 


Human biglycan protein sequence. 


1894 


89 


1546 


gi6960459 


Homo sapiens 


biglycan 


1894 


89 


1547 


gi20987689 


Homo sapiens 


Similar to allantoicase 


1332 


64 
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Table 2B 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 


% 

THpntitv 


1 GA H 




ivius muscuius 


aiianioicase 


1 \J Z/ J 




1 ^An 


gll4/ lo04o 


xiomo sapiens 


ai l anxoicase 




66 


lD4o 




riomo sapiens 


LT nryi ,, n nmtpin QPf) TD 7\TO 


203 


78 


1 CA 0 

154o 


A A / n A'2 0'T7 


riomo sapiens 


riuman secreieci proienij iu 




78 


1 ZA O 

104o 


A A 1 1 A 

AAr>D / 1 1U 


±iomo sapiens 


numdn prostate CcincC'i diingcii 




78 


1 <AQ 
lD4y 


nil QQGQ^A 

glloJoiOO 


jjOS Taurus 


tannin cr •nt*r\ff a iTi V\ff q ^ icrt'fATm 
Uctppjllg piULCJIl, UCLclJ lolU.V/1111 


156 


100 


1549 


gi2239063 


Bos taurus 


actin-binding protein CP3 


156 


100 


i54y 


gijyjzD / 


riomo sapiens 


r-acun capping proieiri ocia ouduiiil 




100 


1550 


gi3370998 


Homo sapiens 


BAH -associated protein 1 


526 


91 


1550 


gil527oiy3 


Homo sapiens 


MAui-iL oeta 


JjLKJ 


Q1 


1550 


gil5278186 


Homo sapiens 


MAGI- 1 A 


526 


91 


1551 


ABG61604 


Homo sapiens 


Human UrKr-Z splice variant tto. 


ZC5DU 


JJ 


1551 


ABG61602 


Homo sapiens 


Human JJrKr -z splice variant tfi. 




OS 

yD 


1551 


ABG61592 


Homo sapiens 


Human DPPIV related serine 
protease UrKr-i. 


2850 


95 


1552 


AAG73867 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:4631. 




DO 


1552 


AAB58391 


Homo sapiens 


Lung cancer associated polypeptide 
sequence oxiC^ 1JJ /Zy. 


322 


36 


1552 


gi296667 


Homo sapiens 


ornithine decarboxylase (ODC) 




36 


1553 


AAW96153 


Homo sapiens 


Human r ADD-mteractmg protein 


1 

105 


1 no 

1UU 


1553 


gil61 18555 


Homo sapiens 


ELMOl 


168 


100 


1553 


gi2 15 94742 


Mus musculus 


Similar to engulfment and cell 
motility 1, ced-12 homolog (C. 
elegans) 


1 

loo 


1UU 


1554 


ABB89950 


Homo sapiens 


Human polypeptide SEQ ID NO 
2326. 


244 


81 


1554 


AAU86135 


Homo sapiens 


Human PR.0274 polypeptide. 


Z44 


53 1 
O 1 


1554 


AAB93352 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12476. 


244 


81 


1556 


AAM38652 


Homo sapiens 


Human polypeptide SEQ ID NO 
1191. 


1838 


85 


1556 


gi4530437 


Homo sapiens 


thyroid hormone receptor-associated 
protem complex component 
TRAP240 


1645 


78 


1556 


AAM40438 


T T 1 — — 

Homo sapiens 


Human polypeptide JSbQ ID NO 
53oy. 


loll 


1 1 


1557 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoaed by gene / ocrQ ID jnu. lzi . 


2327 


89 


1557 


AAB38012 


Homo sapiens 


Human secreted protein encoaed oy 
gene ^ clone xirsriL>i id. 






1558 


gll 106o46o 


Kattus 
norvegicus 


Knour-r giuiamaxe transport 
moauiator vjrirvf\Jr*fo 


^70A 


79 


1558 


gil9387126 


Mus musculus 


guanine nucleotide exchange factor 


1051 


37 




A ARQ074A 


riAJlliVJ odpivllo 


Human C\AT4'20 9 t^rntein ^eauence 

1 1 VI 1 11 till V T I L*\J £j jJl VLvJl 1 0\rfV^VAVrfAlW-> 

SEQ ID 186. 


1049 


50 


1559 


gil8033747 


Homo sapiens 


myosin IIIB 


1032 


66 


1559 


AAE24138 


Homo sapiens 


Human kinase (PKIN)-9 protein. 


964 


65 


1559 


AAU03552 


Homo sapiens 


Human protein kinase #52. 


964 


65 


1560 


AAM79228 


Homo sapiens 


Human protein SEQ ID NO 1890. 


3488 


64 


1560 


AAM39300 


Homo sapiens 


Human polypeptide SEQ ID NO 
2445. 


3467 


64 
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SEQ 
ID 


Hit ED 


Species 


Description 


S 

score 


% 
Identity 


1560 


AAM41086 


TT „ _ ^ ' ^ _ 

Homo sapiens 


Human polypeptide otiQ 1JJ in kj 
6017. 




£i 

OJ) 


1561 


AAB59019 


Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence oii<si iu 
727. 


158 


47 


1561 


ABG35360 | 


TT * ^ ^ 

Homo sapiens 


Human peptide encoaea oy genome- 
oenveu single exon prooe oxz/V^ i-l' 


ni 
1j1 


ou 


1561 


AAM01340 


Homo sapiens 


Peptide #22 encoded by probe for 
rneasuriiig nuiriaii orodbi gene 

expression. 


131 


60 


1562 


gll5U4UU2 


Homo sapiens 


similar xo a numan iiicij or v^iviv- 

VtinHincr nrntf^in DOPT<T 1 5s 0 




82 




gUoliO 14/ 


jvius museums 




2457 


79 


1562 


AAW03515 


Homo sapiens 


Human DOCK180 protein. 


1636 


53 


1563 


AAB52017 


Homo sapiens 


UTimnn nonro+on v\t*/^'i*^tt - 1 C^AIlPtl^P 

riuman secreiea protein setiucucc 

t*r\ne\Ae*A hi\r crp>np» (\ QRO TF) TSJO*^^> 
CIlCOUcCl uy gcllc O V</ ix^» nu.uu. 




100 


1503 


AAJViy ID /O 


riomo sapiens 


jnuiiidii immune/ xiacniaLupuicn^ 

antigen SEQ ID NO: 19169. 


77 


37 


1 CCA 


A ATT1 f\£.Qf\ 

aau luoyu 


riomo sapiens 


riuman dgouu poiypcpuuc 




97 


1564 


gi540073 


Homo sapiens 


agouti precursor 


362 


97 


1564 


gi895344o 


Homo sapiens 


OJ/ojLrii7.3 ^agouti ^mousey- 
signaling protein) 




07 


1565 


AAM79853 


Homo sapiens 


Human protein oiii^ jljl/ inv-/ j^y:?. 


9109 


S9 


1565 


AAM78869 


Homo sapiens 


Human protein SEQ ID NO 1531. 


2302 


S2 


1565 


gi6l9877 


Homo sapiens 


nyar oxymetny l giutar yi- uoa 
synthase 


9109 


R9 


1566 


AAM41373 


Homo sapiens 


Human poi^^peptide c>rii^ iu 
6304. 




1 00 


1 C /I 

1566 


AAM39557 


Homo sapiens 


Human poiypeptiae oiiv^ ij-' in w 
2732. 


1*>fi 


100 


1 r* 

1566 


AAU 12276 


Homo sapiens 


Human rKUouu i poiypepnae 
sequence. 


1 S4 


41 


1 C ✓"'"7 

1567 


A A T I 1 

AAU27723 


Homo sapiens 


Human iun-iengtn poij^peptiae 
sequence tf^fo. 


919 




1567 


AAO10211 


Homo sapiens 


Human polypeptide SEQ ID NO 

Z^r IUj. 


219 


84 


1567 


AAO01262 


Homo sapiens 


Human polypeptide SEQ ID NO 

1^1 Z.A 


218 


74 


1568 


gil 1595428 


Homo sapiens 


Qj /ozj iy.i ^glycine cieava-gc system 
protein H (aminomethyl carrier)) 


S69 




1 C£LO 

iDoo 


glZ19o / 1 


Homo sapiens 


nyurogen cdmer pruiein pieeuit>ui 


856 


91 


1568 


gil2653985 


Homo sapiens 


glycine cleavage system protein H 
^ammometnyi camerj 


856 


91 


1569 


AAO 14211 


Homo sapiens 


Human transporter and ion channel 
TRICH-28. 


228 


59 


1569 


1 i mm mm /-y 

AAG77968 


Homo sapiens 


Human ion channel protein IC23949. 


Zl f 


J / 




A A 1\.yf3/1 1 


tiomo sapiens 


measuring placental gene expression. 


155 


100 


1570 


gi532505 


Homo sapiens 


bile acid CoA: Amino acid N- 
acyltransferase 


2049 


99 


1570 


gil6306985 


Homo sapiens 


bile acid Coenzyme A: amino acid 
N-acyl transferase (glycine N- 
choloyltransferase) 


2049 


99 


1570 


gil5215152 


Mus muscuius 


Similar to bile acid-Coenzyme A 


1410 


68 
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Table 2B 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


% 

T^l an T'i'rftT 

jLueiiuiy 








denydrogenase. ammo acia n- 
acyi Lrdii&ierd.iae 






15/1 


gllD4o/0/4 


riomo sapiens 




204 


100 


1571 


gi 17529999 


Homo sapiens 


ox3'sterol-binding protein-like protein 


204 


100 


1571 


AAG89290 


Homo sapiens 


Human secreted protein, SEQ ID 

"MTV /I 1 a 

JNU. 41U. 


198 


97 


1572 


AA017114 


Homo sapiens 


Human Gli3 protein SEQ ID NO: 21. 


1105 


62 


1572 


gi6102812 


Homo sapiens 


CjLri3 protein 


1 1 fK 
1 IUj 


£9 


1572 


gil 83248 


Homo sapiens 


DNA-binding protein 


1105 


62 


1573 


g i 18447471 


Drosophila 
melanogaster 


RE41571p 


177 


47 


1573 


gi9965400 


murine 

herpesvirus 72 ' 


membrane virion glycoprotein 150 


128 


30 


1573 


gil 01 9435 


Trypanosoma 

* 

cruzi 


mucin-like protein 


127 


36 


1 C T A 

1574 


ABG38560 


Homo sapiens 


Human peptide encoded oy genome- 
derived single exon probe SEQ ID 






1574 


AAM04517 


Homo sapiens 


Peptide #3199 encoded by probe for 
measuring oreast gene expression. 


239 


100 


1574 


AAM29283 


Homo sapiens 


Peptide itoDZK) encooea oy prooe ior 
measuring placental gene exprcfasiun. 




i on 


1575 


A A T TOO -COT 

AAUo35y / 


xlomo sapiens ( 


riuman r xvv^i protein, ocq ijli ino iz. 


1904 


100 


1575 


A A TD ^7y1 "~} r\C\ 

AAo /4 f\Jy 


Homo sapiens 


riumaii inein Diane as&ociateu pruLC/in 
MEMAP-15. 


19Q4 


1 00 

1 \J\J 


15 /5 


A A T TA/C 1 0< 

AAUyoloD 


Homo sapiens 


numan secreted proiein, oc/\£ vu inu 
87. 


1 191 


so 


1576 


AAM79494 


Homo sapiens 


riuman protein oiiv^ it-' inu j i^fu. 


94^9 


oo 


1576 


gil773381 


Homo sapiens 


APXL 


2432 


86 


1576 


AAM78510 


Homo sapiens 


Human protein SEQ ID NO 1 172. 


2428 


86 


1577 


AAU07829 


TT * 

Homo sapiens 


Human ARTS-1 polypeptide. 




/O 


1577 


gi213 15078 


Homo sapiens 


type 1 tumor necrosis factor receptor 
shedding aminopeptidase regulator 


3395 


76 


1577 


gi6381989 


Homo sapiens 


adipocyte-denved leucine 
aminopeptidase 




/o 


1578 


ABG35337 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
25002. 


1 ^A 

IjU 


/J 


1578 


AAM01319 


Homo sapiens 


Peptide #1 encoded by probe for 
measuring human breast gene 
expression. 


1 ^A 
13U 


7^ 

/ J 


1578 


AAM25964 


Homo sapiens 


Peptide tfI encoded by probe tor 
measuring placental gene expression. 


1 ^A 
1DU 


/J 


1579 


gi76578o4 


Homo sapiens 


matcn to nuclear protein, JNrZzu, 
note, sequence uinerence at residue 

JO 




OS 


1 S7Q 
ld /y 


a AY07032 




Tireast cancer associated antigen 
precursor sequence. 


4023 


98 


1579 


gil374698 


Homo sapiens 


nuclear protein, NP220 


4023 


98 


1580 


ABB09439 


Homo sapiens 


Serpin domain protein Zserpl5 
associated amino acid sequence. 


1172 


98 


1580 


AAW51933 


Homo sapiens 


Human protease nexin I type alpha. 


1172 


98 


1580 


AAW51934 


Homo sapiens 


Human protease nexin I type beta. 


1172 


98 


1581 


gi499184 


Felis catus 


neuronal protein 


305 


93 
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Table 2B 
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SEQ 
ID 


Hit ID 


Species 


Description 

- 


S 

score 


% 
Identity 


1581 


AAB95041 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16804. 


284 


13 


1581 


gil3960126 


Homo sapiens 


Similar to leucme-nch neuronal 
protein 


/*»»"» /A 

230 


75 


1582 


AAB80253 


Homo sapiens 


Human PR0293 protein. 


3181 


99 


1582 


AAY13385 


Homo sapiens 


Amino acid sequence of protein 
PR0293. 


3181 


99 


1582 


AAU83686 


Homo sapiens 


Human PRO protein, Seq ID No 190. 


1669 


55 


1583 


AAB73858 


Homo sapiens 


Human NPM/ALK fusion protein. 


268 


75 


1583 


AAB28184 


Homo sapiens 


Human Nucleophosmin. 


268 


75 


1584 


AAB93492 


Homo sapiens 


Human protein sequence SEQ ID 
NO:12796. 


697 


100 


1584 


gi7302187 


Drosophila 
melanogaster 


CG7849-PA 


274 


40 


1584 


gi21626800 


Drosophila 
melanogaster 


CG7849-PB 


274 


40 


1585 


gi8572061 


Bombyx mori 


fibroin heavy chain Fib-H 


156 


34 


1585 


gi765323 


Bombyx mori 


silk fibroin heavy chain 


155 


39 


1585 


gil55999 


Bombyx mori 


silk fibroin 


147 


39 


1586 


gil2002682 


Homo sapiens 


FERM-containing protein 


2019 


83 


1586 


ABP37983 


Homo sapiens 


Human GS93382 protein. 


531 


99 


1586 


AAB12318 


Homo sapiens 


Human secreted protein encoded by 
gene 18 clone HE2FL70. 


189 


100 


1587 


ABP41887 


Homo sapiens 


Human ovarian antigen HKZBB48, 
SEQ IDNO:3019. 


1005 


90 


1587 


AAB95585 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18250. 


1005 


90 


1587 


AAY86208 


Homo sapiens 


Nuclear transport protein clone 
hfb030 protein sequence. 


1005 


90 


1590 


AAG0285 1 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6932. 


226 


88 


1590 


AAB38280 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 20 SEQ ID 
NO:136. 


226 


88 


1591 


AAB95593 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18273. 


2718 


82 


1591 


AAU27637 


Homo sapiens 


Human protein AFP39158. 


609 


75 


1591 


ABG45908 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
35573. 


311 


100 


1592 


ABB05746 


Homo sapiens 


Human GASC1 protein SEQ ID 
NO:l. 


780 


33 


1592 


gil 0567 164 


Homo sapiens 


gene amplified in squamous cell 
carcinoma- 1 


780 


33 


1592 


AAM39339 


Homo sapiens 


Human polypeptide SEQ ID NO 
2484. 


111 


33 


1593 


AAY69069 


Homo sapiens 


Amino acid sequence of a human 
reduced tropoelastm derivative. 


2887 


80 


1593 


gil 82020 


Homo sapiens 


elastin 


2887 


80 


1593 


AAO17360 


Homo sapiens 


Human elastin. 


2870 


79 


1595 


gi291854 


Homo sapiens 


aminopeptidase A 


1289 


91 


1595 


gil518S65 


Sus scrofa 


aminopeptidase A 


1127 


80 


1595 


gi7673021 


Rattus 
norvegicus 


aminopeptidase A 


1103 


77 


1596 


AAU71952 


Homo sapiens 


Human bone marrow tissue 


356 


58 
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Table 2B 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 


% 

jlvi v/Hiit^y 








poiypeptiae wd\j. 






1597 


gl55o7o9 


Homo sapiens 


inositol 1,+, j-nipnopnaie 
phosphatase 




92 


1597 


ABB77441 


Homo sapiens 


Human Tumour marKer protein laz/- 1 . 




01 


1597 


gi21262190 


Homo sapiens 


CTCL tumor antigen Li9~l 


339 


91 


1598 


ABB08405 


Homo sapiens 


AlpnaljO-iucosyi transierase ammo 
acid sequence. 


JO / 


lUU 


1598 


AAW22125 


TT • „ _ 

Homo sapiens 


Human alpna l-o tucosyitransierase. 


do / 


1 nn 


1598 


gi7638407 


Bos taurus 


6-alpha-L-fucosyl tr ansferas e 


367 


100 


1599 


AAM90773 


Homo sapiens 


Human lmmune/naematopoietic 
antigen SEQ ID NO: 18366. 




y / 


1599 


AAU74354 


XT * 

Homo sapiens 


Human cytoskeleton-associated 
protein Yolvr^; TfZD. 


/4U 




1599 


AAB93267 


Homo sapiens 


Human protein sequence SEQ ID 

"\T/"V 1 oonrt 
JND. IZoUU. 


149 


46 


■t s~ r\ r\ 

1600 


AAB48140 


Homo sapiens 


Human iajnuU zuy variant z 
p oi yp epnae . 






1600 


AAB48139 


Homo sapiens 


Human TANGO 209 variant 1 
poiypeptiae. 


372 


80 


1600 


AAB48107 


Homo sapiens 


Human TANGO 209 polypeptide. 


372 


80 


1601 


gi694l623 


Homo sapiens 


dJ4oyJUZz. l (^Kiiesus Diooa group, 
CcEe antigens) 


ZOO 


/o 


1601 


AAB03339 


TT ' 

Homo sapiens 


Human RhCe protein. 


J.DZ 


1 


1601 


AAB03338 


Homo sapiens 


Human RhCE protein. 




1 nn 

1UU 


1602 


AAB94196 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 143.50. 


784 


79 


1602 


AAB93449 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12696. 


784 


79 


1602 


AAB97178 


Homo sapiens 


Human EF-chiral calcium-binding 
protein 96. 


784 


79 


1603 


gil3096S14 


Mus musculus 


RIKEN cDNA 4932442K08 gene 


165 


72 


1603 


gil 1493928 


TT * 

Homo sapiens 


nomeodomain-interactmg protein 
kinase 2 


/4 


on 


1604 


gil32l596 


TT 1 

Homo sapiens 


immunoglobulin lambda-cnain 
subgroup 11 


J /o 


Oj 


1604 


AAG76014 


T T _- „ • 

Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6778. 




j i 


1604 


gi2l6l9848 


TT „ „• „ „ _ 

Homo sapiens 


similar to immunogloDuim lamoaa 

■ • • /*> 

joining j 






1605 


AAB95234 


Homo sapiens 


Human protein sequence SEQ ID 

XTfV 1 

iNU, 1 13 (0, 


2336 


92 


1605 


AAB27239 


Homo sapiens 


Human EXMAD-17 SEQ ID NO: 17. 


2334 


99 


1605 


gi 19263740 


Mus musculus 


axotropnin 


1 




1606 


AAB92702 


Homo sapiens 


Human protein sequence oiiv«) iu 
NO: 11 102. 




Q7 


1606 


gi204516S2 


T T * 

Homo sapiens 


JliMMA protein 




yz 


1606 


AAB93188 


Homo sapiens 


Human protein sequence SEQ ID 


3088 


91 


1607 


AAB31693 


Homo sapiens 


Amino acid sequence of viral 
encoded semaphorin protein receptor. 


1014 


79 


1607 


AAB70131 


Homo sapiens 


Human VESPR. 


1014 


79 


1607 


AAY13462 


Homo sapiens 


Viral-encoded semaphorin protein 
receptor (VESPR) polypeptide. 


1014 


79 


1608 


AAW29683 


Homo sapiens 


Human Na-K-2C1 cotransporter 
NKCC2. 


1295 


99 



1 
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Table 2B 
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SEQ 
ID 


Hit ID 


Species 


Description 


score 


0/ 

/o 

Identity 


1608 


gi 1373425 


T T" ♦ 

Homo sapiens 


bumetanide-sensitive Na-K-zCl 
cotransporter 


izyD 


QQ 

yy 


1608 


gi5 16001 


Oryctolagus 
cuniculus 


bumetanide-sensitive Na-K-Cl 
cotransport protein splice isoform F 


1 O/i A 




1610 


AAB88388 


T T 

Homo sapiens 


Human membrane or secretory 
protein clone PSEC0131. 




luU 


1610 


gi208 10493 


Homo sapiens 


O " * 1 x _ T» TTyTT-j'KT „ T^»7vT A 

Similar to RIKEN cDJNA 
2810417M05 gene 


92U 


yy 


1610 


AAE21272 


Homo sapiens 


Human gene 16 encoded secreted 
protein fragment, oilv ID JNU: 1 Jo. 




IUU 


1611 


AAY53753 


Homo sapiens 


Amino acid sequence of the MMSC2 
protein. 


269 


98 


loll 


AA Y 04732 


Homo sapiens 


Protein containing Jr-Uz. domain rrom 
clone 3 8-2- la. 


zoy 


OS 


1611 


AAYU4734 


Homo sapiens 


rrotem containing ruz, aomam rrom 
clone 38-2- lc. 




yo 


1612 


AAR85912 


tt„ „ • „„ _ 

Homo sapiens 


Uncostatm M receptor- oeta suounit. 


1 1 3 Q 


yj. 


1612 


gil794211 


Homo sapiens 


oncostatin-M specific receptor beta 
suuunit 


1138 


92 


1612 


gil5012082 


Homo sapiens 


oimilar to oncostatm M. receptor 


1 Ho 


y/* 


1613 


ABP42361 


Homo sapiens 


Human ovarian antigen HNOKG34, 


406 


82 


1613 


AAB48966 


Homo sapiens 


rluman Heterogeneous nuclear 
ribonucleoprotein Al (hnRNP Al). 


4UO 


oZ 


1613 


AAW55828 


TT „ • „ 

Homo sapiens 


Human heterogeneous nuclear 
riDonucieoproiein core proxem ai. 


4UO 


oZ 


1614 


ABB07649 


Homo sapiens 


Human LOR-1 protein. 


898 


99 


1614 


A AB 00077 


Homo sapiens 


Human lysyl oxidase related protein 
(Lor). 


898 


99 


1614 


gil 890108 


Homo sapiens 


lysyl oxiaase-reiatea protein 




QQ 

yy 


1615 


gil 2803 157 


Homo sapiens 


COX 15 (yeast) homolog, cytochrome 
c oxidase assembly protein 


163 


48 


1615 


gi3603230 


TT ' 

Homo sapiens 


cytochrome oxidase assembly factor 


lD3 


4o 


1615 


gi76 19702 


Homo sapiens 


bA483F11.2.1 (COX15 (yeast) 
homolog, cytochrome c oxidase 
assembly protein (isoform 1)) 


163 


48 


1616 


gi7672979 


TT 

Homo sapiens 


glucosidase II beta subunit 




04 


1616 


ABB50266 


Homo sapiens 


Protein kinase C substrate 80K-H 
ovarian tumour marker protein, #20. 


2536 


84 


1616 


gil293640 


Homo sapiens 


protein kinase C substrate 80K-H 


2536 


84 


1617 


AAY07076 


Homo sapiens 


Renal cancer associated antigen 
precursor sequence. 


355 


100 


1617 


gil022888 


Oryctolagus 
cuniculus 


protein phosphatase 2A0 B' subunit 
beta4 isoform 


355 


100 


1617 


gil 022886 


Oryctolagus 
cuniculus 


protein phosphatase 2A0 B 
regulatory subunit beta3 isoform 


355 


1 r\t\ 

100 


1618 


gil 40 10930 


Haemophilus 
influenzae Rd; 
similar to 
AAC22226.1 
(PID:gl 573555) 
[Homo sapiens] 


similar to transcription accessory 
protein (tex) 


1868 


100 


1618 


AAB92758 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11220. 


1863 


99 


1618 


ABB89105 


Homo sapiens 


Human polypeptide SEQ ID NO 


1231 


95 
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SEQ 
ID 


Hit ID 


hpecies 


uescripiion 


« 

score 


Identity 








L*+o 1 , 






1619 


AAW64262 


Homo sapiens 


Human neutrophil elastase. 


628 


100 


1619 


AAP80335 


Homo sapiens 


Sequence of serine protease (SP) of 
human myeloid cellorigin and leader 
peptide. 


628 


100 


1619 


gi296665 


Homo sapiens 


serine protease 




1 00 


1620 


gi9965989 


Homo sapiens 


calcineurin A catalytic subunit 
gamma isoform 


1648 


91 


1620 


gi258001 


Homo sapiens 


calcineurin A catalytic subunit; 
calmoaulin-aepenaent protein 
phosphatase catalytic subunit; CaM- 
PrP catalytic subunit 


1637 


96 


1620 


gi200466 


Mus musculus 


pnospnoprotein pnospnatase 


14.71 




1621 


gil778538 


Escherichia coli 


anaerobic carrier for c4 5 
oi carboxylases 


739 


100 


1621 


gil786839 


Escherichia coli 
Kl2 


transport of dicarboxylates 


739 


100 


1621 


gil25 13520 


Escherichia coli 

0157:H7 

EDL933 


transport of dicarboxylates 


739 


100 


1622 


AAE15635 


Homo sapiens 


Human Lr-protem couple a receptor-2) 
(GCREC-5) protein. 


071 

Z 1 1 


OA 


1622 


AAB66272 


Homo sapiens 


tiuman i ajnol? j> /o oiiv ^ jnvj. 
29. 


971 




1622 


AAB82487 


Homo sapiens 


Human secretin-like receptor Zgprl 
splice variant. 


271 


94 


1623 


gil787320 


Escherichia coli 
K12 


homolog of Salmonella P-ring of 
ilagelia basal Doay 


749 


87 


1623 


gi4062658 


Escherichia coli 


rlagellar basal t>oay r-nng protein 
precursor 


7AQ 


87 


1623 


gil25 14621 


Escherichia coli 

Oi57:H/ 

EDL933 


homolog of Salmonella P-ring of 
rlagella basal ooay 


748 


86 


1624 


gil3360031 


jiscnencma coli 
0157:H7 


aiianxomase 






1624 


gi2735238 


Escherichia coli 


vjIXjd^ 




70 


1624 


gi 1773 192 


Escherichia coli 


similar to S. cerevisiae dall 


491 


70 


1625 


gi606368 


Escherichia coli 


aspartate semiaioenyae 
dehydrogenase 


1 A7S 


yj 


1625 


gi3859587 


Shigella sonnei 


aspartate semialdehyde 
dehydrogenase 


1478 


93 


1625 


gil789841 


Escherichia coli 
K12 


aspartate-semialdehyde 
dehydrogenase 


1478 


93 


1626 


gil788006 


Escherichia con 
K12 


phenylalanine tRNA synthetase, 
beta-subunit 


/ J3 




1626 


gil25 15724 


Escherichia coli 
0157:H/ 


phenylalanine tRNA synthetase, 
oeta-suDunit 






1626 


gil742806 


Escherichia coli 


Phenylalanine— tRNA ligase (EC 
6. LI. 20) b chain 


735 


94 


1627 


AAB49502 


Homo sapiens 


Clone HYASC03. 


310 


98 


1627 


gi22 137373 


Mus musculus 


similar to RIKEN cDNA 
2810051A14 gene 


240 


95 


1627 


gi20071228 


Mus musculus 


RIKEN cDNA 2810051A14 gene 


151 


51 


1628 


gil7431382 


Ralstonia 


PROBABLE TRANSCRIPTION 


357 


37 
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S 

score 


% 
Identity 






solanacearum 


REGULATOR PROTEIN 






1628 


gi2 11 14950 


Xanthomonas 
campestris pv. 
campestris str. 
ATCC 33913 


transcriptional regulator 


331 


38 


1629 


gi311422 


Escherichia coli 


ORF-2 


282 


100 


1630 


gi732874 


Neisseria 
meningitidis 


IgAl protease 


101 


33 


1630 


gi600255 


Gallus gallus 


caldesmon 


99 


23 


1631 


gil788756 


Escherichia coli 
K12 


PEP-protein phosphotransferase 
system enzyme I 


785 


96 


1631 


gi 125 16792 


Escherichia coli 

0157:H7 

EDL933 


PEP-protein phosphotransferase 
system enzyme I 


785 


96 


1631 


gil799835 


Escherichia coli 


PHOSPHOENOLPYRUVATE- 
PROTEIN 

PHOSPHOTRANSFERASE (EC 
2.7.3.9) 

(PHOSPHOTRANSFERASE 
SYSTEM, ENZYME I). 


785 


96 


1632 


gi8 87820 


Escherichia coli 


UUG start; possible frameshift at 
end? 


810 


72 


1632 


gi466651 


Escherichia coli 


No definition line found 


343 


82 


1632 


gi2 1957797 


Yersinia pestis 
KTM 


acridine efflux pump 


250 


56 


1633 


AAB92950 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11 629. 


290 


100 


1633 


gi2 1627 153 


Drosophila 
melanogaster 


CG8155-PA 


82 


39 


1633 


gi3850257 


Colti virus JKT- 
7043 


Vp9 


78 


40 


1634 


gil2580933 


Homo sapiens 


dJ505P2.1.1 (ribonuciease 6 
precursor) 


442 


62 


1634 


AAY21852 


Homo sapiens 


Human signal peptide-contianing 
protein (SIGP) (clone ID 2652271). 


437 


61 


1634 


AAW75103 


Homo sapiens 


Human secreted protein encoded by 
gene 47 clone HMCBP63. 


437 


61 


1635 


gil 8 146756 


Homo sapiens 


limkain beta 2 


1620 


100 


1635 


AAY86509 


Homo sapiens 


Human gene 70-encoded protein 
fragment, SEQ ID NO:424. 


696 


100 


1635 


AAY86510 


Homo sapiens 


Human gene 70-encoded protein 
fragment, SEQ ID NO:425. 


436 


100 


1636 


gi8096340 


Homo sapiens 


RERE 


513 


71 


1636 


gi4689163 


Caenorhabditis 
elegans 


EGL-27 


83 


23 


1636 


gi21913127 


Caenorhabditis 
elegans 


C. elegans EGL-27 protein 
(corresponding sequence C04A2.3b) 


83 


23 


1637 


AAU69417 


Homo sapiens 


Lung small cell carcinoma antigen 
#11. 


514 


94 


1637 


AAB85481 


Homo sapiens 


Human 23553 sulfatase polypeptide. 


514 


94 


1637 


AAM25714 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 1229. 


514 


94 


1638 


AAM43540 


Homo sapiens 


Human polypeptide SEQ ID NO 218. 


741 


100 


1638 


AAM38682 


Homo sapiens 


Human polypeptide SEQ ID NO 
1827. 


741 


100 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


fx t 

% 

Identity 


1638 


AAY42750 


Homo sapiens 


Human calcium binding protein 1 
(CaBP-1), 


741 


100 


1639 


AAB94175 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 14484. 


908 


86 


1639 


AAM40361 


Homo sapiens 


Human polypeptide SEQ ID NO 
3506. 


908 


86 


1639 


gil52150S5 


Mus musculus 


Similar to COP9 (constitutive 
photomorphogenic), subunit 7b 
(Arabidopsis) 


900 


85 


1640 


gil529 


Oryctolagus 
cuniculus 


calcium channel BI-2 


95 


30 


1640 


gi2653364 


Bovine 

herpesvirus type 
LI 


immediate-early transactivator 
protein (cell nucleus) 


91 


25 


1640 


gi291536 


Bovine 
herpesvirus 1 


BICP4 


91 


25 


1641 


AAB26105 


Homo sapiens 


Human DAN/Cerberus-related 
protein 6 (hDCR6) #1. 


2012 


100 


1641 


AAE 17089 


Homo sapiens 


Human osteolevin protein. 


758 


100 


1641 


ABB07209 


Homo sapiens 


Human cloaked-2 polypeptide 
sequence. 


758 


100 


1642 


AAU29026 


Homo sapiens 


Human PRO polypeptide sequence 
#3. 


279 


94 


1642 


AAY99458 


Homo sapiens 


Human PRO 193 amino acid 
sequence SEQ ID NO:410, 


279 


94 


1642 


AAY36125 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 510. 


279 


94 


1643 


ABG60166 


Homo sapiens 


Human DITHP polypeptide #224. 


426 


76 


1643 


AAG76121 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6885. 


426 


16 


1643 


AAM38668 


Homo sapiens 


Human polypeptide SEQ ID NO 
1813, 


426 


76 


1644 


AAU84339 


Homo sapiens 


Protein HPYl6ElBmd differentially 
expressed m breast cancer tissue. 


514 


82 


1644 


gi2232019 


Homo sapiens 


HP VI 6 El protein binding protein 


514 


82 


1644 


gil2653271 


Homo sapiens 


thyroid hormone receptor mteractor 
13 


514 


82 


1645 


AAU74752 


Homo sapiens 


Human protease PRTS-12 protein 
sequence. 


3063 


99 


1645 


ABG34074 


Homo sapiens 


Human Pro peptide #45. 


2954 


98 


1645 


AAB50936 


Homo sapiens 


ADAM protein #2. 


2814 


100 


1646 


AAO20516 


Homo sapiens 


Protein of APP related human 
homologue hCP50592. 


Ms A 

273 


98 


1646 


gi 1232077 


Homo sapiens 


huMCM2 


273 


98 


1646 


gi 13544066 


Homo sapiens 


Similar to mini chromosome 
maintenance deficient 2 (S. 
cerevisiae) 


273 


98 


1647 


AAG75416 


TT * 

Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6180. 


ODD 


HA ! 

74 


1647 


AAB57016 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1594. 


655 


74 


1647 


gil4328059 


Homo sapiens 


argininosuccinate synthetase 


655 


74 


1648 


gi7960207 


Oncorhynchus 
mykiss 


vitelline envelope protein alpha 


252 


43 


1648 


gi!5384295 


Mycoplasma 


variable surface lipoprotein Vsp422- 


177 


34 
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SEQ 
ID 


Hit ID 


Species 


Description 


score 


0/ 

/o 

Identity 






bo vis 


8 






1648 


gi2281750 


Salmo salar 


eggshell protein 


ID 4 * 


JO 


1649 


AAM79049 


Homo sapiens 


Human protein oiiQ ID N<J 1/11. 




/ 0 


1 f e? r\ 

1650 


gi7018384 


Homo sapiens 


dJ lyiiN 13.1 (rnannosidase, aipna, 
class 1A, member 1) 


ivzo 


OS 


1650 


gil 5 929672 


Mus musculus 


mannosidase 1, alpha 


yyjy 




1650 


gi474280 


Mus musculus 


mannosyl-oligosaccharide alpha- 1 ,2- 
mannosidase 


909 


88 


1651 


AAU 10551 


Homo sapiens 


Human A259 polypeptide. 


835 


71 


1651 


ABP47883 


XT * 

Homo sapiens 


Human polypeptide SEQ ID NO 313. 




/ 1 


1651 


AAB50085 


Homo sapiens 


T X A 'T Cf\ 

Human A259. 


833 


/I 


1652 


gi45 12295 


Homo sapiens 


< 111*1 1 ■ „ • _ t_i _ 

immunogloblin heavy chain variable 

region 


619 


1 riA 
1UU 


1652 


AAR66320 


Homo sapiens 


Human immunoglobulin variable 
heavy chain #26. 


613 


99 


1652 


gi296657 


Homo sapiens 


Ig heavy chain gene variable region 
V(12G-1) 


613 


99 


1653 


AAE24079 


Homo sapiens 


Human MDPK protein. 


185 


85 


1653 


AAE24150 


Homo sapiens 


Human kinase (PKIN)-2l protein. 


181 


97 


1653 


AAU03501 


Homo sapiens 


Human protein kinase #l. 


181 


97 


1654 


AAU75784 


Homo sapiens 


Human protein phosphatase 2 (PP2) 

• 

protein sequence. 


667 


92 


1654 


AAE04841 


Homo sapiens 


Human SGP039 phosphatase 
polypeptide. 


667 


92 


1654 


gi8954030 


117 


Contains similarity to protein 
phosphatase 2C from Arabidopsis 
thaliana gb|AF085279. It contains a 
protein phosphatase 2C domain 
PFJ00481 


42 




1655 


AAU96179 


Homo sapiens 


Human secreted protein, SEQ ID No 
81. 


86 


27 


1655 


gi 17942 19 


XT * 

Homo sapiens 


150 kDa oxygen-regulated protein 
ORP150 


oJ. 




1655 


AAB01381 


XT " 

Homo sapiens 


Neuron-associated protein. 


lb 


41 


1656 


ABB84869 


Homo sapiens 


Human PRO 1079 protein sequence 
SEQ ID NO: 106. 


404 


78 


1656 


ABB95475 


Homo sapiens 


tt : _ ■ 1 _ 1 j „ 1 • „ 

Human angiogenesis related protein 
PRO1079 SEQ ID NO: 106. 


404 




1656 


AAB75372 


Homo sapiens 


TT A 1 J * _LL1 1 

Human secreted protem #3 1 . 


404 


78 


1657 


AAM41200 


Homo sapiens 


Human polypeptide SEQ ID NO 
6131. 


332 


91 


1657 


AAM39414 


Homo sapiens 


Human polypeptide SEQ ID NO 
2559. 


332 


97 


1657 


gi31867 


Homo sapiens 


\t aII * 1 1 x 

N-acetylglucosamme-6-sulphatase 


332 


97 


1658 


AAW25154 


XT * 

Homo sapiens 


Human disulphide epimerase like 
enzyme, EP52. 


22ol 


1 r\r\ 

100 


1658 


gil 136743 


Homo sapiens 


human P5 


2281 


100 


1 c 0 

1658 


gi 12654931 


Homo sapiens 


protein cisuitiae isomerase-reiatea 
protein 






1659 


AAM41859 


Homo sapiens 


Human polypeptide SEQ ID NO 
6790. 


357 


71 


1659 


gi6694278 


Homo sapiens 


cell recognition molecule Caspr2 


357 


71 


1659 


gi!3624214 


Homo sapiens 


contacthvassociated protein 2 


357 


71 


1660 


AAO 11734 


Homo sapiens 


Human polypeptide SEQ ID NO 
25626. 


141 


100 
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SEQ 
ID 


Hit ID 


Species 


Description 


IS 

score 


o/ 
/o 

laeniiiy 


1661 


AAG74841 


TT ~ ■ „„ „ 

Homo sapiens 


Human colon cancer anugen protein 

oxiV^ 1U iNW .DOKJD . 




R7 


1661 


gi4588087 


Homo sapiens 


PTH-responsive osteosarcoma Bl 
protein 


568 


87 


1661 


gi20987880 


Mus museums 


Similar to PTH-responsive 
osteosaicoma rs 1 proxem 


536 


86 


1662 


gi4754907 


Homo sapiens 


histone deacetylase 4 


2891 


77 


1662 


gil4495171 


Gallus gallus 


histone deacetylase-4 


ZZ>Zj 


Oo 


1662 


AAB49957 


Homo sapiens 


Human histone deacetylase HDAC-4. 


2508 


79 


1663 


ABB94075 


Homo sapiens 


Human secreted protein SbQ ID NU. 

1 i o 

lib. 


1 Q1 




1663 


ABB94056 


Homo sapiens 


Human secreted protein SEQ ID NO: 
99. 


183 


100 


1663 


ABB94030 


Homo sapiens 


Human secreted protein id jn<j. 
73. 






1664 


ABB37187 


Homo sapiens 


Peptide #4693 encodea by human 
foetal liver single exon probe. 


1 Q£ 


1 OA 


1664 


gil 3310486 


Homo sapiens 


C2H2 zinc finger protein 


y i 


j l 


1664 


gil5159543 


Agrobactenum 
tumefaciens str. 
C58 (Cereon) 


AGR_L_2143p 


Q S 
OO 


ZD 


1665 


AAB32388 


Homo sapiens 


Human secretea protein sequence 
encoaea oy gene io ocy iu rs\j. 




1 nn 


1665 


AAY91419 


Homo sapiens 


Human secreted protein sequence 
eiicoueu. Oy gene o lxj nv. i*ri/. 


82 


36 


i f f E 

1665 


giz 1708 132 


Homo sapiens 


oimnar to L^i^i^fyH- / j 




31 


1666 


gi219615 


Homo sapiens 


preproelafin 


621 


100 


1666 


* ax /x fx A "1 

gi299841 


Homo sapiens 


pre-elaiin 


^9 1 


L\J\J 


1666 


gil 90338 


Homo sapiens 


elaiin precursor 


1 


100 


1667 


gi 18072031 


Homo sapiens 


zinc finger protein 328 


397 


78 


1667 


AAU27687 


Homo sapiens 


Human lull-length polypeptiae 
sequence #12. 


Z / U 


oo 


1667 


AAM79885 


Homo sapiens 


Human protein odi^ id jnu ^ j j i . 




JO 


1669 


AAG03136 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 721 /. 


179 


75 


1669 


AAU85309 


Homo sapiens 


G-coupled olfactory receptor #170. 


176 


68 


1669 


AAU24689 


Homo sapiens 


Human olfactory receptor 

A /XT T7D 1 OO 

AOLF Kloo. 


1 /o 


oo 


1670 


AAY21852 


Homo sapiens 


Human signal peptide-contianing 
protein (oiLrx'J (clone ID Zoozz / 1). 


375 


83 


1670 


AAY48563 


TT 

Homo sapiens 


Human breast tumour-associatea 
protein 24. 






1670 


AAW75103 


Homo sapiens 


Human secreted protein encoded by 
gene 4/ clone HMLDroj. 


375 


83 


1671 


gi6572310 


T T * 

Homo sapiens 


CB13C9.1 (protein 239 Ab) 






1671 


AAU69508 


Homo sapiens 


Human purified secretory 
| polypeptide #77. 


348 


60 


10 / 1 


gl /O /H- 1 1Z 


JTHJII1U odpiCIlo 


J y r\^i-J 


347 


80 


1672 


AAG63163 


Homo sapiens 


Amino acid sequence of 
carcinoembryonic antigen-like 
polypeptide. 


2187 


99 


1672 


AAR54714 


Homo sapiens 


Carcinoembryonic antigen CEA-(c). 


456 


30 


1672 


AAB43688 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1 1 33 . 


455 


31 


1673 


AAU74354 


Homo sapiens 


Human cytoskeleton-associated 


1163 


34 
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ID 
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S 

»core 


% 








protein (CYSKP) #25. 






1673 


AAB93267 


Homo sapiens 


Human protein sequence o.bQ J.J-J 
NO: 12300. 






1675 


AAM06070 


Homo sapiens 


reptiae #4 /dz encoded by prooe tor 
measuring breast gene expression. 




i no 


1676 


ABP43066 


Homo sapiens 


Human ovarian antigen HVCAE01, 
SEQ ID NO:4198. 


zyz 


JO 


1676 


ABP42762 


XT * „ 

Homo sapiens 


Human ovarian antigen HOVJU75, 
SEQ ID NO:3894. 




Do 


1676 


AAG01285 


Homo sapiens 


Human secreted protein, diiQ ID 
INU: J30O. 




Do 


1677 


gi458226 


Homo sapiens 


a gene isolated from a CpG island 
oetween Mb ana 1vA.l 


1249 


97 


1677 


AAG00737 


Homo sapiens 


Human secreted protein, SEQ ID 

INU: 4olo. 


531 


94 


1677 


gi966315l 


Homo sapiens 


transport-secretion protein 2.1 (TTS- 
2.1) 


421 


35 


1678 


gil806l02 


Homo sapiens 


T cell receptor beta chain 


444 


100 


1678 


AAB68370 


Homo sapiens 


Human beta-chain variable region of 
1 ceiL receptors (V beta I /j. 


440 


98 


1678 


AAW76993 


Homo sapiens 


Human T cell receptor beta chain 
vanaole region protein v-oeta l /. 


440 


98 


1679 


AAB 92624 


Homo sapiens 


Human protein sequence bc,^) ID 
NO:10919. 




1 nn 
1UU 


1679 


ABG39897 


TT. * _ 

Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
zyDoz. 




oy 


1679 


AAM05720 


Homo sapiens 


Peptide #4402 encoded by probe for 
measuring breast gene expression. 


155 


69 


1680 


gil0186503 


Homo sapiens 


sialic acid-specific acetylesterase II 


907 


68 


1680 


gi 10242345 


T T — - * 

Homo sapiens 


sialic acid-specific 9-0- 
acetylesterase I 


yu / 


oo 


1680 


gi59 17657 


Mus musculus 


cytosolic sialic acid y-U- 
acetylesterase 


idZ 


DD 


1681 


gi46540 


Staphylococcus 
aureus 


coagulase precursor 




ZD 


1681 


gi20338653 


Staphylococcus 
aureus 


coagulase 




ZD 


1681 


gi2 1328339 


Caenorhabditis 
elegans 


C. elegans RBF-l protein 
(corresponding sequence F37A4.7d) 


OA 


o 1 
J 1 


1682 


AAO07844 


Homo sapiens 


Human polypeptide SxiQ ID JNU 
LY /30. 


1 Ar\ 


/u 


1682 


AAO0 /413 


Homo sapiens 


Human poiypeptiae Miv^ iu inu 


Q1 




LOoZ 


gil /ooiyDZ 


urosopnna 
melanogaster 


LUU Vy"t /p 


o / 




1 /TO 1 

1683 


A AW 99574 


Homo sapiens 


riuman early placental msuim-iiKe 
(EPIL^ oolvoetitide 




1 DA 


1683 


AAY26926 


Homo sapiens 


Human insulin-like 4 protein. 


349 


100 


1683 


AAW69168 


Homo sapiens 


Zinsl protein. 


349 


100 


1684 


gi23365 


Homo sapiens 


17-HSD protein (AA 1 - 328) 


1082 


70 


1684 


gil81951 


Homo sapiens 


estradiol 17 beta-dehydrogenase 


1082 


70 


1684 


gil77127 


Homo sapiens 


1 7-beta-hydroxysteroid 
dehydrogenase 


1082 


70 


1685 


ABB84963 


Homo sapiens 


Human PR04356 protein sequence 


1336 


99 
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score 
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SEQ ID NO:294. 






1685 


ABB95569 


Homo sapiens 


T T ♦ * "T 1 1 * 

Human angiogenesis related protein 
PR04356 SEQ ID NO: 294. 


1336 


99 


1685 


AAB31206 


Homo sapiens 


Amino acid sequence of human 
polypeptide PR04356. 


1336 


99 


1686 


ABB80870 


Homo sapiens 


Type II TGFbeta receptor (RII) 
sequence. 


2130 


88 


1686 


AAB82996 


Homo sapiens 


Human transforming growth factor- 
beta receptor RII. 


2130 


88 


1686 


AAY59453 


Homo sapiens 


Human Transforming growth factor- 
beta II protein sequence. 


2130 


88 


1687 


ABB79162 


Homo sapiens 


Human VPAC1 receptor protein SEQ 
ID NO:2. 


386 


76 


1687 


ABB79161 


Homo sapiens 


Human VPAC1 receptor protein SEQ 
ID NO:l. 


386 


76 


1687 


AAB71878 


Homo sapiens 


Human VIPR seven transmembrane 
domain. 


386 


76 


1688 


AAB 84663 


Homo sapiens 


Amino acid sequence of human 
tyrosine kinase protein Yes. 


520 


90 


1688 


AAY24421 


Homo sapiens 


Human yesl protein. 


520 


90 


1688 


gil 81268 


Homo sapiens 


cellular yes-1 protein 


520 


90 


1690 


AA014938 


Homo sapiens 


Human UFD2-associated protein 1 . 


1386 


76 


1690 


AAM00761 


Homo sapiens 


Human bone marrow protein, SEQ 
ID NO: 124. 


1386 


76 


1690 


gil 35 16467 


Homo sapiens 


homzygously deleted in 
neuroblastoma-l/UFD2 


1386 


76 


1691 


AAB47977 


Homo sapiens 


BCY5. 


334 


98 


1691 


gi2463632 


Homo sapiens 


monocarboxylate transporter 
homologue MCT6 


326 


96 


1691 


gi9246437 


Staphylococcus 
aureus 


fmtA-like protein 


81 


24 


1692 


AAB95143 


Homo sapiens 


Human protein sequence SEQ ID 
NO:17163. 


394 


98 


1692 


AAB93175 


Homo sapiens 


Human protein sequence SEQ ID 
NO:12114. 


394 


98 


1692 


AAB58175 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 5 13. 


394 


98 


1693 


AAG65906 


Homo sapiens 


Ammo acid sequence of GSK gene 
Id 37967. 


3025 


92 


1693 


AAY67598 


Homo sapiens 


Human adipose tissue protein #l . 


501 


40 


1693 


gi3786312 


Homo sapiens 


extracellular matrix protein 


501 


40 


1694 


ABB83488 


Homo sapiens 


Human cytoskeleton-associated 
protein, CSAP-17. 


216 


93 


1694 


gi3002588 


Mus musculus 


Plenty of SH3s; POSH 


216 


93 


1694 


gil 8676610 


Homo sapiens 


FLJ00204 protein 


206 


86 


1695 


gil2018147 


Chlamydomonas 
reinhardtii 


vegetative cell wall protein gp l 


132 


30 


1695 


gil 79453 82 


Drosophila 
melanogaster 


RE17165p 


130 


28 


1695 


gil209103 


Rattus 
norvegicus 


atrophin-1 related protein 


130 


25 


1696 


AAB43791 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1236. 


353 


98 


1696 


AAM79791 


Homo sapiens 


Human protein SEQ ID NO 3437. 


152 


77 


1696 


AAM78S07 


Homo sapiens 


Human protein SEQ ID NO 1469. 


152 


77 
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score 
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Identity 


1697 


AAE22224 


Homo sapiens 


Human 5685C6 protem. 


542 


98 


1697 


ABG4055 1 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
30216. 


516 


98 


1697 


AAM 18652 


Homo sapiens 


Peptide #5086 encoded by probe for 
measuring cervical gene expression. 


516 


98 


1698 


AAM39274 


Homo sapiens 


TT 1 . » 1 riy \ f\ TT*"v "JwT/~v 

Human polypeptide SEQ ID NO 
2419. 


1596 


79 


1698 


AAB29653 


Homo sapiens 


T T 1 ) 1 _ i • 

Human membrane-associated protein 
HUMAP-10. 


1596 


'ICS 

79 


1698 


gi9858855 


Homo sapiens 


HPT protein 


1596 


79 


1699 


gi684113S 


Homo sapiens 


HSPC099 


275 


100 


1699 


AAB93037 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11816. 


130 


23 


1699 


AAB61308 


T TT * 

Homo sapiens 


Human transcriptional regulator 
protein #8. 


125 


22 


1700 


AAM93959 


T T * 

Homo sapiens 


TT 1 j * 1 rip/-\ TTN "X T/~\ . 

Human polypeptide, SEQ ID NO: 
4164. 


1 1 o 

Ilo2 


99 


1700 


AAB36587 


Homo sapiens 


Human FLEXHT-9 protein sequence 
SEQ ID NO:9. 


590 


53 


1700 


AAB93652 


Homo sapiens 


Human protein sequence SEQ ID 
NO:1316L 


JOO 


Z>4 


1701 


gi30264 


Homo sapiens 


cystatin D 


392 


100 


1701 


gil61 16526 


Homo sapiens 


yX60D10.1 (cystatin D) 


387 


100 


1701 


AA015149 


Homo sapiens 


Human cystatm D protein sequence. 


380 


98 


1702 


gil374754 


Staphyl othermus 

• 

marmus 


tetrabrachion 


72 


27 


1703 


AAE17127 


Homo sapiens 


Human GnT- V protein. 


628 


77 


1703 


AAB48911 


Homo sapiens 


Human beta- 1,6 -N- 

acetylglucosamine transferase (GnT- i 
V). 


628 


77 


1703 


AAB83010 


Homo sapiens 


Human GnT-V. 


628 


77 


1704 


AAY68736 


Homo sapiens 


Short chain alcohol dehydrogenase- 
related molecule ScRM-2 cDNA. 


1249 


81 


1704 


AAY69295 


Homo sapiens 


A human human protem, designated 
HSPC021 (CBFAIG06). 


1249 


81 


1704 


AAB58463 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 801. 


1249 


81 


1705 


gil5294065 


Ictalurus 
punctatus 


40S ribosomal protein S26-2 


212 


58 


1705 


ABP42978 


If T * 

Homo sapiens 


Human ovarian antigen HPDWD69, 
SEQ ID NO:41 10. 


208 


58 


1705 


» A >— mm -% mwm 

AAG76127 


Homo sapiens 


TT 1 j B i 

Human colon cancer antigen protein 
SEQ ID NO:6891. 


1 AO 


DO 


1706 


gi65 80428 


T T" " 

Homo sapiens 


IkappaBR 


2759 


95 


1706 


gi 14250636 


Homo sapiens 


nuclear factor of kappa light 

1 ft * 1 ^1 * 1 "1 

polypeptide gene enhancer in B-cells 
mniDitor-liKe z 


2606 


86 


1706 


gi746415 


Homo sapiens 


I kappa BR 


1655 


71 


1707 


AAM79219 


Homo sapiens 


Human protein SEQ ID NO 1 88 1 . 


325 


64 


1708 


gi5901529 


Homo sapiens 


C2H2 type Kruppel-like zinc finger 
protein splice variant b 


565 


99 


1708 


gi5901527 


Homo sapiens 


C2H2 type Kruppel-like zinc finger 
protein 


317 


33 


1708 


gil5029916 


Mus musculus 


Similar to Zinc finger protein 118 


296 


31 



4 
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1709 


AAU85415 


Homo sapiens 


Human protein NOV13. 


953 


98 


1709 


AAU72895 


Homo sapiens 


Human metal loprotease partial 
protein sequence #7. 


925 


100 


1709 


AAU74750 


Homo sapiens 


Human protease PRTS-10 protein 
sequence, 


925 


100 


1710 


gil2862392 


Mus musculus 


D86 


2863 


64 


1710 


ABG37531 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
27196. 


1687 


100 


1710 


AAM03553 


Homo sapiens 


Peptide #2235 encoded by probe for 
measuring breast gene expression. 


1687 


100 


1711 


gil787337 


acyl-carrier- 
protein synthase 
II [Escherichia 
coli K12] 


3-oxoacyl- 


315 


91 


1711 


gil25 14639 


acyl-carrier- 
protein synthase 
II [Escherichia 
coli0157:H7 
EDL933] 


3-oxoacyl- 


315 


91 


1711 


gi664870 


Escherichia coli 


beta-ketoacyl-acyl carrier protein 
synthase II 


315 


91 


1712 


AAG75407 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6171. 


1258 


82 


1712 


AAR89952 


Homo sapiens 


Insulin-like growth factor binding 
protein-3 . 


1232 


80 


1712 


AAU85512 


Homo sapiens 


Clone #19095 (L549S) of lung 
tumour protein. 


1228 


80 


1713 


AAB94696 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15673. 


710 


97 


1713 


AAB99892 


Homo sapiens 


Human RNA helicase gene helicain 
C protein sequence SEQ ID NO:6. 


710 


97 


1713 


AAB99891 


Homo sapiens 


Human RNA helicase gene helicain 
B protein sequence SEQ ID NO:4. 


710 


97 


1714 


gi 153 84740 


Homo sapiens 


paralemmin-2 


1652 


100 


1714 


gi 153 84742 


Homo sapiens 


Palm2-AKAP2 fusion protein 


1577 


100 


1714 


gil4041780 


Homo sapiens 


AKAP-2 protein 


410 


73 


1715 


gil3021825 


Homo sapiens 


polymerase 


575 


48 


1715 


gi3 600067 


Homo sapiens 


polymerase 


572 


48 


1715 


gil780973 


Human 
endogenous 
retrovirus K 


pol protein 


572 


48 


1717 


gi35825 


Homo sapiens 


pregnancy zone protein 


6592 


88 


1717 


gi579594 


Homo sapiens 


alpha 2-macroglobulin 690-740 


4985 


66 


1717 


AAR11749 


Homo sapiens 


Human alpha-2 macroglobulin bait 
region mutant. 


4976 


66 


1718 


gil3366277 


Homo sapiens 


dJ998H6.1 (orthologofratPB- 
Cadhenn) 


1167 


93 


1718 


gi4760578 


Mus musculus 


PB-Cadherin 


1028 


84 


1718 


gil398912 


Rattus 
norvegicus 


short type PB-cadherin 


1022 


84 


1720 


AAE06588 


Homo sapiens 


Human protein having hydrophobic 
domain, HP10778. 


687 


100 


1720 


AAM40979 


Homo sapiens 


Human polypeptide SEQ ID NO 
5910. 


687 


100 



i 
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1720 


gil 5072402 


Raja ennacea 


organic solute transporter alpha 


3D 1 




1721 


AAG81345 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:20o. 


839 


62 


1721 


gil 6359082 


Homo sapiens 


Similar to RIKEN cDNA 
2810049006 gene 


839 


62 


1721 


AAB93797 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13560. 


836 


62 


1722 


gi871883 


Homo sapiens 


lanosterol 14-demetnylase 






1722 


gil 809225 


Homo sapiens 


lanosterol 14-demethylase 
(cytochrome p450) 


2180 


99 


1722 


gil 698396 


Homo sapiens 


lanosterol 14-demethylase 
cytochrome P450 


2180 


99 


1723 


ABG41541 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
31206. 


232 


100 


1723 


AAM32019 


Homo sapiens 


Peptide #6056 encoded by probe for 
measuring placental gene expression. 


232 


100 


1723 


AAM71727 


Homo sapiens 


Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
32033. 


*\ <*i 
232 


100 


1724 


AAY53040 


Homo sapiens 


Human secreted protein clone 
kj320_l protein sequence SEQ ID 
NO:86. 


2480 


100 


1724 


gi3510639 


Rattus 
norvegicus 


UDP-GalNAc:poIypeptide N- 
acetylgalactosaminyltransferase T5 


Id4j 


■CO 


1724 


gi6688167 


Homo sapiens 


GalNAc-T 5 


IVoZ 


1 AA 


1725 


ABP41917 


T T * 

Homo sapiens 


Human ovarian antigen JtiobDr / 1, 
SEQ ID JNG:3Q4y. 


1 ion 


y 1 


1725 


AAG75406 


T T „ 

Homo sapiens 


Human colon cancer antigen protein 
&r,\l ID iNU.Ol /U. 




on 


1725 


gi396176 


Homo sapiens 


antigenic surface determinant OA3 


1329 


97 


1726 


AAU79036 


Homo sapiens 


Human orlJro- 1 ^not aeimeaj 
receptor. 


OJO 




1726 


A A WT A A A O 1 

AAW40451 


Homo sapiens 


riuman oriz omaing protein. 






1726 


AAW49909 


Homo sapiens 


Signal regulatory protein 4 (SIRP4). 


858 


98 


1727 


gi2707601 


TT • 

Homo sapiens 


synaptophysin 


lOJO 




1727 


gil 5928723 


Mus musculus 


synaptophysin 


1585 


94 


1727 


gi57326 


Rattus 
norvegicus 


synaptophysin (AA 1-30/J 


1 coo 




1728 


gil5590682 


Homo sapiens 


histone deacetylase 9a 


628 


96 


1728 


gil5590680 


Homo sapiens 


histone deacetylase 9 


628 


yo 


1728 


gil2060992 


Mus musculus 


MEF2-interacting transcription 
repressor MITR 


625 


95 


1729 


gi2 1430596 


Drosophila 
melanogaster 


RE16431p 


891 


39 


1729 


gil41 64377 


Mus musculus 


Type II membrane protein of 
ER~mouse gene similar to alpha- 
mannosidase 


350 


33 


1729 


gil 504008 


Homo sapiens 


Containing ATP/GTP-binding site 
motif A(P-loop): Similar to 
C.elegans 

protein(Pl:CEC47E128);Similar to 
Mouse alpha- 
mannosidase(Pl :B54407) 


346 


33 


1730 


gi7861753 


Mus musculus 


GABA-A receptor epsilon-like 


412 


32 
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score 
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J a entity 








subunit 






1730 


gil 1610622 


Rattus 
norvegicus 


GAB A- A epsilon subunit splice 
variant 


A 1 1 




1730 


gi7861743 


Rattus 
norvegicus 


GABA-A receptor epsilon-like 
subunit 


411 


32 


1731 


AAW80135 


Homo sapiens 


Human recombinant neurokmm-2 
(NK-2) receptor protein. 


1 HA 1 

1745 


9j 


1731 


gil 89222 


Homo sapiens 


neurokinin-2 receptor 


1743 


95 


1731 


gil89135 


Homo sapiens 


neurokinin A receptor 


1742 


95 


1732 


AAB75594 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 37 SEQ ID 
NO:148. 


678 


99 


1732 


AAB 80437 


Homo sapiens 


Gene #20 associated peptide #1. 


381 


95 


1732 


AAM78175 


Homo sapiens 


Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
38481. 


365 


100 


1733 


ABB84853 


Homo sapiens 


Human PRO 1 120 protein sequence 
SEQ ID NO:74. 


293 


1 Ar\ 
100 


1733 


ABB90378 


Homo sapiens 


Human polypeptide SEQ ID NO 
2754. 


293 


1 AA 

100 


1733 


ABB95459 


Homo sapiens 


Human angiogenesis related protein 
PROl 120 SEQ ID NO: 74. 


293 


100 


1734 


ABB07527 


Homo sapiens 


Human drug metabolizing enzyme 
(DME) (ID: 5643401CD1). 


1652 


79 


1734 


ABB07515 


Homo sapiens 


Human drug metabolizing enzyme 
(DME) (ID: 8097779CD1). 


867 


73 


1734 


gil3161409 


Mus musculus 


family 4 cytochrome P450 


718 


58 


1735 


AAM40183 


Homo sapiens 


Human polypeptide SEQ ID NO 
3328. 


1322 


67 


1735 


AAY71159 


Homo sapiens 


Human phosphodiesterase interacting 
protein, myomegalm. 


1322 


67 


1735 


gi4761644 


Rattus 
norvegicus 


myomegalin 


886 


44 


1736 


AAM94312 


Homo sapiens 


-wr 1 J • J "111 

Human reproductive system related 
antigen SEQ ID NO: 2970. 


500 


^71 

71 


1736 


ABJ03726 


Homo sapiens 


Human ovary specific protein SEQ 
ID NO: 168. 


z9o 


A 1 

41 


1736 


gi8439396 


HERV-H/env62 


envelope protein 


ZyZ 




1737 


AAY76177 


Homo sapiens 


Human secreted protein encoded by 
gene 54. 


288 


100 


1738 


AAY92075 


Homo sapiens 


Human DKR-4. 


759 


100 


1738 


AAB08875 


Homo sapiens 


Ammo acid sequence of a human 
Dickkopf (Dkk)-4 protein. 


759 


100 


1738 


AAW73017 


Homo sapiens 


Human cysteine-rich secreted protein 
CRSP-2. 


759 


100 


1739 


ABB97828 


Homo sapiens 


Human secretory polypeptide 
(SPTM) 80. 


2007 


86 


1739 


ABB90159 


Homo sapiens 


hluman polypeptide £>Ji^ id rsw 
2535. 


1S47 




1739 


gil 8642980 


Homo sapiens 


GTPase 


1524 


61 


1740 


AAB49278 


Homo sapiens 


Protein encoded by zsig8 1 cDNA 
fragment. 


755 


94 


1740 


AAU29276 


Homo sapiens 


Human PRO polypeptide sequence 
#253. 


755 


94 


1740 


gil 2003 127 


Eremothecium 


GTPase activating protein BEM2 


83 


28 
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ID 


1111 1U 


* 


T% pc t*ri irti ft ti 


S 

score 


% 
Identity 






• • 

gossypii 








1 HA 1 
1/41 


gl/1 l\j l ZdZ 


norno sapiens 


nrotein 


16^7 


76 


1741 


AAB65196 


Homo sapiens 


Human PROH15 (UNQ558) protein 
sequence SEQ ID NO:177. 


1636 


76 


1 "J A 1 

1/41 


A AD Q7</1 1 


riomo sapiens 


Wnrnfin PRD1 1 1 


1636 


76 


1742 


AAG66142 


Homo sapiens 


Human PAS Kinase (PASK) 
polypeptide. 


2227 


99 


1742 


AAA jfnmi 1 

A AM 1 


riomo sapiens 


Unman rvr*M-£»in QT7^ TT^ MO 1 RQ"^ 

riuman protein jji^ aj-/ inu loyj. 


9797 


QQ 


1742 


AAM79230 


Homo sapiens 


Human protein SEQ ID NO 1892. 


mi 


99 


1743 


gil3879899 


Mycobactenum 
tuberculosis 

CDCloM 


rr ii lamuy protein 




97 


1743 


gi26533 1 1 


Bovine 

Herpesvirus type 
1.1 


very large virion protein (tegument) 


99 


27 


\ T /to 

1743 


• 1 A(\-t /TO 1 

gll4yio21 


Bovine 
herpesvirus 1 




QQ 


97 


1 /44 


c ni 1710 

gijy^ i /io 


L/Jiiamyaomonas i 
reinnarutii 


i-aipna oynem neavy ciiaiii 


71 26 


56 


1744 


gi9409781 


Chlamydomonas 
reinndratn 


1 beta dynein heavy chain 


1121 


34 


1744 


gi5 14215 


Chlamydomonas 
reinnaraiii 


dynein beta heavy chain 


1070 


33 


1745 


AAB53088 


Homo sapiens 


Human angiogenesis-associated 

nrritp»iTi PR fV398 WO TD TSJO* 1 ^9 

piuieiii rj\L/jzOj oijv^ lis inu.ijl. 


1972 


89 


1745 


AAB80260 


Homo sapiens 


Human PR0328 protein. 


1972 


89 


1745 


A A T T1 1 

AAU 1233 1 


riomo sapiens 


riunian j^ivljj^oi poiypepuue 
sequence. 


1 979 


SQ 


1 /4o 


A A T TOO 1 *70 

AAU/yi/z 


riomo sapiens 


riuman r i\u poiypepuue sequence 
#149. 




68 


1746 


AAY99398 


Homo sapiens 


Human PRO1301 (UNQ667) amino 
acia sequence ojcv^ iu inu.ziz. 


730 


68 


1746 


AAM38651 


Homo sapiens 


Human polypeptide SEQ ID NO 
i /yo. 


728 


68 


1747 


AAE21056 


Homo sapiens 


Human drug metabolising enzyme 
^ijivijd- i^f^ protein. 


111 


59 


1748 


AAE03560 


Homo sapiens 


Human differentially expressed 
Kictney cuin/v zzjou encooea 
protein. 


1140 


88 


1 /4o 


AAJV14Z434 


riomo sapiens 


nuincin iLiuney rcidiccx puiypcpuuc 


466 


98 


1 /4o 




riomo sapiens 


SEQ ID NO 356. 


466 


98 


1 9.4Q 


a AP99Q1 1 


rioino sapiens 


T—fi i man fr^n ^nr^rf rr pin rl ion fTiatiti^l 

(TRICH) 10. 


2309 


100 


1 7/1 0 

i /4y 


gllOjUDoUj 


nomo sapiens 


+lr\/mif* ch*om5il pn-fran cnnif pt 


2309 


100 


1749 


eil 3506808 


"Mus wiusculus 

1V1UO I A 1 UOvUlLiL? 


thymic stromal co-transnorter 


1782 


77 


1750 


gil 8 157547 


Mus musculus 


pecanex-like 3 


7005 


89 


1750 


gil 5076843 


Homo sapiens 


pecanex-like protein 1 


4359 


51 


1750 


gil3171105 


Takifugu 
rubripes 


pecanex 


4118 


58 


1751 


AAY06603 


Homo sapiens 


Retinoblastoma interacting protein 
GluTl*. 


411 


100 


1751 


gi825504 


Homo sapiens 


glutamate transporter 


411 


100 
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score 


% 
Identitv 


1751 


gi825663 


Homo sapiens 


GLAST1 


411 


100 


175Z 




riomo Sapiens 


ncugeniii 


593 


100 


1752 


gi641966 


Gallus gallus 


neogenin 


591 


98 


1752 


gu7859yy 


Kattus 

♦ 

norvegicus 


• 

neogenin 


586 


97 


1753 


ABB53278 


Homo sapiens 


Human polypeptide #18. 


315 


26 


1753 


ABB53277 


Homo sapiens 


Human polypeptide #17. 


315 


26 


1753 


AAP94014 


TT • „„ „ 

Homo sapiens 


Carcinoemoryonic cen suriace 
antigen. 






1754 


AA021681 


Homo sapiens 


Human secreted protein oxii^ 11^ ino 
23. 


*T*TJ 




1754 


AAB75375 


Homo sapiens 


riuman secretea protein 770*+. 


AA'X 

r rj 




1754 


AAB88603 


Homo sapiens 


riuman nyaropnoDic aomain 
containing protein clone HP 10770 






1755 


AAE22906 


Homo sapiens 


Human transporter and ion channel 


1973 


90 


1755 


AAE16350 


Homo sapiens 


Human tetracycline transporter like- 
liKe protein, rijj_yi i*r. 


1969 


90 


1755 


gi2506078 


Mus musculus 


tetracycline transporter-like protein 


1961 


90 


1756 


AAB 88469 


TT 

Homo sapiens 


Human membrane or secretory 
piotem clone roD^uuz/. 


1UOJ 


OS 


1756 


AAE06608 


Homo sapiens 


Human protein having hydrophobic 
domain, nriu/yo. 


1065 


98 


1756 


AAM40347 


. — 

Homo sapiens 


Human poiypeptiac or^Vi 11^ i^u 




98 


1757 


gi89252o4 


Homo sapiens 


pnospnauuyimuoiLui puiypiiu&piiaLc 

^-r^VirvcT^TnafQCf 1 tvnf* TV 
^"•piliJapiictla.ac iy^j^ x v 


2599 


91 


1 

l/D / 


giyzyD->3 J 


lvius muacuiub 


i n r»cif ril r»r»1vnVir><:r>Viatfi S -nil nsnh ata.se 


1989 


78 


1757 


gi5360761 


Rattus 

i 

norvegicus 


pharbin 


1928 


77 


1758 


gi395207 


Bos taurus 


potassium channel (BGK5) 


753 


82 


175b 


gllbODOy 


riomo sapiens 


puldbblUlll UilcllillCl 


750 


82 


1758 




t^anis iamiiiaris 




746 


81 


1759 


AAB65058 


Homo sapiens 


Gene #7 associated peptide #12. 


908 


93 


1759 


AAB64yyy 


riomo sapiens 


riuman sccrctcG protein tr / . 


908 


93 


1759 


AAU01099 


Homo sapiens 


Gene 35 Human secreted protein 
nomoiogous amino aciQ tjccjuciiL/c 


814 


91 


1760 


AAU11384 


Homo sapiens 


Human T2R61 (hT2R61) 
polypeptide. 


802 


100 


1760 


gi20336531 


Homo sapiens 


candidate taste receptor T2RP1 


794 


99 


1760 


ABB06836 


Homo sapiens 


Human nGPCR-Seql048 protein 
sequence oxiQ ID ISI(J:iUd. 


718 


79 


1761 


AAY36115 


Homo sapiens 


Extended human secreted protein 
sequence, obQ ID JNU. dUO. 


514 


87 


1761 


AAW64556 


Homo sapiens 


Human osterocarcinoma cell line U-2 
Uo clone rlr'iLoUD protein. 


'si A 


0 / 


17fv1 
1 /Ol 


cril 49*501 11 


r-fnmo QPiTYipn^ 


un characterized hemato"Doietic 
stem/progenitor cells protein 
MDS029 


514 


87 


1762 


AAG78575 


Homo sapiens 


Human SLC5A3 amino acid 
sequence. 


3621 


100 


1762 


AAB47976 


Homo sapiens 


BCW2. 


3621 


100 


1762 


1^2739094 


Homo sapiens 


sodium/myo-inositol cotransporter 


3621 


100 


1763 


ABB80578 


Homo sapiens 


Human sbg618069LRR protein #2. 


3076 


99 
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1763 


ABB 805 77 


Homo sapiens 


Human sDgoioUoyj^ivK protein wl. 


1 S7? 




1763 


AAWo45yo 


Homo sapiens 


Ammo acid sequence oi trie numan 
iango-/y protein. 


1 9HO 


A A 


1 /o4 


ABBol4oU 


Homo sapiens 


Human aggrecanasc iyij-'I oo proieiii 






1 /04 


guy i / 1 i 3u 


nomo sapiens 




^364 

JJUT 


89 


1764 


AAU72893 


Homo sapiens 


Human metalloprotease partial 

pi UIC1I1 oCt-JUCIft/C ft J . 


2762 


90 


1765 


gi 1695682 


Homo sapiens 


hepatic triglyceride lipase 


152 


64 


1 /CD 




xiomo sapiens 


precursor ^/\/\ -zj io *t / oy 




64 


1 ICS. 
1 /OD 


giJ3yDy3 


Homo sapiens 


irigiyceriae lipase 


1 SO 




1766 


AAM79459 


Homo sapiens 


Human protein SEQ ID NO 3 105. 


1493 


100 


1766 


AAM78475 


Homo sapiens 


Human protein o±iv^ lu JNvi 1 /. 


1 AQ1 


i 


1766 


ABB 11930 


Homo sapiens 


Human secreted protein homologue, 


1493 


100 


1767 


AAM47914 


Homo sapiens 


Human lysopnospnanoic ammoacyi 
transferase 42. 






I /t> / 




nomo sapiens 


nuinaii LrAAi ueita provcin. 






1767 


AAB65188 


Homo sapiens 


Human PRO 10 16 (UNQ499) protein 

roniion/^D OTTO TT\ "Mr^«1 *\& 

sequence ojcv«j iu inu, too. 


324 


90 


1768 


AAE23757 


Homo sapiens 


Human metabotropic glutamate 
^mvjiuiv'fj receptor protein. 


926 


99 


1768 


AAR82658 


Homo sapiens 


Human mGluR4. 


926 


99 


1768 


gill60183 


Homo sapiens 


metabotropic glutamate receptor type 
4 






1769 


AAM41363 


Homo sapiens 


Human polypeptide obCj ID JNU 
6294. 


TJ7A 


yo 


1769 


AAE17500 


T T ■ _ 

Homo sapiens 


Human secretion and trafficking 
protein-9 (SAT-9). 


Z>iU / 


yy 


1 ^7 s~ i~\ 

1769 


A A TV /TO ATTI 

AAM39577 


Homo sapiens 


Human polypeptide bbv,) 1L> NO 

T700 




QQ 

yy 


1770 


A A "Ti CIO 

AAB /3 j 12 


Homo sapiens 


Human transterase tiiro-iy, oriv^ 


9/^A 




1770 


AAB56487 


Homo sapiens 


Human prostate cancer antigen 
protein sequence ciCK^ iu inw. iuod. 


264 


45 


1770 


gi 14249942 


Homo sapiens 


Similar to RIKEN cDNA 
uoiuuuor 10 gene 


264 


45 


1771 


gi7678873 


Homo sapiens 


vascular cadherin-2 


5369 


99 


1771 


•Til AT 1 TA 

gi7407150 | 


Homo sapiens 


protocadnerm iz 


DJOy 


QQ 

yy 


1 ""7*7 1 

1771 


gio 1 0403 / 


Homo sapiens 


vascular enootnenai caonerm z 




QQ 

yy 


1772 


AAM93947 


Homo sapiens 


Human polypeptide, SEQ ID NO: 

41 jo. 


231 


93 


1772 


AAM93886 


Homo sapiens 


Human polypeptide, SEQ ID NO: 

id.ni "5 


231 


93 


1772 


AAM93857 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3y4y. 


231 


93 


1773 


gi 1335205 


Homo sapiens 


ORFII 


270 


48 


1773 


A A "QO/1 QOI 

AABy4oyl 


xiomo sapiens 


Human protein sequence Dxiv 
NO: 16231. 


963 


69 


1773 


AAG73650 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:4414. 


263 


58 


1774 


AAB93885 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13815. 


1088 


85 


1774 


AAM93980 


Homo sapiens 


Human stomach cancer expressed 
polypeptide SEQ ID NO 29. 


1088 


85 



i 
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SEQ 
ID 


Hit ID 


Species 


Description 


s 

score 


% 
Identity 


1774 


gi3603459 


Homo sapiens 


tetraspan NET-5 


1 AOO 

1088 


Of 


1775 


ABB06607 


Homo sapiens 


G protein-coupled receptor GPCR5 
protein SEQ ID NO:24. 


982 


77 


1775 


AAG71597 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1278. 


982 


77 


1775 


gi2 1928553 


Homo sapiens 


seven transmembrane helix receptor 


AOO 

982 


77 


1776 


AAB64888 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID NO:66. 


252 


85 


1776 


AAB3S011 


Homo sapiens 


Human secreted protein encoded by 
gene 3 clone HPJCX13. 


252 


85 


1776 


AAB94917 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16357, 


191 


62 


1777 


gi 182851 


Homo sapiens 


G0S2 protein 


497 


1 AA 

100 


1777 


gil213013 


Mus museums 


G0S2-hke protein 


377 


77 


1777 


AAM79519 


Homo sapiens 


Human protein SEQ ID NO 3 165. 


81 


26 


1778 


ABB89432 


Homo sapiens 


Human polypeptide SEQ ID NO 
1808. 


473 


100 


1778 


ABP61790 


Homo sapiens 


Human polypeptide SEQ ID NO 144. 


473 


100 


1778 


AAY80991 


Homo sapiens 


Human VAMP-2 homologue, 
CBCBMH06. 


473 


100 


1779 


gi5264503 


Mus mus cuius 


sif and Tiaml-hke exchange factor 


oo 

82 


30 


1779 


gi9295309 


frog adenovirus 
1 


pllla protein 


78 


40 


1779 


gi 164 15263 


Listeria innocua 


similar to two-component sensor 
histidine kinase 


78 


27 


1780 


gi2 1693 020 


Homo sapiens 


MHC class I antigen 


1311 


70 


1780 


gil399321 


Macaca mulatta 


MHC class I antigen Mamu B*08 


1307 


71 


1780 


gi8117799 


Pan troglodytes 


MHC class I antigen 


1302 | 


70 


1781 


ABB75677 


Homo sapiens 


T — h * i • 1 A * 1 

Breast protein-eukaryotic conserved 
gene 1 (BSTP-ECG1) protein. 


754 


40 


1781 


AAU29191 


Homo sapiens 


Human PRO polypeptide sequence 
#168. 


754 


40 


1781 


AAY99421 


Homo sapiens 


Human PR01433 (UNQ738) ammo 
acid sequence SEQ ID NO:292. 


754 


40 


1782 


gi2343157 


Homo sapiens 


peroxisomal membrane protein 69 


2796 


89 


1782 


gi2706518 


Homo sapiens 


peroxisomal ABC-transporter 


2796 


89 


1782 


gil5215442 


Homo sapiens 


Similar to ATP-binding cassette, sub- 
family D (ALD), member 4 


2788 


88 


1783 


AAB35235 


Homo sapiens 


Human neurotransmitter transporter 
protein GC42. 


3573 


98 


1783 


AAB35236 


Homo sapiens 


Human glycine transporter type lc. 


3559 


98 


1783 


gi546769 


Homo sapiens 


glycine transporter type lb; GlyT-lb 


3559 


98 


1784 


AAU00017 


Homo sapiens 


Human Plexm-D 1 . 


7512 


90 


1784 


ABB 11709 


Homo sapiens 


Human plexm-Bl/SEP receptor 
homologue, SEQ ID NO:2079. 


7467 


90 


1784 


gi5918167 


Homo sapiens 


plexm-Bl/SEP receptor 


2120 . 


33 


1785 


ABB44591 


Homo sapiens 


Human wound healing related 
poiypepxiae otiv^ iu inu 4o. 


953 


99 


1785 


gi6 19730 


Homo sapiens 


nuclear factor I 


953 


99 


1785 


gi9653290 


Mus sp. 


NFI-X3 


953 


99 


1786 


gil3810568 


Homo sapiens 


Toll-like receptor 5 


4482 


100 


1786 


ABB1 1795 


Homo sapiens 


Human Toll/IL-lR-like protein 
homologue, SEQ ID NO:2165. 


4478 


99 


1786 


gi3 132526 


Homo sapiens 


Toll/interleukin-1 receptor-like 
protein 3 


4464 


99 
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SEQ 
ID 


Hit ID 


Species 


Description 


Ci 

s 

score 


la entity 


1787 


AAB88597 


T T * 

Homo sapiens 


Human hydrophobic domain 
containing protein cione jrijrujo j\j 

if 1 O 1 
?F1Z1. 


OO / 


oZ 


1787 


AAB56473 


Homo sapiens 


Human prostate cancer antigen 
proxem sequence oxiv^ ilj invj, iuj i. 


887 


82 


1787 


AAB60119 


Homo sapiens 


Human transport protein TPPT-39. 


564 


75 


1 TCI Ci 

1788 


AAB80300 


Homo sapiens 


Human prostate cancer anngen tfzo. 






1788' 


AAB80276 


Homo sapiens 


Human prostate cancer antigen #4. 


741 


89 


1788 


gi4929765 


Homo sapiens 


CGI- 148 protein 


741 


89 


1789 


gi22038189 


Escherichia coli 


multidrug transporter 


2265 


100 


1789 


gil736785 


Escherichia coli 


Acriflavin resistance protein F (EnvD 
protein). 


2265 


100 


1789 


gil5980819 


Yersinia pestis 


AcrB/AcrD/AcrF family membrane 
protein 


1854 


79 


1790 


gi 1774 1602 


ribose 

[Agrobacterium 
tumefaciens str. 
C58 (U. 
Washington)] 


ABC transporter, membrane 
spanning protein 


549 


59 


1790 


gil5160166 


Agrobacterium 
tumefaciens str. 
C58 (Cereon) 


AGR_L_3181p 


549 


59 


1790 


gi20515187 


Thermoanaeroba 
cter 

tengcongensis 


Rib ose/xyl ose/ arabinose/galactosi de 
ABC-type transport systems, 
permease components 


340 


38 


1791 


gi 178 8573 


Escherichia coli 
K12 


sn-glycerol-3 -phosphate permease 


1 ATI 

1073 


100 


1791 


gil799587 


Escherichia coli 


glycerol-3-phosphate transport 
protein 


1073 


100 


1791 


gi41587 


Escherichia coli 


glycerol-3-phosphatase transporter 
(AA 1 - 452, glpT) 


1073 


100 


1792 


gi 1790233 


Escherichia coli 
K12 


arylsulfatase 


679 


98 


1792 


gil25 18665 


Escherichia coh 

0157:H7 

EDL933 


arylsulfatase 


679 




1792 


gil3364207 


Escherichia coli 
0157:H7 


arylsulfatase 


679 


no 


1793 


gi21959134 


Yersinia pestis 
KIM 


sulfate transporter 


378 


80 


1793 


gi9657461 


Vibrio cholerae 


sulfate permease family protein 


267 


51 


1793 


gil8145142 


Clostridium 
perfringens str. 
13 


probable sulfate permease 


264 


53 


1794 


gil799719 


PIR Accession 
Number S08346 
[Escherichia 
coiij 


similar to 


922 


100 


1794 


gil5156677 


Agrobacterium 
tumefaciens str. 
C58 (Cereon) 


AGR_C_2926p 


452 


50 


1795 


gilll77166 


Mus musculus 


adhesion molecule ninjurin 


228 


81 


1795 


gi3077901 


Mus musculus 


ninjurin 


228 


81 


1795 


gi 1644366 


Rattus 
norvegicus 


ninjurinl 


228 


81 
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ID 


Hit ID 


Species 


Description 


S 


% 
Identity 


1796 


AAU74824 


Homo sapiens 


Human REPTR 7 protein. 


6196 


92 


1796 


ABB90740 


Homo sapiens 


.Human l umour xinootneiiai iviarKor 
polypeptide SEQ ID NO 212. 


6196 


92 


1796 


ABB90725 


TT „„ ' _ 

Homo sapiens 


riuman 1 umour xinaotnenai iviarKer 
poiypeptiae ojivJ iu rs^ loo. 


6106 


9? 


1797 


ABP61451 


Homo sapiens 


riuman inp-Ko acuvaLiiig piuicni 
SEQ ID NO 55. 


1 ^31 


100 

J. w 


1797 


AAY94343 


Homo .sapiens 


Human cell surface receptor protein 


1331 


100 


1797 


gil3938575 


Homo sapiens 


Similar to RIKEN cDNA 
ZolUD i luzz gene 


1331 


100 


1798 


gi606234 


1 — • — r** r* — 

Escherichia coli 


secY 


953 


100 


1798 


gi42989 


Escherichia coli 


SecY (PrlA) polypeptide (aa 1-443) 


953 


100 


1798 


gi 1642 1976 


Salmonella 
typhimurium 

T TO 


preproxein xransiocase oi nor i.aiiiii>' 




99 


1799 


gi 18255305 


Mus musculus 


p53 apoptosis effector related to 


440 


53 


1799 


gi7582391 


Mus musculus 


p53 apoptosis-associated target 


440 


53 


1799 


AAM50572 


T T * 

Homo sapiens 


Human tumour suppressor proiem 
THW. 


3Q1 


100 


1800 


AAU1 1433 


Homo sapiens 


Human snort-cnam aenyorogenase, 
SCDR. 


^1 


J o 


1800 


AAU18368 


TT _ ■ „ : 

Homo sapiens 


hLuman endocrine poiypepnoe 3iiv^ 
ID No 323. 


5^1 




1800 


AAU 18369 


Homo sapiens 


Human enaocrme poiypeptiae oiii^ 
ID No 324. 




59 


1801 


ABB 12425 


Homo sapiens 


Human bone marrow expressed 
protein oJc-v J^^-'* zo*f. 


1 78 1 
I 1 o L 


Qf) 


1801 


AAM50318 


Homo sapiens 


riuman memorane transponer ^ivi ir) 
^^^^^ 

JJJJU. 


1499 


90 


1801 


AAM83805 


Homo sapiens 


Human immune/haematopoietic 
antigen ojdv^ iu isyj.i Loyo> 


1499 


90 


1802 


gi208 10074 


Homo sapiens 


Similar to pepsinogen 5, group I 
(pepsinogen A^) 


670 


86 


1802 


AAB61351 


Homo sapiens 


Pepsin protein. 


661 


85 


1802 


AABoo589 


riomo sapiens 


numan pepsin. 


661 


85 


1803 


ABB04707 


Homo sapiens 


numan o-roz protein oJCrV^ ix> inv.^. 


1 1 13 


99 


1803 


ABB 849 12 


Homo sapiens 


jiuman ±^jk.ci jdo proiein bcqueiiuc 
SEQ ID NO: 192. 


1113 
1 1 ij 


99 


1803 


a AT S~ C\ A 

AAU76534 


Homo sapiens 


i umour-associatea antigenic targe l 
protein, TAT134. 


1113 


99 


1804 


gi2641217 


Oryctolagus 
cuniculus 


anion exchanger 3 brain isoform 


645 


66 


1804 


gi476222 


Homo sapiens 


anion exchanger 3 brain isoform 


645 


66 


1804 


gi886256 


Homo sapiens 


anion exchange protein 


OH"Z 




1805 


AAM25789 


Homo sapiens 


Human protein sequence SEQ ID 


2659 


56 


1805 


AAB29632 


Homo sapiens 


Human pollinosis-associated gene 
581 -encoded protein, SEQ ID 
NO: 12. 


2659 


56 


1805 


gi 18698435 


Homo sapiens 


pVHL-interacting deubiquitinating 
enzyme 1 type II 


2659 


56 


1806 


AAU29153 


Homo sapiens 


Human PRO polypeptide sequence 
#130. 


1771 


98 
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ID 
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Description 


S 

score 


Identity 


1806 


A A A 7" A AO /TO 

AAY99363 


Homo sapiens 


Human J J KU13oU (^UJN^/1 /; ammo 
acid sequence SEQ ID NO:79. 


1 in i 
l / / 1 




1 OA/T 

1806 


AAG67487 


Homo sapiens 


Ammo acid sequence 01 a numan 
transporter protein. 


1 /OJ 


yo 


1807 


ABB76315 


T T _ _ • 

Homo sapiens 


Human protein phosphatase PP-8 
Incyte ID No. 4022502CD1. 


oo / 


(5 


1807 


gi 15778670 


Mus musculus 


sphingosine- 1 -phosphate 
phosphatase 


Id 1 




1 OAT 

1807 


'A^OO 1 A A 

gi9623190 


Mus musculus 


sphingosine- 1 -phosphate 
phosphohydrolase 


i ri 

ID 1 


3U 


1808 


AAM93947 


Homo sapiens 


Human polypeptide, SEQ ID NO: 

4138. 


231 


93 


1808 


AAM93886 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
4013. 


231 


93 


1 O AO 

1808 


AAM93857 


Homo sapiens 


Human polypeptide, iiiit^ ID JN<J: 

1Q/1Q 

3y^y. 


15 1 


y3 


1 OAO 


gii4D /do /y 


Homo sapiens 


nemicentm 




yo 


1809 


gi3328186 


Caenorhabditis 
eiegans 


hemicentin precursor 


354 


57 


1809 


AAU75886 


Homo sapiens 


Human adhesion molecule protein 

AT1A/A Ami son 1 


185 


33 


1810 


gi3581982 


Homo sapiens 


extraneuronal monoamine transporter 


775 


99 


1 O 1 A 

1810 


gi 142705 13 


Homo sapiens 


organic cation transporter 3 


/ / J 


yy 


1810 


gil3699874 


Mus musculus 


organic cation transporter 3 


720 


89 


1811 


AAB74762 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 18 SEQ ID NO:71. 


266 


88 


1811 


AAB74760 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 18 SEQ ID NO: 69. 


266 


91 


1811 


AAB74759 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 18 SEQ ID NO:68. 


266 


A1 

91 


1812 


gil5082375 


Homo sapiens 


Similar to transmembrane 7 
superfamily member 1 (upregulated 
in Kidney) 


912 


82 


1812 


gil3096836 


Mus musculus 


Similar to transmembrane 7 
superfamily member 1 (upregulated 
in kidney) 


oor 
880 


OA 


1812 


ABB97817 


Homo sapiens 


Human secretory polypeptide 

(brlM) 69. 


432 


68 


1 O 1 't 

1813 


gl 15 04024 


Homo sapiens 


similar to Mouse finger protein(clone 

mKTJ J^oU30 / J). 


OA 1 


D 1 


1813 


gil4549186 


Mus musculus 


zinc finger protein 219 


636 


31 


ion 
1813 




Homo sapiens 


Human polypeptiae IbliQ ID JNvJ 


oiy 


3Z 


t Of A 

1814 


A A V^1^/1 A 

AAYD3044 


rlomo sapiens 


rrotem encoaea oy tne numan 
longevity assurance gene l ^i^/yvji^. 


135Z 


Q1 

y3 


1814 


AAR20230 


Homo sapiens 


hUOG-1. 


1382 


93 


1 O 1 A 

1814 


gl43Z440o 


Homo sapiens 


j^AOi protein 


1 ICO 

13oz 


y3 


1 Q. 1 ^ 
lo ID 




nomo sapiens 


nuiiid.ii cpiucrrriis-spcciiic serine 
protease #2. 




41 


1815 


gi6009515 


Xenopus laevis 


epidermis specific serine protease 


329 


41 


1815 


gi!9353198 


Mus musculus 


RIKEN cDNA 2010001P08 gene 


323 


41 


1816 


AAM42401 


Homo sapiens 


Human polypeptide SEQ ID NO 134. 


706 


100 


1816 


gil786232 


Escherichia coli 
K12 


K+ efflux antiporter, glut athi one- 
regulated 


706 


100 


1816 


gi21321928 


Escherichia coli 


Glutathione-regulated potassium- 


706 


100 
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S 

score 


% 
Identity 








efflux system protein KeiC 
(K(+)/H(+) antiporter). 






1817 


gil657563 


Escherichia coli 


dioxygenase 


1510 


100 


1817 


gil786565 


Escherichia coli 
K12 


taurine dioxygenase, 2-oxoglutarate- 
dependent 


1510 


100 


1817 


gil 054578 


Escherichia coli 


dioxygenase 


1510 


100 


1818 


gi 1787550 


Escherichia coli 
K12 


homolog of Salmonella peptide 
transport permease protein 


928 


98 


1818 


gi 125 15484 


Escherichia coli 

0157:H7 

EDL933 


homolog of Salmonella peptide 
transport' permease protein 


928 


98 


1818 


gil3361335 


Escherichia coli 
0157:H7 


homolog of Salmonella peptide 

• 

transport permease protein 


928 


98 


1819 


gi9652147 


Homo sapiens 


transmembrane-type protein tyrosine 
phosphatase H 


5771 


98 


1819 


gi475004 


Homo sapiens 


protein tyrosine phosphatase 
precursor 


5235 


93 


1819 


gi!321659 


Rattus 
norvegicus 


brain- enriched membrane-associated 
protein tyrosine phosphatase (BEM)- 
2 


1487 


71 


1820 


gi7141127 


Homo sapiens 


Ellis- van Creveld syndrome protein 


1277 


100 


1820 


gi7271903 


Homo sapiens 


DWF-1 


1271 


99 


1820 


gil3506715 


Bos taurus 


Ellis-van Creveld syndrome protein 


1034 


81 


1821 


AAG72370 


Homo sapiens 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2051. 


568 


98 


1821 


AAG71453 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1134. 


568 


98 


1821 


AAE04556 


XT * 

Homo sapiens 


Human G-protein coupled receptor- 
12 (GCREC-12) protein. 


JJO 


1UU 


1822 


ABB89189 


Homo sapiens 


Human polypeptide SEQ ID NO 
1565. 


388 


63 


1822 


AAM90349 


Homo sapiens 


Human immune/haematopoietic 
antigen SEQ ID NO: 17942. 


126 


50 


1822 


AAB95094 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17042. 


94 


41 
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Database entry 
ID 
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Results* 


912 


BL01158 


Macrophage migration inhibitory factor 
family proteins. 


BL01158A 21.81 4.971e-37 124- 
169 BL01158B 17.07 4.343e-23 
169-196 


912 


PF00043 


Glutathione S-transferases. 


PF00043 21.83 5.333e~14 377-407 


913 


PD01719 


PRECURSOR GLYCOPROTEIN 
SIGNAL RE. 


PD01719A 12.89 8.200e-17 180- 
208 PD01719A 12.89 2.364e-13 
123-151 


913 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010C 11.16 2.091e-10 817- 
828 


913 


BL01187 


Calcium-bindmg EGF-hke domain 
proteins pattern proteins. 


BL01187B 12.04 6.538e-16 812- 
828 BL01187B 12.04 5.696e-13 
569-585 BL01187B 12.04 7.261e- 
13 696-712 BL01187A9.98 
l.429e-l0 508-520 BL0H87B 
12.04 2.286e-10 484-500 

HLOiio/A y.y© i./jUe-uy /yo-oUo 


913 


BL00022 


EGF-like domain proteins. 


BL00022B 7.54 1.90Oe-O9 821-828 


913 


BL01177 


Anaphylatoxin domain proteins. 


BL01 177D 17.50 5.167e-09 503- 
521 


913 


BL00281 


Bowman-Birk serine protease inhibitors 
family proteins. 


BL00281A 14.18 6.754e-09 479- 
496 


913 


BL00799 


Granulins proteins. 


BL00799B 11.02 7.429e-09 475- 
511 


913 


PR00907 


THROMBOMODULIN SIGNATURE 


PR00907B 11.29 2.479e-ll 808- 
825 PR00907G 11.63 9.660e-10 
812-839 PR00907G 11.63 9.745e- 
10 696-723 PR00907G 11.63 
9.027e-09 569-596 


914 


BL01158 


Macrophage migration inhibitory factor 
family proteins. j 


BL01158A 21.81 4.073e-33 2-47 
BL01 158B 17.07 2.884e-20 47-74 


915 


BL01158 


Macrophage migration inhibitory factor 
family proteins. 


BL01158B 17.07 4.343e-23 60-87 
BL01 158A 21.81 3.656e-22 2-47 


916 


BL00023 


Type II fibronectin collagen-binding 
domain proteins. 


BL00023 24.31 5.091e-27 101-138 


916 


BL00134 


Serine proteases, trypsin family, 
histidine proteins. 


BL00134B 15.99 8,560e-17 626- 
650 BL00134A 11.96 1.321e-16 
469-486 BL00134C 13.45 4.462e- 
16 662-676 


916 


BL01253 


Type I fibronectin domain proteins. 


BL01253C 15.89 9.027e-40 327- 
366 BL01253B 15.21 5.071e-38 
272-316 BL01253H 13.15 7.070e- 
36 644-679 BL01253E 16.01 
1.000e-34 543-580 BL01253F 
14.35 2.846e-34 581-620 
BL01253A 20.33 7.097e-25 201- 
230 BL01253G 11.34 8.Q85e-18 
625-639 BL01253D 4.84 7.070e-16 
469-483 


y 10 


r*KUU /ZZ 


PROTEASE FAMILY (SI) 
SIGNATURE 


PR00722A 12.27 J.793e-14 470- 
486 PR00722C 10.87 2.059e-13 
625-638 


916 


BL00021 


Kringle domain proteins. 


BL00021D 24.56 3.000e-32 634- 
676 BL00021B 13.33 9.217e-17 
469-487 BL00021C 22.21 4.293e- 
12 554-576 


916 


PR00013 


FIBRONECTIN TYPE II REPEAT 


PR00013C 12.29 4.273e-19 132- 
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Ci T ^"1X T A rpT TT» T™ 1 

SIGNATURE 


148 PR00013A 12.26 6.595e-l 1 
105-115 PR00013B 14.75 1.409e- 
10 116-129 


916 


BL00495 


Apple domain proteins. 


BL00495N 11.04 7.987e-21 618- 
653 BL00495O 13.75 3.311e-17 
653-682 BL00495M 8.50 6.243e- 
10 545-580 


916 


BL00022 


EGF-like domain proteins. 


BL00022B 7.54 1.900e-09 267-274 


916 


PR00018 


KRINGLE DOMAIN SIGNATURE 


PR00018C 14.30 7.750e~22 331- 
352 PR00018A 14.52 2.286e-12 
286-302 PR00018B 17.75 7.818e- 
10 302-315 PR00018D 13.51 
3.531e-09 356-368 


916 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010A 11.79 4.000e-09 160- 
172 PR00010C 11.16 8.071e-09 
182-193 


916 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011D 14.03 8.826e-09 179- 
198 


916 


PD00919 


CALCIUM-BINDING PRECURSOR 
SIGNAL R. 


PD00919A 11.53 1.000e-08 164- 
176 


917 


BL01248 


Laminin-type EGF-like (LE) domain 
proteins. 


BL01248 11.02 4.429e-15 314-327 
BL01248 11.02 1.375e-14 379-392 
BL01248 11.02 7.975e-ll 1011- 
1024 BL0 1248 11.02 5.245e-09 
1069-1082 BL01248 11.02 5. 585e- 
09 899-912 


917 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011A 14.06 8.579e-19 1060- 
1079 PR00011D 14.03 9.250e-17 
1107-1126 PR00011B 13.08 
7.938e-16 846-865 PR00011A 
14.06 1.340e-15 535-554 
PR00011D 14.03 3.800e-15 846- 
865 PR00011A 14.06 5.755e-15 
846-865 PR00011B 13.08 5. 846e- 
15 1107-1126 PR00011D 14.03 
2.286e-14 1060-1079 PR00011B 
13.08 2.333e-14 485-504 
PR00011B 13.08 6.333e-14 1060- 
1079 PR00011D 14.03 7.429e-14 
485-504 PR00011B 13.08 1.458e- 
13 535-554 PR00011D 14.03 
1.849e-13 535-554 PR00011A 
14.06 3.593e-13 1107-1126 
PR00011A 14.06 7.254e-13 485- 
504 PR00011B 13.08 9.847e-13 
798-817 PR0001 1A 14.06 1.58 le- 
12 798-817 PR00011C 24.25 
7.623e-12 817-846 PR00011D 
14.03 1.148e-ll 798-817 
PR00011A 14.06 3.492e-ll 433- 
452 PR0001 ID 14.03 9.262e-ll 
433-452 PR00011C 24.25 8.800e- 
10 440-469 PR00011C 24.25 
3. 143e-09 455-484 PR00011B 
13.08 6.548e-09 433-452 


917 


PD00320 


LAMININ CHAIN EGF-LIKE 


PD00320A 14.49 4.115e-10 813- 
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DOMAIN P. 


827 PD00320A 14.49 8.071 e-09 
1075-1089 


918 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010C 1 1.16 2.636e-10 132- 
143 


918 


BL00790 


Receptor tyrosine kinase class V 
proteins. 


BL00790E 29.58 4.621e-10 666- 
714 


918 


BL01177 


Anaphylatoxin domain proteins. 


BL01 177E 20.64 6.203e-12 377- 
404 BL01 177D 17.50 3.000e-09 
277-295 BL01 177C 17.39 4.000e- 
09 121-140 


918 


BL01187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01 187B 12.04 8.412e-15 379- 
395 BL01187B 12.04 5.304e-13 
299-315 BL01187B 12.04 7.652e- 
13 339-355 BL01187B 12.04 
8.826e-13 127-143 BL01187B 
12.04 6.000e-12 46-62 BL01187B 
12.04 5.200e-ll 258-274 
BL01 187B 12.04 6.400e-10 86-102 
BL01187A 9.98 8.286e-10 110-122 
BL01187A 9.98 5. 125e-09 282-294 
BL01187A 9.98 6.250e-09 68-80 
BL01187A 9.98 8.125e-09 323-335 


918 


PR00907 


THROMBOMODULIN SIGNATURE 


PR00907G 11.63 6.774e-10 46-73 
PR00907B 1 1.29 7.545e-10 215- 
232 PR00907G 1 1.63 9.321e-10 
379-406 PR00907B 11.29 7.805e- 
09 82-99 PR00907B 11.29 9.232e- 
09 335-352 PR00907D 5.59 
9.427e-09 344-370 


919 


BL00335 


Parathyroid hormone family proteins. 


BL00335B 24.23 1.000e-40 133- 
182 BL00335A 17.98 8.393e-39 
80-115 


920 


BL01177 


Anaphylatoxin domain proteins. 


BL01177C 17.39 4.508e-10 92-1 11 


920 


BL01187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01187B 12.04 8.043e-13 12-28 
BL01187B 12.04 1.000e-ll 139- 
155 BL01187A9.98 1.474e-ll 82- 
94 BL01187B 12.04 4.900e-l 1 98- 
114 BL01187A9.98 7.429e-10 
122-134 


920 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011A 14.06 2.964e-14 288- 
307 PR0001 IB 13.08 4.356e-13 
288-307 PR00011D 14.03 6.434e- 
13 376-395 PR00011B 13.08 
2. 8 87e- 12 376-395 PR00011D 
14.03 5.421e-12 288-307 
PR00011D 14.03 5,721e-ll 331- 
350 PR00011B 13.08 5.826e-10 
331-350 PR00011A 14.06 S.957e- 
10 376-395 


920 


DM00864 


EGF-LIKE DOMAIN. 


DM00864A 15.21 3.314e-09 232- 
253 


920 


PR00764 


COMPLEMENT C9 SIGNATURE 


PR00764F 16.89 5.675e-09 236- 
257 


920 


BL00022 


EGF-like domain proteins. 


BL00022B 7.54 7.300e-09 67-74 


920 


BL00799 


Granulins proteins. 


BL00799H 14.15 9.083e-09 227- 
268 
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920 


PR00907 


THROMBOMODULIN SIGNATURE 


PR00907B 11.29 8.714e-10 54-71 
PR00907B 11.29 9.561e-09 135- 
152 


920 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243I 31.77 1.000e-08 387-430 


921 


BL00472 


Small cytokines (intercrine/chemokine) 
C-C subfamily signatur. 


BL00472B 14.67 2.000e-16 35-53 
BL00472A 7.45 4.724e-09 1-13 


922 


BL00132 


Zinc carboxypeptidases, zinc-binding 
region 1 proteins. 


BL00132C 21.35 1.837e-25 227- 
268 BL00132A 26.07 9.791 e-24 
149-190 BL00132E 17.72 2.350e- 
22 297-324 BL00132F 13.26 
5.3 13e-18 325-347 BL00132B 
15.93 5.065e-16 197-211 
BL00132G 10.94 6.318e-14 382- 
400 BL00132D 12.70 5.313e-12 
271-286 


922 


PR00765 


C ARB OX YPEPTIDASE A 
METALLOPROTEASE (Ml 4) 
FAMILY SIGNATURE 


PR00765B 15.57 2.875e-15 197- 
212 PR00765D 14.16 8.412e-15 
330-344 PR00765C 12.55 2.432e- 
09 277-286 


923 


BL00514 


Fibrinogen beta and gamma chains C- 
[ terminal domain proteins. 


BL00514C 17.41 3.618e-25 93-130 
BL00514E 14.28 8.286e-14 153- 
170 BL00514D 15.35 2,915e-12 
134-147 BL00514G 15.98 4.444e- 
12 223-253 


927 


BL00453 


FKBP-type peptidyl-prolyl cis-trans 
isomerase proteins. 


BL00453B 23.86 8.500e-20 80-1 14 
BL00453A 15.57 1.000e-15 55-70 
BL00453C9.72 8.650e-ll 109-122 


937 


BL00796 


14-3-3 proteins. 


BL00796C 17.44 6.250e-38 143- 
193 BL00796B 10.67 1.514e-36 
77-110 BL00796D 17.39 5.696e-34 
194-240 BL00796E 14.15 7.353e- 
29 242-278 


937 


PR00305 


14-3-3 PROTEIN ZETA SIGNATURE 


PR00305F 15.95 3.250e-37 248- 
278 PR00305A9.33 1.500e-33 77- 
107 PR00305D 16.34 6.400e-29 
194-221 PR00305C 8.68 1.000e-28 
159-182 PR00305B 9.99 4.375e-23 
126-151 PR00305E 13.01 3.571e- 
10 221-248 


938 


BL00303 


S-100/ICaBP type calcium binding 
protein. 


BL00303A 21.77 9.526e-31 64-101 
BL00303B 26.15 5.737e-30 111- 
148 


938 


BL00018 


EF-hand calcium-binding domain 
proteins. 


BL00018 7.41 9.471e-ll 123-136 


940 


PD02365 


CHAIN FACTOR INTERLEUKIN- 12 
BETA PRECURSOR IL- 1 . 


PD02365C 7.89 6.680e-10 342-372 


940 


DM00202 


w T-CELL IG HEAVY ALPHA. 


DM00202A 9.44 9.813e-09 34-44 


944 


BL00284 ] 


Serpins proteins. 


BL00284C 28.56 6.400e-25 319- 
361 BL00284E 19.15 1.000e-15 
508-533 BL00284A 15.64 2.742e- 
15 200-224 BL00284B 17.99 
6. 182e-12 292-313 BL00284D 
16.34 7.070e-12 430-457 


944 


PD02080 


T-CELL GLYCOPROTEIN CDS 
CHAIN SURFACE ALPHA PRE. 


PD02080A 10.03 9.750e-10 35-51 
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944 


PR00743 


GLYCOSYL HYDROLASE FAMILY 
36 SIGNATURE 


PR00743B 14.95 8.831e-09 240- 
261 


947 


DM01970 


0 kw ZK632.12 YDR313C 
ENDOSOMAL III. 


DM01970B 8.60 4.750e-17 552- 
565 


947 


BL01160 


Kmesm light chain repeat proteins. 


BL01160B 19.54 2.373e-09 203- 
257 


947 


BL00518 


Zinc finger, C3HC4 type (RING 
finger), proteins. 


BL00518 12.23 4.000e-09 559-568 


947 


BL00422 


Granins proteins. 


BL00422E 26.86 8.615e-09 462- 
498 


950 


PR00356 


TYPE II ANTIFREEZE PROTEIN 
SIGNATURE 


PR00356D 13.09 8.038e-09 174- 
191 


951 


BL00615 


C-type lectin domain proteins. 


BL00615A 16.68 4.316e-13 209- 
227 


951 


PR00356 


TYPE II ANTIFREEZE PROTEIN 
SIGNATURE 


PR00356B 14.85 6.294e-10 209- 
227 PR00356A 12.90 7.188e-10 
197-210 


952 


BL00570 


Bacterial ring hydroxylating 
dioxygenases alpha-subumt signa. 


BL00570B 19.03 9.357e-09 271- 
303 


953 


PD00126 


PROTEIN REPEAT DOMAIN TPR 
NUCLEA. 


PD00126A 22.53 1.931e-09 72-93 


953 


PR00910 


LUTEOVIRUS ORF6 PROTEIN 
SIGNATURE 


PR00910A2.51 3.571e-09 107-120 


953 


BL00415 


Synapsins proteins. 


BL00415N 4.29 5.730e-09 13-57 


954 


PR00901 


PHEROMONE B ALPHA- 1 
RECEPTOR SIGNATURE 


PR00901H 14.99 4.706e-09 56-67 


958 


PR00138 


MATRIXIN SIGNATURE 


PR00138C 16.41 6.478e-32 178- 
207 PR00138D 16.56 1.360e-28 
236-262 PR00138B 15.82 S.071e- 
18 155-171 PR00138A 15.14 
5.091e-16 108-122 PR00138E6.01 
9.250e-15 269-283 


958 


BL00024 


Hemopexin domain proteins. 


BL00024C 22.98 1.000e-40 177- 
226 <BL00024B 21.53 2.636e-33 
129-163 BL00024D 17.28 4.086e- 
31 230-262 BL00024F 11.30 
2.73 le-22 305-326 BL00024H 
11.35 1.947e-14 41 1-423 
BL00024E 7.58 5.500e-14 269-283 
BL00024G 13.31 4.115e-13 342- 
355 BL00024A 11.49 5.050e- 13 
108-119 


958 


BL00142 


Neutral zinc metallopeptidases, zinc- 
binding region proteins. 


BL00142 8.38 3.455e-l 1 236-247 


958 


PR00480 


ASTACIN FAMILY SIGNATURE 


PR00480B 15.41 1.643e-09 231- 
250 


958 


BL00546 


Matrixins cysteine switch. 


BL00546B 20.11 1.000e-40 178- 
222 BL00546C 16.41 1.771e-31 
230-262 BL00546A 19.62 5.304e- 
28 88-118 BL00546E 10.23 
3.323e-22 305-326 BL00546G 
16.84 7.300e-19 363-383 
BL00546D 10.34 1.486e-14 269- 
283 BL00546F 12.40 2.800e-13 
342-355 BL00546H 10.76 6.625e- 
12 458-469 BL00546H 10.76 
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A C t >~\ All A T*l 

4.512e-09 411-422 


959 


PD02043 


RIBULOSE BISPHOSPHATE 
CARBOXYLAS. 


PD02043A 12.92 5,800e-09 137- 
171 


962 


BL00598 


Chromo domain proteins. 


BL00598 14.45 5.781e-16 63-85 


962 


PR00504 


CHROMODOMAIN SIGNATURE 


PR00504C 11.19 5.186e-10 72-85 
PR00504B 9.12 3.250e-09 57-72 


967 


PF00420 


NADH-ubiquinone/plastoquinone 
oxidoreductase chain 4L. 


PF00420A 16.63 9.526e-10 89-120 


967 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245C 7.84 5.000e-16 185-201 
PR00245A 18.03 6.878e-16 59-81 
PR00245D 10.47 8.500e-15 221- 
233 PR00245E 12.40 1.000e-09 
238-253 


967 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237A 11.48 5.065e-09 26-51 


967 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 2.184e-12 90-130 
BL00237D 11.23 5.909e-09 229- 
246 


968 


BL00059 


Zinc- containing alcohol 
dehydrogenases proteins. 


BL00059B 16.08 5.705e-14 223- 
251 


968 


BL01162 


Quinone oxidoreductase / zeta- 
crystallin proteins. 


BL01 162C 22.80 5.846e-14 284- 
328 BL01162B 18.11 4.205e-ll 
223-251 BL01162A 15.38 3.805e- 
09 194-217 


970 


PF00676 


Dehydrogenase El component. 


PF00676B 24.71 2.800e-39 257- 
295 PF00676D 14.40 7.545e-24 
341-361 PF00676C 16.88 5.737e- 
23 309-333 PF00676A 12.85 
5.050e-12 131-144 


970 


BL00801 


Transketolase proteins. 


BL00801D 22.48 7.750e-l 1 250- 
290 


977 


PR00457 


ANIMAL IIAEM PEROXIDASE 
SIGNATURE 


PR00457E 20.67 9.591e~26 409- 
436 PR00457D 16.81 5.667e-22 
384-405 PR00457G 17.45 l.OOOe- 
15 590-611 PR00457B 13.29 
4.343e-15 216-232 PR00457C 
19.25 3.250e-12 366-385 
PR00457A 15.80 5.645e-12 162- 
174 PR00457H 15.90 1.900e-10 
684-699 PR00457F 13.69 6.800e- 
09 461-472 


978 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 3.942e-17 95-124 
BL00269B 19.17 4.122e-15 57-86 
BL00269A 8.53 3.250e-13 31-51 


982 


DM0003 1 


IMMUNOGLOBULIN V REGION. 


DM0003 IB 15.41 8.297e-10 79- 
113 


985 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 9.663e-09 78-106 


987 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 1.000e-09 247- 
261 PR00019B 11.36 2.080e-09 
316-330 PR00019B 11.36 3. 160e- 
09 244-258 PR00019B 11.36 
4.240e-09 100-114 PR00019B 
11.36 5.680e-09 148-162 


988 


BL01215 


Mrp family proteins. 


BL01215C 18.97 6.447e-36 191- 
233 BL01215D 30.07 1.000e-33 
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250-300 BL01215A9.75 4.400e-22 

iai -too r>T ai o 1 <Xi Q 74. 9 % 
It/ 1-1— -o lj\-X) i _ i .j ij y.D 1 * _.ujoc ij 

134-147 


988 


PR00364 


DISEASE RESISTANCE PROTEIN 
SIGNATURE 


j^KUUod4A o.iy z.uz:5e-uy iu>i_i 


988 


BL00746 


NifH/frxC family proteins. 


o t nr\'7/1^A OA A'X 9 Q1 Qp AQ 1 A7_ 

_>L,UU/40A _-4.4j Z.oiye-vy iu/- 

152 


988 


PR00091 


NITROGENASE COMPONENT II 
SIGNATURE 


r>t> aaaoi A Q i A A 790^ no 1 HQ 1 97 
rKUUUyiA o.lU 4.J,_ye-uy iuy-izo 


988 


BL01128 


Shikimate kinase proteins. 


"OT All TO A 10 QA Q 1 <=» HQ 1 Af^ 

BLUllzoA lo.o4 o.zzie-uy iuo- 
140 


989 


PF00094 


von Willebrand factor type D domain 
proteins. 


rx*uuuy4_> ivj.4.j o.4uue-i/ ^71- 
509 


989 


PF00054 


-T- * • 1 * 2 " 

Lammm G domain proteins. 


dttaaa^/IO £1 1 '\C\C\c* HQ ^SR 

rrUUlo4_> lo.ol /.juue-uy ojo- 
670 


989 


BL00779 


Glycoprotein hormones alpha chain 
proteins. 


DT AAT7Q A 1/1 A1 T QiHo HQ 79Q 

_5JLUU//yA 14. ui /.o4ue-uy J_y- 
356 


989 


BL01185 


C-termmal cystine knot proteins. 


DT All QCTJ O 1 1/1 Q KOa AQ 97 R_ 

r>_,ll 1 icOo z 1 . 14 y. 10 JQ-vy Zoo- 

287 


992 


DM0003 1 


IMMUNOGLOBULIN Y REGION. 


DMUUUolr> ID. 41 o.yooe-11 /o- 
112 


992 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


■QT aaogaa 9n, so 9 nnnp aq 1 
177 


994 


BL00240 


Receptor tyrosine kinase class III 
proteins. 


BL00240B 24.70 4.692e-ll 148- 
172 


994 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DMuOl/y 13. y / j.3U4e-uy iui-111 


994 


PD02870 


RECEPTOR INTERLEUKIN- 1 
PRECURSOR. 


PD02870D 15.74 9.234e-09 274- 
309 


995 


BL01009 


Extracellular proteins SCP/Tpx- 
1/Ag5/PR-1/Sc7 proteins. 


BL01009D 14.19 4.300e-/0 12/- 
148 BL01009A 13.75 6.586e-13 
57-75 BL01009E 13.50 1.439e-ll 
159-175 


995 


PR00837 


ALLERGEN V5/TPX-1 FAMILY 
SIGNATURE 


PR00837C 17.21 6.l43e-20 126- 
143 PR00837A 14.77 1.973e-13 
57-76 PR00837D 11.12 3.700e-ll 
160-174 


995 


PR00838 


VENOM ALLERGEN 5 SIGNATURE 


PR00838G 16.07 2.033e-17 125- 

1 a r Tin AA01 on O A 01 /Iq AO C H 

145 PR00o3oD o. / J 4.214e-uy D /- 
76 


996 


PD01270 


RECEPTOR FC 
IMMUNOGLOBULIN AFFIN. 


PD01270B 22.18 j.Do/e-lo 111- 
148 PD01270C 19.54 1.167e-17 
154-183 PD01270A 17.22 4.960e- 
14 57-97 PD01270D 24.66 4.284e- 
09 188-224 


999 


BL01305 


moaA / nifB / pqqE family proteins. 


BL01305D 14.97 7.279e-09 7-22 


1000 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 4.o00e-12 42-oD 
BL00290B 13.17 1.474e-ll 98-116 


1002 


PD00066 


PRO I LIN ZlIN C-rlJN LrxiK Mii 1 AJL- 
BINDI. 


pnnnnfifi n Q? S 846e-l S 785-298 

1_J UUUUU 1 J.J/Z/ w'.OT^V/t/ 1 ^ Z/ O— ' 

PD00066 13.92 1.600e-14 201-214 
PD00066 13.92 2.800e-14 313-326 
PD00066 13.92 2.000e-13 341-354 
PD00066 13.92 5.500e-13 229-242 
PD00066 13.92 8.435e-ll 257-270 


1002 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 9.471e-14 269-286 
BL00028 16.07 3.769e-ll 241-258 
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BL00028 16.07 7.577e-ll 325-342 
BL00028 16.07 7.577e-ll 353-370 
BL00028 16.07 6.700e-10 185-202 
BL00028 16.07 1.257e-09 297-314 


1002 


PR00043 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 1.000e-12 266- 
280 PR00048A 10.52 3.118e-12 
238-252 PR00048A 10.52 8.941e- 
12 294-308 PR00048B 6.02 
2.688e-10 310-320 PR00048B 6.02 
3.250e-10 198-208 PR00048A 
10.52 3.348e-10 322-336 
PR00048A 10.52 3.739e-10 210- 
224 PR00048A 10.52 5.696e-10 
350-364 PR00048A 10.52 l.OOOe- 
09 182-196 PR00048B 6.02 
1.947e-09 282-292 PR00048B 6.02 
5.737e-09 338-348 


1003 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 4.706e- 14 607-624 
BL00028 16.07 6.400e-13 411-428 
BL00028 16.07 9.550e-13 579-596 
BL00028 16.07 1.000e-l 2 439-456 
BL00028 16.07 1.391e-12 495-512 
BL00028 16.07 1.7S3e- 12 523-540 
BL00028 16.07 7.652e-12 383-400 
BL00028 16.07 9.217e-12 355-372 
BL00028 16.07 9.217e-12 663-680 
BL00028 16.07 5.846e-ll 467-484 
BL00028 16.07 6.538e-ll 635-652 
BL00028 16.07 2.800e-10 327-344 
BL00028 16.07 6.143e-09 262-279 


1003 


PD00066 


PROTEIN ZINC-FINGER METAL - 
BINDI. 


PD00066 13.92 4.462e-15 399-412 
PD00066 13.92 8.615e-15 343-356 
PD00066 13.92 1.600e- 14 427-440 
PD00066 13.92 2.800e-14 511-524 
PD00066 13.92 2.800e-14 623-636 
PD00066 13.92 5.200e-14 595-608 
PD00066 13.92 6.400e-14 567-580 
PD00066 13.92 8.800e-14 371-384 
PD00066 13.92 7.000e-13 315-328 
PD00066 13.92 9.000e-13 483-496 
PD00066 13.92 6.870e-ll 455-468 
PD00066 13.92 7.600e-09 539-552 
PD00066 13.92 7.900e-09 651-664 


1003 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 9.182e-15 492- 
506 PR00048A 10.52 1.750e-14 
408-422 PR00048A 10.52 7.000e- 
14 576-590 PR00048A 10.52 
1.000e-13 604-618 PR00048A 
10.52 3.571e-13 380-394 
PR00048A 10.52 4.214e-13 464- 
478 PR00048A 10.52 5.235e- 12 
352-366 PR00048B 6.02 L000e-ll 
424-434 PR00048B6.02 1.692e-ll 
340-350 PR00048B 6.02 7.23 le-11 
536-546 PR00048B 6.02 7.23 le-11 
620-630 PR00048A 10.52 8.579e- 
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11 259-273 PR00048A 10.52 
9.053e-ll 436-450 PR00048B 6.02 
9.308e-ll 508-518 PR00048B 6.02 
1.563e-10 592-602 PR00048B 6.02 
2. 125e~ 10 396-406 PR00048A 
10.52 4.130e-10 660-674 
PR00048A 10.52 4.522e-10 520- 
534 PR00048A 10.52 6.478e- 10 
324-338 PR00048B6.02 1.474e-09 
564-574 PR00048B 6.02 2.42 le-09 
676-686 PR00048A 10.52 2.800e- 
09 548-562 PR00048A 10.52 
2.800e-09 632-646 PR00048B 6.02 
2.895e-09 368-378 PR00048B 6.02 
1.000e-08 648-658 


1007 


BL00478 


LIM domain proteins. 


BL00478B 14.79 3.739e-14 469- 
484 BL00478B 14.79 3.500e-12 
411-426 BL00478B 14.79 6.000e- 
12 536-551 


1007 


DM00984 


w MYOD MYOBLAST 
DETERMINATION SHORT. 


DM00984B 15.18 4.822e-25 426- 
481 DM00984C7.66 8.036e-ll 
481-495 


1007 


BL00115 


Eukaryotic RNA polymerase II 
heptapeptide repeat proteins. 


BL00115Z 3.12 7.022e-09 260-309 


1008 


DM00475 


w LOW TRANSPOSASE SAPA 12K. 


DM00475B 12.12 8.269e-09 115- 
135 


1011 


PR00456 


RIBOSOMAL PROTEIN P2 
SIGNATURE 


PR00456F 5.86 6.400e-12 76-88 


1012 


PF00756 


Putative esterase. 


PF00756C 14.12 7.692e-10 103- 
133 


1016 


PR00310 


ANTIPROLIFERATIVE PROTEIN 
BTG1 FAMILY SIGNATURE 


PR00310D 9.10 1.540e-37 183-213 
PR00310C 12.74 5.286e-35 153- 
183 PR00310A 11.17 7.000e-27 
16-41 PR00310E 13.58 6.914e-24 
229-249 PR00310B 10.59 3.687e- 
23 123-153 


1016 


BL00960 


BTG1 family proteins. 


BL00960B 24.47 3.288e-26 116- 
161 BL00960C 12.68 3.647e-26 
180-202 BL00960A 10.98 5.304e- 
12 14-26 


1017 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 6.143e-32 44-83 


1017 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 6.063e-15 523-540 
BL00028 16.07 7.188e-15 383-400 
BL00028 16.07 3.700e-13 467-484 
BL00028 16.07 5.950e-13 439-456 
BL00028 16.07 8.650e-13 271-288 
BL00028 16.07 4.115e-ll 355-372 
BL00028 16.07 6.885e-ll 327-344 
BL00028 16.07 2.800e- 10 411-428 
BL00028 16.07 3.100e-10 495-512 


1017 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 6.625e-18 399-412 
PD00066 13.92 2.385e-15 371-384 
PD00066 13.92 8.615e-15 343-356 
PD00066 13.92 3.500e-13 511-524 
PD00066 13.92 l.OOOe- 12 483-496 
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PD00066 13.92 1.000e-l 1 427-440 
PD00066 13.92 3.769e-10 455-468 
PD00066 13.92 5.800e-09 539-552 


1017 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 6.727e-15 520- 
534 PR00048A 10.52 1.750e-14 
380-394 PR00048A 10.52 3.250e- 
14 408-422 PR00048A 10.52 
4.857e-13 464-478 PR00048A 
10.52 1.529e-12 436-450 
PR00048A 10.52 2.059e-12 352- 
366 PR00048B6.02 3.769e-ll 
368-378 PR00048B 6.02 5.846e-ll 
396-406 PR00048B6.02 5.846e-ll 
508-518 PR0004SA 10.52 1.783e- 
10 492-506 PR00048B 6.02 
8.3 13e-l 0 480-490 PR00048A 
10.52 4.240e-09 324-338 
PR00048B 6.02 5.737e-09 340-350 
PR00048A 10.52 8.200e-09 146- 
160 


1020 


BL00478 


LIM domain proteins. 


BL00478B 14.79 6.000e-15 219- 
234 BL00478B 14.79 8.250e-12 
99-114 BL00478B 14.79 8.250e-12 
160-175 BL00478B 14.79 2.800e- 
11 282-297 


1024 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NIL 


PD01066 19.43 9.400e-38 17-56 


1024 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDL 


PD00066 13.92 1.600e-14 230-243 
PD00066 13.92 8.200e- 14 286-299 
PD00066 13.92 8.200e-14 398-411 
PD00066 13.92 1.000e-13 342-355 
PD00066 13.92 6.478e-ll 314-327 
PD00066 13.92 6.478e-ll 370-383 
PD00066 13.92 6.870e-ll 258-271 


1024 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL0002S 16.07 6.850e-13 354-371 
BL00028 16.07 4.522e-12 242-259 
BL00028 16.07 8.826e-12 214-231 
BL00028 16.07 9.609e-12 270-287 
BL00028 16.07 2.73 le- 11 326-343 
BL00028 16.07 1.900e-10 382-399 
BL00028 16.07 8.500e-10 298-315 
BL00028 16.07 5.629e-09 410-427 


1024 


BL00479 


Phorbol esters / diacylglycerol binding 
domain proteins. 


BL00479A 19.86 7.319e-09 201- 
224 


1024 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 4.273e-15 351- 
365 PR00048A 10.52 5.500e-13 
239-253 PR00048A10.52 5.500e- 
13 267-281 PR00048A 10.52 
8.941e-12 323-337 PR00048A 
10.52 5.263e-ll 379-393 
PR00048A 10.52 1.783e-10 407- 
421 PR00048B 6.02 3.250e-10 
339-349 PR00048B 6.02 8.875e-10 
227-237 PR00048B6.02 1.000e-09 
283-293 PR00048B 6.02 1.000e-09 
395-405 PR00048A 10.52 1.360e- 
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09 295-309 PR00048B 6.02 
2.895e-09 423-433 PR00048A 
10.52 7.480e-09 21 1-225 


1025 


BL00824 


Elongation factor 1 beta/beta7delta 
chain proteins. 


BL00824B 9.21 4.892e-09 394-414 


1025 


PR00966 


POTYVIRUS NUCLEAR 
INCLUSION A CYSTEINE 
PROTEASE (C4) SIGNATURE 


PR00966H 13.17 9.727e-09 215- 
235 


1026 


BL01282 


BIR repeat proteins. 


BL01282B 30.49 3.829e-09 12-51 


1026 


PF00992 


Troponin. 


PF00992A 16.67 5.263e-09 136- 
171 


1026 


BL00518 


Zinc finger, C3HC4 type (RING 
finger), proteins. 


BL00518 12.23 8.333e-09 31-40 


1029 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 5.613e-09 171- 
199 


1029 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 5.213e-10 91-137 
BL00203 13.94 5.041e-09 71-117 
BL00203 13.94 8.898e-09 53-99 


1031 


BL00170 


Cyclophilin-type peptidyl -prolyl cis- 
trans isomerase signatur. 


BL00170B 20.97 1.000e-40 48-88 
BL00170C 18.49 L000e-40 95-140 
BL00170A 17.08 2.452e-16 18-45 


1031 


PR00153 


CYCLOPHILIN PEPTIDYL-PROLYL 
CIS-TRANS ISOMERASE 
SIGNATURE 


PR00153C 11,01 4.375e-20 96-112 
PR00153B 11.57 1.500e-17 53-66 
PR00153E 9.10 7.632e-17 124-140 
PR00153D 11.99 6.400e-16 1 1 1- 
124 PR00153A 12.98 3.093e-ll 
24-40 


1031 


PFO0638 


RanBPl domain proteins. 


PF00638 11.91 5.569e-09 68-83 


1035 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 2.149e-29 6-45 


1035 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 1.692e-15 234-247 
PD0Q066 13.92 3.077e-15 374-3S7 
PD00066 13.92 3.077e-15 458-471 
PD00066 13.92 5.846e-15 486-499 
PD00066 13.92 3.400e-14 150-163 
PD00066 13.92 5.800e-14 206-219 
PD00066 13.92 5.800e-14 290-303 
PD00066 13.92 5.800e-14 318-331 
FD00066 13.92 8.800e~ 14 514-527 
PD00066 13.92 3.500e-13 346-359 
PD00066 13.92 7.500e-13 402-415 
PD00066 13.92 8.714e-12 178-191 
PD00066 13.92 3.700e-09 430-443 
PD00066 13.92 5.500e-09 262-275 


1035 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 4.750e-14 215- 
229 PR00048A 10.52 4.750e-14 
467-481 PR00048A 10.52 7.000e- 
14 355-369 PR00048A 10.52 
4.857e-13 299-313 PR00048A 
10.52 5.500e-13 99-113 PR00048A 
10.52 5.500e-13 327-341 
PR00048A 10.52 1.000e-12 495- 
509 PR00048A 10.52 2.059e-12 
271-285 PR00048A 10.52 2.588e- 
12 523-537 PR00048A 10.52 
8.412e-12 243-257 PR00048A 
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10.52 4.316e-ll 131-145 
PR00048B 6.02 4.462e-ll 115-125 
PR00048A 10.52 8.579e-ll 439- 
453 PR00048A 10.52 9.053e-ll 
383-397 PR00048A 10.52 1.391e- 
10 159-173 PR00048B 6.02 
1.563e-10 455-465 PR00048A 
10.52 3.739e-10 411-425 
PR00048B 6.02 6.625e-10 203-213 
PR00048B 6.02 6.625e-10 287-297 
PR00048A 10.52 3.160e-09 551- 
565 PR00048A 10.52 4.600e-09 
187-201 PR00048B 6.02 8.579e-09 
147-157 


1035 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 1.900e-13 358-375 
BL00028 16.07 2.350e-13 102-119 
BL00028 16.07 8.826e-12 470-487 
BL00028 16.07 2.038e-ll 554-571 
BL00028 16.07 5.500e-ll 190-207 
BL00028 16.07 5.846e-ll 274-291 
BL00028 16.07 6. 192e-l 1 218-235 
BL00028 16.07 8.269e-ll 330-347 
BL00028 16.07 1.300e-10 526-543 
BL00028 16.07 1.600e-10 414-431 
BL00028 16.07 3.700e-10 386-403 
BL00028 16.07 6.700e-10 442-459 
BL00028 16.07 7.000e-10 302-319 
BL00028 16.07 1.000e-09 134-151 
BL00028 16.07 3.314e-09 246-263 
BL00028 16.07 L000e-08 498-515 


1038 


BL01130 


Sulfate transporters proteins. 


BL01130A 21.63 7.407e-25 331- 
385 BL01130B 23.34 2.286e-23 
429-481 


1038 


DM01292 


ESICULAR LUMEN DOMAIN. 


DM01292I 12.82 9.400e-10 148- 
190 DM01292I 12.82 9.400e-10 
591-633 


1042 


PD02327 


GLYCOPROTEIN ANTIGEN 
PRECURSOR IMMUNOGLO. 


PD02327B 19.84 5.574e-10 169- 
191 


1042 


PD02365 


CHAIN FACTOR INTERLEUKIN-12 
BETA PRECURSOR IL-1. 


PD02365C 7.89 4.196e-09 365-395 


1043 


BL00983 


Ly-6 / u-PAR domain proteins. 


BL00983C 12.69 3.500e-15 142- 
158 BL00983B 8.19 1.643e-12 84- 
94 BL00983A 5.84 7.2ole-10 /4- 
83 


1047 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290B 13.17 6.400e-22 281- 
299 BL00290A 20.89 4.600e-16 
34-57 BL00290A 20.89 2.080e-lU 
224-247 


1051 


BL01221 


PMF-/2 / tiJYLr / ivirzu iamny proteins. 


T*T 01991R 13 29 6 745e-09 54-68 


1052 


BL00983 


Ly-6 / u-PAR domain proteins. 


BL00983C 12.69 4.981e-09 46-62 


1052 


BL00272 


Snake toxins proteins. 


BL00272C 8.27 8.326e-09 50-62 


1054 


BL00420 


Speract receptor repeat proteins domain 
proteins. 


BL00420B 22.67 4.627e-30 723- 
778 BL00420C 11.90 9.100e-13 
809-820 


1054 


PR00258 


SPERACT RECEPTOR SIGNATURE 


PR00258B 9.63 3.813e-15 738-750 
PR00258E 13.33 2.047e-12 808- 
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821 PR00258C 9.05 2.837e-10 
753-764 


1054 


BL00514 


Fibrinogen beta and gamma chains C- 
terminal domain proteins. 


BL00514G 15.98 4.326e-09 542- 
572 


1055 


BL01212 


ATP P2X receptors proteins. 


BL01212A 34.89 1.000e-40 41-94 
BL01212E 24.87 1.000e-40 225- 
280 BL01212G 11.86 3.700e-34 
309-337 BL01212D 11.42 9.609e- 
27 182-206 BL01212B 19.25 
8.393e-21 126-151 BL01212F 
10.12 2.421e-15 290-301 
BL01212C 8.40 2.500e-14 158-169 


1056 


PR00920 


SPUMAVIRUS ASPARTIC 
PROTEASE (A9) SIGNATURE 


PR00920C 13.24 7.310e-09 149- 
171 


1057 


BL00682 


ZP domain proteins. 


BL00682C 20.71 1.706e-12 439- 
464 


1057 


BL00025 


P-type Trefoil' domain proteins. 


BL00025 17.17 5.645e-09 231-252 


1059 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 4.273e-14 98-138 


1059 


PR00854 


PROSTAGLANDIN D RECEPTOR 
SIGNATURE 


PR00854E 10.50 4.649e-26 236- 
260 PR00854B 7.30 8.154e-21 41- 
59 PR00854G 10.66 1.783e-18 
341-358 PR00854D9.41 2.500e-18 
185-201 PR00854A 15.24 9,077e- 
18 6-21 PR0O854H 14.71 6.203e- 
17 369-390 PR00854C 12.92 
1.643e-12 93-105 PR00854F 12.83 
9.682e-ll 321-333 


1059 


PR00856 


PROSTACYCLIN (PROSTANOID IP) 
RECEPTOR SIGNATURE 


PR00856E 9.82 1.724e-09 178-195 


1060 


BL01271 


Sodium: sulfate symporter family 
proteins. 


BL01271D 25.26 L000e-40 480- 
535 BL01271B 12.02 6.400e-24 
208-233 BL01271A 8.06 7.955e-23 
132-152 BL01271C 13.62 7.429e- 
20 407-429 


1062 


PF00798 


Arenavirus glycoprotein. 


PF00798I 18.55 8.811e-09 53-90 


1064 


BL01017 


Ergosterol biosynthesis ERG4/ERG24 
family proteins. 


BL01017D 20.82 1.000e-40 232- 
278 BL01017F 23,34 9.196e-35 
291-344 BL01017C 15.91 7.324e- 
23 181-207 BL01017B 12.69 
9.4l9e-17 loo-lol 


1065 


BL00874 


Bacterial type II secretion system 
protein F proteins. 


BL00874B 29.89 9.724e-09 414- 
469 


1066 


PD01270 


RECEPTOR FC 
IMMUNOGLOBULIN AFFIN. 


PD01270C 19.54 2,895c-lo 43-72 


1066 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DM00179 13.97 8.435e-09 183-193 


1067 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 3.455e-14 77-117 


1067 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237C 15.69 1.257e-10 91-114 
199 


1067 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 9.581e-18 46-68 
PR00245C 7.84 4.780e-13 214-230 
PR00245E 12.40 6.741e-09 267- 
282 


1067 


PR00534 


MELANOCORTIN RECEPTOR 
FAMILY SIGNATURE 


PR00534A 11.49 9.229e-09 38-51 
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1069 


BL00107 


Protein kinases ATP-binding region 
proteins. i 


BL00107A 18.39 5.909e-15 409- 
440 BL00107B 13.31 4.214e-ll 
484-500 


1069 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109E 14.41 4.353e-09 549- 
572 


1071 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 4.789e-13 222- 
245 


1072 


PR00308 


TYPE I ANTIFREEZE PROTEIN 
SIGNATURE 


PR00308C 3.83 8.892e-10 4-14 
PR00308C3.83 8.892e-10 5-15 
PR00308C3.83 8.013e-09 3-13 


1072 


PR00698 


C.ELEGANS SRG FAMILY 
INTEGRAL MEMBRANE PROTEIN 
SIGNATURE 


PR00698E 14.43 8.714e-09 111- 
137 


1075 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 7.000e-ll 69-85 
PF00023B 14.20 2.636e-09 131- 
141 


1075 


PD00078 


REPEAT PROTEIN ANK NUCLEAR 
ANKYR. 


PD00078B 13.14 6.087e-09 128- 
141 


1075 


PR00806 


VINCULIN SIGNATURE 


PR00806C 11.07 8.839e-09 350- 
368 


1075 


PF00791 


Domain present in ZO-1 and Unc5-like 
netrin receptors. 


PF00791B 28.49 9.505e-09 135- 
190 PF0079 IB 28.49 9.835e-09 
69-124 


1076 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 5.610e-ll 174- 
214 BL00237C 13.19 4.176e-10 
317-344 


1076 


PR00237 


RHODOP SIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237F 13.57 7.677e-ll 322- 
347 PR00237E 13.03 6.100e-10 
276-300 PR00237A 11.48 8.839e- 
09 103-128 


1077 


BL00216 


Sugar transport proteins. 


BL00216B 27.64 1.831e-09 139- 
189 


1079 


BL00462 


Gamma-glutamyltranspeptidase 
proteins. 


BL00462A 20.89 4.000e-20 108- 
151 BL00462D 23.07 7.256e- 12 
356-396 BL00462B 17.88 9.153e- 
12 183-220 


1080 


BL00115 


Eukaryotic RNA polymerase II 
heptapeptide repeat proteins. 


BL00115Z 3.12 2.125e-09 1363- 
1412 BL00115Z 3.12 6.096e-09 
1349-1398 


1083 


BL00272 


Snake toxins proteins. 


BL00272C 8.27 9.182e-10 109-121 


1083 


BL00107 


Protein kinases ATP-binding region 
proteins. 


BL00107A 18.39 3.348e-14 366- 
397 BL00107B 13.31 4.176e-09 
441-457 


1083 


PR00653 


ACTIVIN TYPE II RECEPTOR 
SIGNATURE 


PR00653D 13.25 7.200e-09 385- 
407 


1083 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109E 14.41 6.727e-ll 535- 
558 PR00109D 17.04 7.609e-09 
442-465 


1083 


BL00983 


Ly-6 / u-PAR domain proteins. 


BL00yo3C iz.oy y. i33e-uy iio- 
121 


1084 


BL00272 


Snake toxins proteins. 


BL00272C 8.27 9.182e-10 109-121 


1084 


BL00107 


Protein kinases ATP-binding region 
proteins. 


BL00107A 18.39 3.348e-14 428- 
459 BL00107B 13.31 4.176e-09 
503-519 


1084 


PR00653 


ACTIVIN TYPE II RECEPTOR 
SIGNATURE 


PR00653D 13.25 7.200e-09 447- 
469 
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1084 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109E 14.41 6.727e-ll 597- 
620 PR00109D 17.04 7.609e-09 
504-527 


1084 


BL00983 


Ly-6 / u-PAR domain proteins. 


BL00983C 12.69 9.135e-09 105- 
121 


1085 


PR00541 


MUSCARINIC M4 RECEPTOR 
SIGNATURE 


PR00541C 8.06 7.726e-09 486-507 


1086 


PR00541 


MUSCARINIC M4 RECEPTOR 
SIGNATURE 


PR00541C 8.06 7.726e-09 755-776 


1087 


PD01101 


INHIBITOR HEAVY CHAIN 
CHANNEL IN. 


PD01101B 21.53 3.318e-22 343- 
396 


1088 


BL00129 


Glycosyl hydrolases family 31 proteins. 


BL00129A 26.21 2.400e-28 114- 
160 BL00129D 16.76 6.806e-26 
364-408 BL00129C 15.12 5.295e- 
24 326-354 BL00129E 22.60 
4.857e-23 428-464 BL00129B 
19.19 4.436e-15 225-252 
BL00129F 26.19 2.500e-13 544- 
582 


1090 


PR00887 


STRUCTURE-SPECIFIC 
RECOGNITION PROTEIN 
SIGNATURE 


PR00887A 11.39 1.643e-22 343- 
360 PR00887F 12.74 2.000e-22 
498-516 PR00887B 9.94 3.250e-22 
365-382 PR00887C 13.16 4.000e- 
22 388-405 PR00887E 10.36 
5.200e-22 480-499 PR00887H 
11.84 8.313e-22 537-556 
PR00887G 14.17 9.438e-20 521- 
538 PR00887D 15.12 8.313e-17 
453-467 


1090 


PR00886 


HIGH MOBILITY GROUP 
(HMG1/HMG2) PROTEIN 
SIGNATURE 


PR00886C 11.84 8.500e-13 696- 
715 PR00886A 10.08 3.192e-10 
710-733 


1090 


PD02448 


TRANSCRIPTION PROTEIN DNA- 
BINDIN. 


PD02448A 9.37 5.576e-10 686-725 


1090 


BL00353 


HMG1/2 proteins. 


BL00353B 11.47 8.244e-24 664- 
714 BL00353A 9.60 2.549e-09 
674-723 


1091 


BL00284 


Serpins proteins. 


BL00284C 28.56 4.000e-25 472- 
514 BL00284D 16.34 5.655e-17 
578-605 BL00284A 15.64 2.742e- 
15 341-365 BL00284E 19.15 
4.818e-15 659-684 BL00284B 
17.99 3.667e-14 445-466 
BL00284A 15.64 2.600e- 11 375- 
399 


1092 


PR00839 


V8 SERINE PROTEASE FAMILY 
SIGNATURE 


PR00839B 11.20 8.119e-10 357- 
375 


1097 


BL00605 


ATP synthase c subunit proteins. 


BL00605 27.67 3.172e-33 79-133 


1097 


PR00124 


ATP SYNTHASE C SUBUNIT 
SIGNATURE 


PR00124C 12.42 o.4UUe-lo 113- 
139 PR00124A 8.81 8.054e-14 75- 
95 PR00124B 14.66 6.897e-12 96- 
112 


1098 


BL00450 


Aconitase family proteins. 


BL00450B 42.34 8.393e-30 386- 
441 BL00450D 21.14 2.800e-18 
665-689 BL00450E 16.34 8.875e- 
13 710-725 BL00450B 42.34 
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6.400e-12 446-501 BL00450A 
13.76 2.406e-ll 351-365 
BL00450C 11.95 6.657e-10 612- 
622 


1098 


PR00415 


ACONITASE FAMILY SIGNATURE 


PR00415D 12.72 5.696e-16 390- 
406 PR00415I 13.62 4.1 15e-15 
675-689 PR00415G14.24 8.105e- 
15 548-563 PR00415C 13.34 
7.828e-14 376-390 PR00415E 
10.04 7.828e-14 452-466 
PR00415F 11.66 7.273e-13 466- 
480 PR00415H 12.39 y./UUe-l3 
613-625 PR00415A 11.15 1.621e- 
10 323-337 PR00415B 8.14 
9.03oe-uy J4/-J0O 


1104 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


mirnni nr\ 1 i qh 1 nnn*=» 1 1 1 ^Q-1 A.Q 


1104 


BL00240 


Receptor tyrosine kinase class III 
proteins. 


BL0024Ut> 24. /U 4.zCOe-uy oD-luy 


1106 


BL01002 


Translationally controlled tumor 
protein. 


BL01002A 13.19 1.360e-24 1-24 

hJL^UlUUZij /.jy j.lloc-iH-M-o-oz, 


1107 


PF00997 


Kappa casein. 


PF00997D 9.95 8.306e-09 513-548 


1109 


BL00018 


EF-hand calcium-binding domain 
proteins. 


BL00018 7.41 1.391e-09 42-55 


1110 


BL01310 


ATP1G1 / PLM / MATS family 
proteins. 


BL01310 14.74 8.981e-24 99-135 


|TTT2 


PR00764 


COMPLEMENT C9 SIGNATURE 


PR00764B 13.56 2.250e-ll 122- 
143 


1112 


PR00261 


LOW DENSITY LIPOPROTEIN 
(LDL) RECEPTOR SIGNATURE 


PR00261E 11.08 o.iUae-Uy 12/- 
149 PR00261F 11.57 7.152e-09 
127-149 


1115 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 4.273e-l4 188- 
228 


1115 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 3.250e-19 157- 
179 PR00245B 10.38 1.918e-09 
275-290 


1115 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237C 15.69 4.150e-09 202- 
225 


1116 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 2.658e-12 163- 
203 


1116 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 9.325e-19 132- 
154 PR00245C 7.84 4.073e-15 
311-327 PR00245B 10.38 5.500e- 
13 250-265 PR00245E 12.40 
7.618e-13 364-379 PK00245D 
10.47 4.o/3e-uy o4/-JDy 


1116 


PR00237 


RHODOPSIN-LIKE GPCR 


PR00237C 15.69 6.400e-10 177- 
200 PR00237G 19.63 5.814e-09 
345-372 


1119 


PD01719 


PRECURSOR GLYCOPROTEIN 
SIGNAL RE. 


PD01719A 12.89 7.955e-13 969- 
997 PD01719A 12.89 8.111e-09 
305-333 


1120 


BL00355 


HMG14 and HMG17 proteins. 


BL00355 5.97 1.692e-37 18-49 


1120 


PR00925 


NONHISTONE CHROMOSOMAL 
PROTEIN HMG17 FAMILY 


PR00925A 5.47 2.800e-19 18-33 
PR00925B 3.73 3.400e-16 34-47 
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SIGNATURE 


PR00925D o.do z./UUe-13 oo-// 
PR00925C 5.57 8.235e-09 47-58 


1124 


BL00615 


C-type lectin domain proteins. 


BL00615A 16. oo 4.24Ue-ll ZIU- 
228 


1125 


PD02327 


GLYCOPROTEIN ANTIGEN 
PRECURSOR IMMUNOGLO. 


PD02327B 19.84 2.091e-09 191- 
213 


1125 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DM00179 13.97 7.652e-09 338-348 


1127 


PD02870 


RECEPTOR INTERLEUKIN- 1 
PRECURSOR. 


PD02870D 15.74 8.755e-09 96-131 


1133 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 8.364e-14 122- 
144 PR00245C 7.84 9.280e-13 
300-316 PR00245B 10.38 4.600e- 
11 240-255 PR00245E 12.40 
7.623e-10 353-368 


1133 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 5.371e-13 153- 
193 BL00237D 11.23 7.750e-10 
344-361 


1133 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237G 19.63 6.063e-12 334- 
361 PR00237C 15.69 6.175e-09 
167-190 


1134 


BL00221 


MIP family proteins. 


BL00221B 10.22 1.871e-ll 141- 
152 BL00221D 12.33 2.174e-ll 
240-255 BL00221E 8.47 9.710e-ll 
307-318 BL00221 A 6.39 5.935e-09 
92-103 


1134 


PR00783 

r 


MAJOR INTRINSIC PROTEIN 
FAMILY SIGNATURE 


PR00783B 15.98 4.130e-15 127- 
152 PR00783F 12.33 9.156e-14 
308-329 PR00783A 12,72 7.462e- 
12 88-108 PR00783E 16.78 
8.263e-10 128-151 PR00783C 
13.54 1.340e-09 164-184 
PR00783E 16.78 o.754e-09 22o- 
249 


1136 


PD02886 


GLYCOPROTEIN PRECURSOR 
IMMUNOGLOBULIN FOL. 


PD02886C 21.92 7.907e-10 112- 
151 


1137 


PR00122 


VACUOLAR ATP SYNTHASE 16 KD 
SUBUNIT SIGNATURE 


PR00122C 8.20 1.000e-33 104-131 
PR00122B 8.60 2.125e-28 56-81 
PR00122D 9.97 4.375e-28 131-155 
PR00122A 11.44 o.UZ)Je-iy dv-dj 


1137 


BL00605 


ATP synthase c subunit proteins. 


BL00605 27.67 1.778e-10 94-148 


1137 


PR00124 


ATP SYNTHASE C SUBUNIT 
SIGNATURE 


PR00124C 12.42 2.161e-10 128- 
154 


1138 


BL00665 


Dihydrodipicolinate synthetase 
proteins. 


BL00665B 30.33 8.265e-12 52-105 
BL00665D 14.76 1.000e-ll 164- 
187 BL00665C 25.58 5.832e-ll 
105-156 


1138 


PR00146 


DIHYDRODIPICOLINATE 

n\r\TTf T A Ct? CT/^TvT A TPT T"D T? 

SYNTHASE MLriNAl UKL 


PR00146D 16.26 2.525e-10 163- 

1 Q 1 
lo 1 


1139 


BL00456 


Sodium: solute symporter family 
proteins. 


BL00456C 24.55 4.886e-28 165- 
220 BL00456A 22.59 3.127e-27 
27-82 BL00456B 18.94 1.220e-17 
103-133 


1139 


BL00415 


Synapsins proteins. 


BL00415O 3.44 6.270e-09 514-552 


1139 


BL00136 


Serine proteases, subtilase family, 
aspartic acid proteins. 


BL00136B 9.63 7.796e-09 773-786 
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1141 


BL00310 


Lysosome-associated membrane 
glycoproteins duplicated domain 
proteins. 


BL00310F 23.26 4.162e-09 194- 
249 


1143 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 4.1 15e-18 126- 
166 BL00237C 13.19 7.545e-15 
263-290 BL00237D 11.23 8.962e- 
11 324-341 


1143 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237G 19.63 7.120e-15 314- 
341 PR00237F 13.57 2.565e-14 
268-293 PR00237C 15.69 6.667e- 
12 140-163 PR00237A 11.48 
8.125e-ll 63-88 PR00237B 13.50 
1.563e-10 96-118 PR00237E 13.03 
3,118e-09 226-250 


1144 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 2.06Se-09 7-22 


1145 


BL01159 


WW/rsp5/WWP domain proteins. 


BL01159 13.85 1.310e-14 387-402 


1145 


PR00403 


WW DOMAIN SIGNATURE 


PR00403B 12.19 6.906e-15 387- 
402 PR00403A 16.82 5.200e- 11 
373-387 


1145 


BL01179 


Phosphotyrosine interaction domain 
proteins (PID) profile. 


BL01179A 12.63 8.286e-ll 394- 
406 BL01179B 15.18 7.968e-10 
667-682 


1147 


BL00594 


Aromatic amino acids permeases 
proteins. 


BL00594A 16.75 3.851e-09 107- 
151 


1148 


PR00399 


SYNAPTOTAGMIN SIGNATURE 


PR00399B 14.27 1.305e-09 242- 
256 


1148 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 6.318e-09 279- 
293 


1150 


BL00291 


Prion protein. 


BL00291A 4.49 8.241e-09 21-56 


1152 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 6.447e-12 210- 
250 


1152 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237G 19.63 3.512e-09 392- 
419 PR00237C 15.69 4.825e-09 
224-247 


1152 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 1.500e-20 179- 
201 PR00245B 10.38 3.571e-16 
297-312 PR00245E 12.40 l.OOOe- 
12 411-426 PR00245D 10.47 
1.000e-10 394-406 PR00245C 7.84 
6.727e-09 358-374 


1153 


PR00962 


LETHAL(2) GIANT LARVAE 
PROTEIN SIGNATURE 


PR00962B 11.98 2.800e-28 310- 
333 PR00962G 15.71 5.655e-28 
609-634 PR00962D 10.40 1.225e- 
27 451-475 PRO 0962F 12.39 
6.786e-23 568-588 PR00962H 
13.32 9.710e-23 639-659 
PR00962I 11.68 3.829e-22 708-728 

PK.UUyOZ\^ o.UU ^f.ZDUe-ZZ ^OZ-jOj 

PR00962A 13.28 7.612e-22 17-36 
PR00962E 8.81 1.628e-20 531-550 


1153 


PR00320 


G-PROTEIN BETA WD-40 REPEAT 
SIGNATURE 


PR00320A 16.74 9.122e-09 454- 
469 


1155 


BL00218 


Amino acid permeases proteins. 


BL00218D 21.49 2.038e-10 385- 
430 BL0021 8E 23.30 6.400e-10 
466-506 BL0021SB 21.44 5.790e- 
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09 217-249 


1155 


BL00341 


Surfactant associated polypeptide SP-C 
palmitoylation site proteins. 


BL00341B 8.70 7.895e-09 54-88 


1158 


PR00783 


MAJOR INTRINSIC PROTEIN 
FAMILY SIGNATURE 


PR00783C 13.54 1.474e-17 31-51 


1158 


BL00221 


MIP family proteins. 


BL00221B 10.22 1.643e-14 8-19 


1158 


PD00302 


PROTEASE POLYPROTEIN 
HYDROLASE ASP. 


PD00302B 9.52 1.360e-l 4 261-277 
PD00302A6.33 3.323e-ll 198-209 


1158 


PF00692 


dUTPase. 


PF00692B 8.14 3.613e-ll 113-124 


1158 


DM00892 


3 RETROVIRAL PROTEINASE. 


DM00892C 23.55 4.81Se-13 292- 
326 DM00892B 9.78 1.000e-08 
264-270 


1160 


PD00320 


LAMININ CHAIN EGF-LIKE 
DOMAIN P. 


PD00320A 14.49 8.286e-12 1681- 
1695 


1160 


BL01187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01187B 12.04 1.225e-09 1745- 
1761 


1160 


BL01248 


Laminin-type EGF-like (LE) domain 
proteins. 


BL01248 11.02 9.308e-18 1705- 
1718 BL01248 11.02 4.000e-15 
393-406 BL01248 11.02 1.900e-ll 
321-334 BL01248 11.02 6.094e-09 
445-458 


1160 


PR00877 


PLANT PEC FAMILY 
METALLOTHIONEIN SIGNATURE 


PR00877D 4.18 6.516e-09 1308- 
1316 


1160 


PF00054 


Laminin G domain proteins. 


PF00054B 16.61 8.200e-09 558- 
570 


1160 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011B 13.08 2.841e-17 547- 
566 PR00011D 14.03 4.150e-16 
547-566 PR00011B 13.08 7.750e- 
16 1370-1389 PR00011A 14.06 
1.170e-15 1277-1296 PR00011D 
14.03 4.000e-15 501-520 
PR00011A 14.06 4.736e-15 547- 
566 PR00011A 14.06 1.327e-14 
1370-1389 PR00011A 14.06 
5.909e-14 1416-1435 PR00011D 
14.03 7.980e-14 1370-1389 
PR00011D 14.03 3.717e-13 1416- 
1435 PR00011D 14.03 6.434e-13 
1277-1296 PR00011B 13.08 
8.932e-13 1277-1296 PR00011A 
14.06 2.161e-12 436-455 
PR00011D 14.03 6.053e-12 436- 
455 PR00011B 13.08 1.138e-ll 
436-455 PR0001 IB 13.08 6.954e- 
11 1416-1435 PR00011A 14.06 
8.062e-ll 1318-1337 PR00011A 
14.06 1.913e-10 695-714 
PR00011B 13.08 2.043e-10 644- 
663 PR00011D 14.03 3.215e-10 
695-714 PR00011C 24.25 4.600e- 
10 651-680 PR00011C 24.25 
9.100e-10 1423-1452 PR00011D 
14.03 1.261e-09 644-663 
PR00011B 13.08 1.370e-09 695- 
714 PR00011A 14.06 1.986e-09 
644-663 PR00011B 13.08 3.096e- 
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09 1318-1337 PR00011D 14.03 
8.435e-09 591-610 PR00011C 
24.25 9.857e-09 1769-1798 


1161 


PD00919 


CALCIUM-BINDING PRECURSOR 
SIGNAL R. 


PD00919A 11.53 8.377e-10 185- 
197 


1161 


BL01187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01187B 12.04 9.486e-10 158- 
174 BL01187B 12.04 2.800e-09 
196-212 


1161 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011D 14.03 4.158e-12 39-58 
PR00011B 13.08 2.973e-09 39-58 


1161 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010C 11.16 5.929e-09 163- 
174 


1161 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243H 17.53 7.276e-09 65-91 


1161 


BL01185 


C-terminal cystine knot proteins. 


BL01185B 21.14 9.047e-09 137- 
186 


1161 


DM00060 


338 lew NEUREXIN ALPHA III 
CYSTEINE. 


DM00060 6.92 9.460e-09 108-118 


1161 


BL01248 


Laminin-type EGF-like (LE) domain 
proteins. 


BL01248 11.02 9.660e-09 48-61 


1162 


BL00252 


Interferon alpha, beta and delta family 
proteins. 


BL00252A 18.49 6.657e-23 35-72 
BL00252B 19.78 9.125e-16 73-124 


1162 


PR00266 


INTERFERON ALPHA AND BETA 
SUB UNIT SIGNATURE 


PR00266A 13.61 1.000e-13 67-80 


1163 


PR00264 


INTERLEUKIN- 1 SIGNATURE 


PR00264B 20.98 8.453e-ll 63-90 
PR00264C 17.77 1.851e-10 103- 
132 


1164 


BL00249 


Platelet-derived growth factor (PDGF) 
family proteins. 


BL00249 18.21 3.250e-34 75-105 


1165 


BL00253 


Interleukin~l proteins. 


BL00253D 25.67 3.464e-ll 95-135 


1165 


PR00264 


INTERLEUKIN- 1 SIGNATURE 


PR00264C 17.77 3.294e-17 95-124 
PR00264B 20.98 6.250e-09 56-83 


1166 


PR00573 


INTERLEUKIN 8B RECEPTOR 
SIGNATURE 


PR00573D 15.57 1.450e-20 114- 
130 


1166 


PR00427 


INTERLEUKIN- 8 RECEPTOR 
SIGNATURE 


PR00427E 7.08 9.446e-l8 6-22 
PR00427G 10.85 2.667e-16 98-115 
PR00427F 11.77 9.500e-13 48-65 


1166 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237F 13.57 4.789e-16 23-48 
PR00237G 19.63 1.844e-12 70-97 


1166 


BL00237 


G-protein coupled receptors proteins. 


BL00237C 13.19 1.857e-16 18-45 
BL00237D 11.23 6.464e-10 80-97 


1167 


PR00414 


PALMITOYL PROTEIN 
THIOESTERASE SIGNATURE 


PR00414A 10.74 1.000e-31 26-49 
PR00414D 17.19 1.450e-27 151- 
171 PR00414E 16.76 1.600e-27 
185-205 PR00414G 12.38 4.000e- 
25 258-277 PR00414B 12.48 
4.2 14e-25 66-86 PR00414C 13.14 
L000e-22 106-123 


1168 


BL00284 


Serpins proteins. 


BL00284C 28.56 4.000e-25 204- 
246 BL00284A 15.64 7.750e-22 
73-97 BL00284E 19.15 8.826e-19 
391-416 BL00284D 16.34 5. 655e- 
17 310-337 BL00284B 17.99 
3.667e-14 177-198 BL00284A 
15.64 2.600e-ll 107-131 


1170 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 6.786e-26 103- 
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132 BL00269A 8.53 2.607e-20 38- 
58 BL00269B 19.17 5.500e-17 65- 
94 


1172 


BL00383 


Tyrosine specific protein phosphatases 
proteins. 


BL00383A 13.34 5.235e~16 274- 
289 BL00383F 15.51 1.429b- 15 
477-493 BL00383E 10.35 7.000e- 
15 439-450 BL00383D 11.92 
4.000e-13 403-416 BL00383C 
10.10 6.344e-10 326-337 


1172 


PR00700 


PROTEIN TYROSINE 
PHOSPHATASE SIGNATURE 


PR00700B 16.80 5.320e-21 313- 
334 PR00700D 12.47 9.217e-19 
436-455 PR00700C 13.17 3.143e- 
15 399-417 PR00700F 11.18 
8.941e-13 483-494 PR00700A 6.96 
8.714e-10 295-303 PR00700E 
17.57 7.618e-09 467-483 


1174 


BL00472 


Small cytokines (intercrine/chemokine) 
C-C subfamily signatur. 


BL00472A 7.45 3.483e-09 1-13 


1175 


BL00472 


Small cytokines (intercrine/chemokine) 
C-C subfamily signatur. 


BL00472A 7.45 3.483e-09 1-13 


1187 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 4.273e-09 88-102 


1188 


PR00860 


VERTEBRATE 

METALLOTHIONEIN SIGNATURE 


PR00860B 7.04 6.776e-09 52-66 


1189 


BL00280 


Pancreatic trypsin inhibitor (Kunitz) 
family proteins. 


BL00280 24.61 1. 5 14e-30 276-320 
BL00280 24.61 6.553e-24 238-282 


1189 


PR00759 


BASIC PROTEASE (KUNITZ-TYPE) 
INHIBITOR FAMILY SIGNATURE 


PR00759C 14.15 2.385e-14 304- 
320 PR00759B 11.26 8.105e-12 
256-267 PR00759B 11.26 2.452e- 
10 294-305 


1189 


BL00213 


Lipocalin proteins. 


BL00213A 12.95 2.636e-09 38-52 


1189 


PR00179 


LIPOCALIN SIGNATURE 


PR00179B 9.56 8.125e-12 122-135 
PR00179A 13.78 6.760e-10 38-51 
PR00179C 19.02 6.727e-09 150- 
166 


1190 


DM0003 1 


IMMUNOGLOBULIN V REGION. 


DM00031B 15.41 1.000e-10 83- 
117 


1191 


DM0003 1 


IMMUNOGLOBULIN V REGION. 


DM0003 IB 15.41 1.000e-10 83- 
117 


1198 


BL00154 


E1-E2 ATPases phosphorylation site 
proteins. 


BL00154G 21.18 9.783e-09 203- 
237 


1200 


DM01970 


0 kw ZK632.12 YDR313C 
ENDOSOMAL III. 


DM01970B 8.60 3.494e-09 120- 
133 


1201 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 8.500e-28 35-74 


1202 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 8.500e-28 35-74 


1204 


BL00134 


Serine proteases, trypsin family, 
histidine proteins. 


BL00134A 11.96 7.158e-19 81-98 


1204 


PR00722 


CHYMOTRYPSIN SERINE 
PROTEASE FAMILY (SI) 
SIGNATURE 


PR00722A 12.27 7.353e-18 82-98 


1204 


BL00495 


Apple domain proteins. 


BL00495K 12.58 4.543e-ll 83-116 


1204 


BL01253 


Type I fibronectin domain proteins. ! 


BL01253E 16.01 3.415e-13 154- 
191 BL01253D4.84 9.585e-ll 81- 
95 


1204 


BL00021 


Kringle domain proteins. i 


BL00021B 13.33 2.370e-17 81-99 
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BL00021C 22.21 6.604e-09 165- 
187 


1208 


PR00179 


LIPOCALIN SIGNATURE 


PR00179B 9.56 7.214e-09 121-134 


1212 


PR00364 


DISEASE RESISTANCE PROTEIN 
SIGNATURE 


PR00364D 10,54 6.700e-09 172- 
189 


1212 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 8,615e-13 177- 
191 PR00019B 11.36 2.2S6e-12 
174-188 PR00019A 11.19 4.913e- 
10 319-333 PR00019B 11.36 
4.960e-09 387-401 PR00019A 
11.19 8.667e-09 151-165 


1213 


BL00132 


Zinc carboxypeptidases, zinc-binding 
region 1 proteins. 


BL00132C 21.35 3.308e-28 129- 
170 BL00132B 15.93 1.871e-16 
99-113 BL00132A 26.07 1.682e-14 
50-91 BL00132F 13.26 7.254e- 14 
228-250 BL00132D 12.70 2.875e- 
12 173-188 BL00132E 17.72 
3.552e-12 199-226 


1213 


PR00765 


CARBOXYPEPTIDASE A 
METALLOPROTEASE (Ml 4) 
FAMILY SIGNATURE 


PR00765B 15.57 7.857e-16 99-1 14 
PR00765D 14.16 5.500e-ll 233- 
247 PR00765C 12.55 1.290e-10 
179-188 


1214 


PD01270 


RECEPTOR FC 
IMMUNOGLOBULIN AFFIN. 


PD01270C 19.54 5.329e-09 32-61 


1221 


PR00007 


COMPLEMENT C1Q DOMAIN 
SIGNATURE 


PR00007A 19.33 1.000e-16 143- 
170 PR00007C 15.60 8.200e-15 
214-236 PR00007B 14.16 5.846e- 
14 170-190 PR00007D9.64 
5.250e- 10 249-260 


1221 


BL01113 


Clq domain proteins. 


BL01113B 18.26 1.581e-29 149- 
185 BL01113C 13.18 3.077e-15 
214-234 BL01113A 17.99 1.243e- 
13 50-77 BL01113A 17.99 6.108e- 
13 35-62 BL01113A 17.99 3.077e- 
12 41-68 BL01113A 17.99 1.574e- 
10 38-65 BL01113A 17.99 9.617e- 
1044-71 BL01113A 17.99 7.577e- 
09 59-86 


1221 


BL00420 


Speract receptor repeat proteins domain 
proteins. 


BL00420A 20.42 2.038e-12 86-1 15 
BL00420A 20.42 5.154e-12 44-73 
BL00420A 20.42 4.185e-09 47-76 
BL00420A 20.42 9.031e-09 50-79 


1223 


BL00240 


Receptor tyrosine kinase class III 
proteins. 


BL00240E 11.56 2.043e-21 260- 
298 BL00240F 17.74 4.035e-18 
313-361 BL00240G 28.45 4.923e- 
17 361-414 


1223 


BL50001 


Src homology 2 (SH2) domain proteins 
profile. 


BL50001B 17.40 3.077e-16 271- 
292 


YL2L5 


BL00107 


Protein kinases ATP-bmdmg region 
proteins. 


BL00107A 18.39 4.500e-19 274- 
305 BL00107B 13.31 7.231e-12 
342-358 


1223 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109D 17.04 4.462e-20 343- 
366 PR00109E 14.41 5.950e-20 
388-411 PR00109B 12.27 9.571e- 
19 274-293 PR00109A 15.00 
5.065e-10 237-251 PR00109C 
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12.85 5. 333e-10 324-335 


1223 


BL00239 


Receptor tyrosine kinase class II 
proteins. 


BL00239E 17.14 1.281e-25 314- 
364 BL00239F 28.15 9.684e-19 
369-414 BL00239C 18.75 8.765e- 
15 261-284 BL00239D 16.81 
5.57Ie-10 286-312 


1223 


PR00046 


MAJOR SIGMA-70 FACTOR 
SIGNATURE 


PR00046A 13.26 8.788e-09 296- 
310 


1223 


BL00790 


Receptor tyrosine kinase class V 
proteins. 


BL00790O 7.68 9.357e-16 321-354 
BL00790Q 15.61 6.057e-ll 380- 
429 BL00790N 13.25 3.937e-10 
287-314 BL00790M8.74 5.138e- 
10 265-287 BL00790K 9.30 
1.000e-08 182-236 


1224 


PR00453 


VON WILLEB RAND FACTOR TYPE 
A DOMAIN SIGNATURE 


PR00453A 12.79 9.827e-09 50-68 


1227 


BL00615 


C-type lectin domain proteins. 


BL00615B 12.25 9.571e-10 151- 
165 BL00615A 16.68 3.100e-09 
48-66 


1237 


BL00514 


Fibrinogen beta and gamma chains C- 
terminal domain proteins. 


BL00514E 14.28 7.750e-12 313- 
330 


1257 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 1.000e-09 247- 
261 PR00019B 11.36 2.080e-09 
316-330 PR00019B 11.36 3.160e- 
09 244-258 PR00019B 11.36 
4.240e-09 100-114 PR00019B 
11.36 5.680e-09 148-162 


1263 


BL00427 


Disintegrins proteins. 


BL00427 13.93 4.035e-17 436-491 


1263 


PR00289 


DISINTEGRIN SIGNATURE 


PR00289A 13.62 5.667e-14 450- 
470 


1263 


PR00873 


ECHINOIDEA (SEA URCHIN) 
METALLOTHIONEIN SIGNATURE 


PR00873D 8.43 5.989e-09 608-627 


1264 


BL01173 


Lipolytic enzymes G-D-X-G family, 
histidine. 


BL01173C 8.98 1.000e-16 216-230 
BL01173B 13.27 1.000e-12 174- 
201 BL01173A9.41 7.200e-ll 
141-154 


1266 


BL00492 


Clusterin proteins. 


BL00492F 16.61 1.000e-40 163- 
205 BL00492G 13.20 1.000e-40 
230-280 BL00492H 16.50 6.870e- 
40 358-395 BL00492C 8.35 
1.000e-38 52-86 BL00492B 10.36 
9.053e-28 26-49 BL00492E 12.52 
6.423e-26 73-122 BL00492A 11.53 
4.724e-18 2-19 


1267 


BL00284 


Serpins proteins. 


BL00284C 28.56 6.538e-29 207- 
249 BL00284A 15.64 3.739e-18 
107-131 BL00284D 16.34 3.793e- 
17 314-341 BL00284E 19.15 
2.909e-15 401-426 


1268 


BL00284 


Serpins proteins. 


BL00284C 28.56 6.538e-29 243- 
285 BL00284A 15.64 3.739e-18 
125-149 BL00284E 19.15 8.269e- 
18 437-462 BL00284D 16.34 
3.793e-17 350-377 BL00284B 
17.99 3.483e-13 214-235 


1273 


BL00380 


Rhodanese proteins. 


BL0O380D 15.90 8.200e-28 110- 
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136 BL00380G 11.26 5.800e-16 
251-264 BL0038OB 14.77 7.000e- 
14 49-62 BL00380C 15.67 7.387e- 
13 82-98 BL0O380E 12.44 7.000e- 
11 181-193 BL00380A 10.48 
1.000e-09 10-20 


1274 


BL00134 


Serine proteases, trypsin family, 
histidine proteins. 


BL00134C 13.45 9.000e-17 333- 
347 BL00134A 11.96 7.429e-16 
147-164 BL00134B15.99 3.793e- 
15 297-321 


1274 


BL01253 


Type I fibronectin domain proteins. 


BL01253H 13.15 5.909e-19 315- 
350 BL01253G 11.34 8.791e-13 
296-310 BL01253F 14.35 2.241e- 
12 252-291 BL01253E 16.01 
7.194e-10 215-252 BL01253D4.84 
7.686e-10 147-161 


1274 


BL00495 


Apple domain proteins. 


BL00495N 11.04 3.957e-24 289- 
324 BL00495O 13.75 6.610e-16 
324-353 BL00495K 12.58 1.957e- 
11 149-182 BL00495L 11.94 
2.154e-09 178-217 


1274 


PR00722 


CHYMOTRYPSIN SERINE 
PROTEASE FAMILY (SI) 
SIGNATURE 


PR00722A 12.27 7.480e-15 148- 
164 PR00722C 10.87 2.286e-14 
296-309 PR00722B 12.51 3.250e- 
09 203-218 


1274 


BL00021 


Kr ingle domain proteins. 


BL00021D 24.56 8.773e-27 305- 
347 BL00021B 13.33 9.217e-17 
147-165 BL00021C 22.21 3.717e- 
09 226-248 


1279 


PR00704 


CALPAIN CYSTEINE PROTEASE 
(C2) FAMILY SIGNATURE 


PR00704I 9.52 4.682e-14 145-174 


1287 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 8.313e-28 16-55 


1287 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 3.769e-15 191-204 
PD00066 13.92 1.500e-13 247-260 
PD00066 13.92 1.500e-13 387-400 
PD00066 13.92 7.500e-13 275-288 
PD00066 13.92 8.000e-13 359-372 
PD00066 13.92 9.143e-12 219-232 
PD00066 13.92 4.913e-ll 303-316 


1287 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 4.857e-16 371-388 
BL00028 16.07 8.4 12e- 14 203-220 
BL00028 16.07 5.500e-13 287-304 
BL00028 16.07 1.346e-ll 259-276 
BL00028 16.07 5.154e-ll 399-416 
BL00028 16.07 7.577e-ll 343-360 
BL00028 16.07 1.000e-10 175-192 
BL00028 16.07 6.700e-10 231-248 
BL00028 16.07 3.829e-09 315-332 


1287 


PD02462 


PROTEIN BOLA TRANSCRIPTION 
REGULATION AC. 


PD02462A 22.48 6.817e-09 280- 
315 


1287 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 7.750e-16 368- 
382 PR00048A 10.52 4.273e-15 
284-298 PR00048A 10.52 1.643e- 
13 200-214 PR00048B 6.02 
1.900e-13 188-198 PR00048B 6.02 
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6.400e-13 272-282 PR00048A 
10.52 3.118e-12 396-410 
PR00048B 6.02 5.000e-12 384-394 
PR00048B 6.02 4.462e-ll 244-254 
PR00048B 6.02 1.563e-10 356-366 
PR00048A 1 0.52 2. 1 74e- 1 0 256- 
270 PR00048A 10.52 4.913e-10 
312-326 PR00048A 10.52 9.217e- 
10 340-354 PR00048B 6.02 
3.368e-09 300-310 PR00048A 
10.52 7. 480e-09 228-242 


1292 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243H 17.53 8.696e-ll 42-68 


1295 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 3.512e-31 10-49 


1295 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDL 


PD00066 13.92 9.100e-16 337-350 
PD00066 13.92 4.462e-15 477-490 
PD00066 13.92 5.800e-14 421-434 
PD00066 13.92 9.400e-14 449-462 
PD00066 13.92 9.500e-13 365-378 
PD00066 13.92 9.500e-13 393-406 


1295 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 7.353e-14 321-338 
BL00028 16.07 7.750e-13 405-422 
BL00028 16.07 9.550e-13 349-366 
BL00028 16.07 9.550e-13 377-394 
BL00028 16.07 2.957e-12 433-450 
BL00028 16.07 5.800e-10 461-478 
BL00028 16.07 7.600e-10 489-506 


1295 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 1.000e-13 346- 
360 PR00048A 10.52 1.000e-13 
486-500 PR00048A 10.52 5.500e- 
13 430-444 PR00048A 10.52 
6.786e-13 402-416 PR00048A 
10.52 6.786e-13 458-472 
PR00048A 10.52 4.7S9e-ll 318- 
332 PR00048A 10.52 9.526e-ll 
374-388 PR00048B 6.02 L000e-10 
334-344 PR00048B 6.02 9.438e-10 
418-428 PR00048B 6.02 9.438e-10 
474-484 PR00048B 6.02 8.579e-09 
446-456 


1296 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 8.615e-33 16-55 


1296 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDL 


PD00066 13.92 6.400e- 16 348-361 
PD00066 13.92 3.077e-15 236-249 
PD00066 13.92 4.462e-15 264-277 
PD00066 13.92 4.462e-15 320-333 
PD00066 13.92 2.800e-14 432-445 
PD00066 13.92 4.600e-14 292-305 
PD00066 13.92 8. 200e- 14 488-501 
PD00066 13.92 8. 200e-14 516-529 
PD00066 13.92 6.500e-13 376-389 
PD00066 13.92 7.000e-13 208-221 
PD00066 13.92 4.429e-12 460-473 


1296 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 9.182e-15 385- 
399 PR00048A 10.52 9.182e-15 
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469-483 PR00048A 10.52 4.750e- 
14 329-343 PR00048A 10.52 
3.571e-13 413-427 PR00048A 
10.52 9.357e-13 245-259 
PR00048A 10.52 1.529e-12 525- 
539 PR00048A 10.52 2.588e-12 
189-203 PR00048A 10.52 3.11 8e- 
12 301-315 PR00048A 10,52 
3.647e-12 357-371 PR00048B 6,02 
l.OOOe-11 205-215 PR00048B 6.02 
l.OOOe-11 261-271 PR00048B 6.02 
l.OOOe-ll 429-439 PR00048B 6.02 
2.385e-ll 345-355 PR00048B 6.02 
4.462e-ll 513-523 PR00048B 6.02 
7.23 le-11 317-327 PR00048A 
10.52 9.526e-ll 217-231 
PR00048A 10.52 2.174e-10 273- 
287 PR00048B 6.02 2.688e-10 
485-495 PR00048B 6.02 3.813e-10 
289-299 PR00048B6.02 4.938e-10 
233-243 PR00048A 10.52 6.478e- 
10 497-511 PR00048B6.02 
3.842e-09 373-383 


1296 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 2.588e-14 192-209 
BL00028 16.07 1.450e-13 388-405 
BL00028 16.07 1.450e-13 528-545 
BL00028 16.07 5.304e- 12 472-489 
BL00028 16.07 6.478e-12 416-433 
BL00028 16.07 8.826e-12 332-349 
BL00028 16.07 9.609e-12 248-265 
BL00028 16.07 2.385e-ll 276-293 
BL00028 16.07 3.769e-ll 360-377 
BL00028 16.07 7.577e-ll 304-321 
BL00028 16.07 6.10Oe-lO 220-237 
BL00028 16.07 5. 886e-09 500-517 


1299 


BL00142 


Neutral zinc metallopeptidases, zinc- 
binding region proteins. 


BL00142 8.38 4.375e-10 188-199 


1299 


PR00138 


MATRIXIN SIGNATURE 


PR00138D 16.56 l,321e-19 188- 
214 PR00138B 15.82 1.655e-10 
106-122 PRO0138C 16.41 2.393e- 
10 132-161 PR00138E6.01 
6.5 86e- 10 222-236 


1299 


BL00024 


Hemopexin domain proteins. 


BL00024D 17.28 3.411e-22 182- 
214 BL00024B 21.53 8.788e-19 
80-114 BL00024H 11.35 3.077e-10 
337-349 BL00024E 7.58 9.21 le-10 
222-236 BL00024C 22,98 6,000e- 
09 131-180 


1299 


PR00480 


ASTACIN FAMILY SIGNATURE 


PR00480B 15.41 6.529e-09 183- 
202 


1299 


BL00546 


Matrixins cysteine switch. 


BL00546C 16.41 1.643e-22 182- 
214 BL00546D 10.34 8.017e-10 
222-236 BL00546B20.il 2.443e- 
09 132-176 BL00546H 10.76 
4.5 12e-09 337-348 BL00546G 
16.84 6.684e-09 338-358 
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1301 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290B 13.17 2.875e-09 170- 
188 


1305 


PD02811 


PROTEIN PEPTIDE REDUCTASE 
MG448 PILB FIMBRIA TRAN. 


PD02811A 20.67 3.017e-22 60-98 
PD02811B 17.07 2.263e-21 111- 
144 PD02811C 13.25 5.696e-13 
147-160 


1307 


BL00282 


Kazal serine protease inhibitors family 
proteins. 


BL00282 16.88 8.043e-18 28-51 


1307 


PR00290 


KAZAL-TYPE SERINE PROTEASE 
INHIBITOR SIGNATURE 


PR00290B 9.78 6.559e-ll 39-51 
PR00290A 10.88 8.851e-09 28-39 


1308 


BL00112 


ATP:guanido phosphotransferases 
proteins. 


BL00112F 21.37 7.728e-09 51-102 


1309 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 6.786e-26 139- 
168 BL00269A 8.53 2.607e-20 74- 
94 BL00269B 19.17 5.5O0e-17 
101-130 


1312 


BL01115 


GTP-binding nuclear protein ran 
proteins. 


BL01115A 10.22 3.559e-12 63-107 


1312 


PR00449 


TRANSFORMING PROTEIN P21 
RAS SIGNATURE 


PR00449A 13.20 7.545e-17 63-85 
PR00449E 13.50 9.280e-14 201- 
224 PR00449D 10.79 7.158e-13 
167-181 PR00449C 17.27 9.229e- 
12 103-126 PR00449B 14.34 
9.280e-10 86-103 


1314 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 2.059e-14 361-378 
BL00028 16.07 7.353e-14 193-210 
BL00028 16.07 1.900e-13 165-182 
BL00028 16.07 3.250e-13 333-350 
BL00028 16.07 4.150e-l3 221-238 
BL00028 16.07 9.550e-13 277-294 
BL00028 16.07 1.000e-l 1 305-322 
BL00028 16.07 1.600e-10 249-266 


1314 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 2.800e-14 321-334 
PD00066 13.92 1.000e-13 265-278 
PD00066 13.92 1.429e- 12 237-250 
PD00066 13.92 7.857e-12 181-194 
PD00066 13.92 2. 1 74e-l 1 349-362 
PD00066 13.92 2.385e-10 209-222 
PD00066 13.92 7.23 le-10 293-306 


1314 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 5.500e-14 218- 
232 PR00048A 10.52 5.500e-14 
358-372 PR00048A 10.52 8.500e- 
14 330-344 PR00048A 10.52 
2.286e-13 190-204 PR0004SA 
10.52 8.071e-13 162-176 
PR00048A 10.52 3.118e-12 274- 
288 PR00048A 10.52 6.684e-l 1 
246-260 PR00048B 6.02 7.23 le-11 
318-328 PR00048B 6.02 1.563e-10 
178-188 PR00048A 10.52 7.261e- 
10 302-316 PR00048B 6.02 
8.579e-09 262-272 


1315 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 1.947e-10 95-123 


1315 


PR00874 


FUNGI-IV METALLOTHIONEIN 
SIGNATURE 


PR00874C 4.37 3.750e-09 48-63 
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1315 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 4.434e-09 50-79 


1315 


PD00866 


GLYCOPROTEIN PROTEIN SPIKE 
E2 PRECURSOR PEPLOMER. 


PD00866L 3.73 2.770e-09 24-34 
PD00866L 3.73 2.770e-09 82-92 
PD00866L 3.73 2.918e-09 4-14 
PD00866L 3.73 4.984e-09 14-24 


1315 


PR00876 


NEMATODE METALLOTHIONEIN 
SIGNATURE 


PR00876B 7.66 5.014e-09 89-103 


1315 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243I 31.77 5.655e-13 3-46 
BL00243I 31.77 9.286e-ll 81-124 
BL00243I 31.77 6.691e-10 13-56 
BL00243I 31.77 2.775e-09 10-53 
BL00243I 31.77 5.310e-09 52-95 
BL00243I 31.77 5.437e-09 18-61 


1315 


PR00858 


CRUSTACEAN 

METALLOTHIONEIN SIGNATURE 


PR00858B 5.93 3.394e-09 49-68 
PR00858B 5.93 7.223e-09 15-34 


1315 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 9.893e-13 72-118 
BL00203 13.94 7.414e-12 34-80 
BL00203 13.94 2.879e-l 1 46-92 
BL00203 13.94 4.956e-ll 87-133 
BL00203 13.94 5.945e-ll 101-147 
BL00203 13.94 8.3 19e-l 1 78-124 
BL00203 13.94 1.383e-10 63-109 
BL00203 13.94 3.106e-10 77-123 
BL00203 13.94 6.266e-10 15-61 
BL00203 13.94 8.085e-10 33-79 
BL00203 13.94 8.372e-10 102-148 
BL00203 13.94 9.138e-10 56-102 
BL00203 13.94 9.521e-10 88-134 
BL00203 13.94 1.000e-09 5-51 
BL00203 13.94 1.827e-09 82-128 
BL00203 13.94 3.847e-09 83-129 
BL00203 13.94 5. 133e-09 48-94 
BL00203 13.94 5.500e-09 25-71 
BL00203 13.94 6.143e-09 28-74 
BL00203 13.94 6.327e-09 93-139 
BL00203 13.94 7. 337e-09 73-119 
BL00203 13.94 8.071e-09 61-107 
BL00203 13.94 8.439e-09 91-137 
BL00203 13.94 9.082e-09 13-59 


1315 


DM00864 


EGF-LIKE DOMAIN. 


DM00864A 15.21 9.229e-09 6-27 


1315 


BL00198 


4Fe-4S ferredoxins, iron-sulfur binding 
region proteins. 


BL00198 10.43 1.600e-09 23-35 
BL00198 10.43 9.700e-09 3-15 


1316 


DM00406 


GLIADIN. 


DM00406 7.73 9.514e-10 162-175 


1316 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243I 31.77 8.859e-09 66-109 


1316 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 4.553e-09 24-53 
BL00269C 16.52 9.289e-09 94-123 


1316 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 3.700e-09 45-73 
PD02283C 17.54 a./oie-uy od-H 
PD02283C 17.54 9.325e-09 80-108 
PD02283C 17.54 9.550e-09 90-118 


1316 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 4.363e-ll 47-93 
BL00203 13.94 5.846e-ll 88-134 
BL00203 13.94 7.527e-ll 57-103 
BL00203 13.94 8.714e-ll 22-68 
BL00203 13.94 4.447e-10 87-133 
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BL00203 13.94 7.798e-10 92-138 
BL00203 13.94 2.010e-09 37-83 
BL00203 13.94 3.939e-09 38-84 
BL00203 13.94 5.133e~09 18-64 
BL00203 13.94 5.224e-09 17-63 
BL00203 13.94 5.592e-09 52-98 
BL00203 13.94 5.684e-09 68-114 
BL00203 13.94 7.153e-09 33-79 
BL00203 13.94 7.245e-09 78-124 
BL00203 13.94 8.163e-09 43-89 
BL00203 13.94 8.439e-09 23-69 
BL00203 13.94 9.265e-09 42-88 
BL00203 13.94 9.633e-09 93-139 


1317 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243I 3 1 .77 8.279e-10 216-259 
BL00243I 31.77 5.310e-09 219-262 


1317 


BL01208 


VWFC domain proteins. 


BL01208B 15.83 5.865e-09 244- 
259 


1317 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 7.750e-10 163- 
191 PD02283C 17.54 4.938e-09 
198-226 PD02283C 17.54 6.175e- 
09 243-271 PD02283C 17.54 
6.400e-09 62-90 PD02283C 17.54 
6.625e-09 153-181 PD02283C 
17.54 6.738e-09 173-201 
PD02283C 17.54 6.738e-09 208- 
236 PD02283C 17.54 7.975e-09 
188-216 PD02283C 17.54 9.325e- 
09 178-206 


1317 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 9.703e-ll 130-176 
BL00203 13.94 1.191e-10 104-150 
BL00203 13.94 2.149e-10 64-1 10 
BL00203 13.94 4.447e-10 200-246 
BL00203 13.94 5.883e-10 215-261 
BL00203 13.94 5.979e-10 210-256 
BL00203 13.94 5.979e-10 225-271 
BL00203 13.94 6.362e-10 120-166 
BL00203 13.94 6.649e-10 24-70 
BL00203 13.94 7.032e-10 34-80 
BL00203 13.94 8.181e-10 165-211 
BL00203 13.94 8.947e-10 85-131 
BL00203 13.94 9.809e-10 60-106 
BL00203 13.94 1.000e-09 170-216 
BL00203 13.94 1.367e-09 176-222 
BL00203 13.94 1.643e-09 70-116 
BL00203 13.94 2.378e-09 239-285 
BL00203 13.94 2.469e-09 125-171 
BL00203 13.94 3.755e-09 175-221 
BL00203 13.94 3.847e-09 19-65 
BL00203 13.94 3.847e-09 190-236 
BL00203 13.94 4.122e-09 201-247 
BL00203 13.94 4.490e-09 160-206 
BL00203 13.94 5.224e-09 186-232 
BL00203 13.94 5.867e-09 115-161 
BL00203 13.94 6.235e-09 80-126 
BL00203 13.94 6.418e-09 55-101 
BL00203 13.94 7.337e-09 15-61 
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- 


BL00203 13.94 7.337e-09 69-115 
BL00203 13.94 7.520e-09 45-91 
BL00203 13.94 7.888e-09 95-141 
BL00203 13.94 8.255e-09 29-75 
BL00203 13.94 8.439e-09 206-252 
BL00203 13.94 8.531e-09 126-172 
BL00203 13.94 8.806e-09 38-84 
BL00203 13.94 8.898e-09 151-197 
BL00203 13.94 9.265e-09 216-262 
BL00203 13.94 9.449e-09 180-226 


1317 


PR00876 


NEMATODE METALLOTHIONEIN 
SIGNATURE 


PR00876B 7.66 9.757e-09 113-127 


1317 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 6.425e-10 132- 
161 BL00269C 16.52 9.882e-09 
21-50 


1318 


PR00876 


NEMATODE METALLOTHIONEIN 
SIGNATURE 


PR00876B 7.66 2.268e-10 107-121 


1318 


PR00874 


FUNGI-IV METALLOTHIONEIN 
SIGNATURE 


PR00874C 4.37 9.739e-10 95-110 


1318 


BL00427 


Disintegrins proteins. 


BL00427 13.93 2.274e-09 114-169 


1318 


PD00866 


GLYCOPROTEIN PROTEIN SPIKE 
E2 PRECURSOR PEPLOMER. 


PD00866L3.73 6.564e-10 1-11 
PD00866L 3.73 1.443e-09 33-43 
PD00866L3.73 2.770e-09 119-129 
PD00866L 3.73 2.918e-09 12-22 
PD00866L3.73 2.918e-09 19-29 


1318 


BL01208 


VWFC domain proteins. 


BL01208B 15.83 2.946e-09 143- 
158 


1318 


DM01724 


kw ALLERGEN POLLEN CIM1 HOL- 
LL 


DM01724 8.14 7.465e-10 17-37 
DM01724 8.14 4.434e-09 19-39 
DM01724 8.14 6.684e-09 10-30 


1318 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 8.763e-09 132- 
160 


1318 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 1.724e-12 124-170 
BL00203 13.94 2.034e-12 119-165 
BL00203 13.94 4.207e-12 1 15-161 
BL00203 13.94 4.931e-12 93-139 
BL00203 13.94 3.275e-ll 109-155 
BL00203 13.94 3.967e-ll 120-166 
BL00203 13.94 3.298e-10 114-160 
BL00203 13.94 4.064e-10 35-81 
BL00203 13.94 4.447e-10 100-146 
BL00203 13.94 5.117e-10 98-144 
BL00203 13.94 7.989e-10 83-129 
BL00203 13.94 9.426e-10 125-171 
BL00203 13.94 2.929e-09 34-80 
BL00203 13.94 2.929e-09 94-140 
BL00203 13.94 3.663e-09 32-78 
BL00203 13.94 4.490e-09 88-134 
BL00203 13.94 4.582e-09 110-150 
BL00203 13.94 4.673e-09 101-147 
BL00203 13.94 4.857e-09 104-150 
BL00203 13.94 5.592e-09 43-89 
BL00203 13.94 6.694e-09 111-157 
BL00203 13.94 8.898e-09 55-101 
BL00203 13.94 9.082e-09 99-145 


1318 


BL00243 


Integrins beta chain cysteine-rich 


BL00243I 31.77 1.662e-10 68-111 
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domain proteins. 


BL00243I 31.77 3.912e-10 38-81 
BL00243I 31.77 5.500e-10 58-101 
BL00243I 31.77 8.941e-10 48-91 
BL00243I 31.77 6.324e-09 25-68 
BL00243I31.77 7.465e-09 118-161 
BL00243I 31.77 9.239e-09 61-104 
BL00243I 31.77 9.620e-09 98-141 


1319 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDL 


PD00066 13.92 2.800e-14 107-120 
PD00066 13.92 2.800e-14 219-232 
PD00066 13.92 4.000e-13 79-92 
PD00066 13.92 4.000e-12 51-64 
PD00066 13.92 9.143e-12 191-204 
PD00066 13.92 2.038e-10 163-176 


1319 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 3.250e-13 203-220 
BL00028 16.07 8.650e-13 35-52 
BL00028 16.07 9.550e-13 91-108 
BL00028 16.07 3.739e-12 147-164 
BL00028 16.07 4.522e-12 119-136 
BL00028 16.07 4.462e-ll 175-192 
BL00028 16.07 8.500e- 10 63-80 


1319 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 4.706e-12 60-74 
PR00048B 6.02 6.000e-12 104-114 
PR00048B 6.02 1.000e-ll 216-226 
PR00048A 10.52 3.368e-ll 88-102 
PR00048B 6.02 7.231e-ll 160-170 
PR00048A 10.52 7.632e-ll 32-46 
PR00048B 6.02 1.000e-10 188-198 
PR00048A 10.52 2.565e-10 172- 
186 PR00048A 10.52 2.957e-10 
200-214 PR00048B 6.02 1.000e-09 
48-58 PR00048A 10.52 4.240e-09 
144-158 PR00048B 6.02 6.684e-09 
76-86 PR00048B 6.02 7.632e-09 
132-142 PR00048A 10.52 8.560e- 
09 116-130 


1319 


PD02462 


PROTEIN BOLA TRANSCRIPTION 
REGULATION AC. 


PD02462A 22.48 9.122e-09 168- 
203 


1320 


PR00449 


TRANSFORMING PROTEIN P21 
RAS SIGNATURE 


PR00449A 13.20 8.800e-15 4-26 


1320 


BL01115 


GTP-binding nuclear protein ran 
proteins. 


BL01115A 10.22 3.250e-09 4-48 


1321 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 5.055e-ll 71-117 
BL00203 13.94 8.439e-09 146-192 
BL00203 13.94 9.082e-09 53-99 


1327 


BL00799 


Granulins proteins. 


BL00799H 14.15 9.667e-09 18-59 


1329 


PF00711 


Beta defensins. 


PF00711 15.76 7.915e-ll 60-93 


1329 


PR00858 


CRUSTACEAN 

METALLOTHIONEIN SIGNATURE 


PR00858B 5.93 1.479e-09 55-74 


1329 


BL00317 


WAP-type 'four-disulfide core' domain 
proteins. 


BL00317B 14.58 2.216e-09 63-85 


1329 


DM01724 


kw ALLERGEN POLLEN CIM1 HOL- 
LI. 


DM01724 8.14 4.484e-12 19-39 
DM01724 8.14 4.484e-12 23-43 
DM01724 8.14 4.484e-12 27-47 
DM01724 8.14 4.484e-12 31-51 
DM01724 8.14 4.296e-10 35-55 
DM01724 8.14 4.197e-09 1 1-31 
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DM01724 8.14 5.382e-09 7-27 
DM01724 8.14 6.803e-09 15-35 


1329 


PR00872 


DIPTERA (DROSOPHILA) 
METALLOTHIONEIN SIGNATURE 


PR00872A 5.17 7.409e-09 99-112 


1329 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 7.429e-13 47-93 
BL00203 13.94 1.989e-ll 54-100 
BL00203 13.94 3.967e-ll 50-96 
BL00203 13.94 8.372e-10 51-97 
BL00203 13.94 8.947e-10 58-104 
BL00203 13.94 1.184e-09 75-121 
BL00203 13.94 3.204e-09 74-120 
BL00203 13.94 3.663e-09 57-103 
BL00203 13.94 4.673e-09 65-111 
BL00203 13.94 5.592e-09 70-116 
BL00203 13.94 6.786e-09 55-101 
BL00203 13.94 7.429e-09 40-86 


1329 


BL00264 


Neurohypophysial hormones proteins. 


BL00264 8.98 7.632e-09 94-121 


1329 


PD00866 


GLYCOPROTEIN PROTEIN SPIKE 
E2 PRECURSOR PEPLOMER. 


PD00866L 3.73 7.709e-10 74-84 
PD00866L 3.73 6.902e-09 4-14 
PD00866L 3.73 9.262e-09 1-11 


1329 


DM00864 


EGF-LIKE DOMAIN. 


DM00864A 15.21 9.486e-09 9-30 


1329 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243I 31.77 2.000e- 11 57-100 
BL00243I 31.77 5.897e-10 69-112 
BL00243I 31.77 1.254e-09 60-103 
BL00243I 31.77 7.465e-09 8-51 
BL00243I 31.77 1.000e-08 16-59 


1330 


BL00412 


Neuromodulin (GAP-43) proteins. 


BL00412D 16.54 2.957e-10 175- 
226 


1330 


DM01206 


CORONA VIRUS NUCLEOCAPSID 
PROTEIN. 


DM01206B 10.69 3.278e-09 70-90 
DM01206B 10.69 4.418e-09 105- 
125 


1330 


PF01140 


Matrix protein (MA), pi 5. 


PF01140D 15.54 5.291e-09 125- 
160 


1330 


PF00992 


Troponin. 


PF00992A 16.67 5.026e-09 183- 
218 PF00992A 16.67 8.934e-09 
191-226 


1330 


BL00048 


Protamine PI proteins. 


BL00048 6.39 7.107e-16 64-91 
BL00048 6.39 9.196e-16 63-90 
BL00048 6.39 1.132e-12 62-89 
BL00048 6.39 2.059e-12 66-93 
BL00048 6.39 3.250e-12 65-92 
BL00048 6.39 7.618e-12 92-119 
BL00048 6.39 2.625e-ll 60-87 
BL00048 6.39 6.500e-ll 113-140 
BL00048 6.39 6.750e-ll 78-105 
BL00048 6.39 6.875e-ll 104-131 
BL00048 6.397.125e-ll 112-139 
BL00048 6.39 8.625e-ll 74-101 
BL00048 6.39 2.539e-10 108-135 
BL00048 6.39 4.434e-10 61-88 
BL00048 6.39 5.855e-10 110-137 
BL00048 6.39 6.921e-10 98-125 
BL00048 6.39 7.158e-10 109-136 
BL00048 6.39 7.750e-10 97-124 
BL00048 6.39 8.105e-10 79-106 
BL00048 6.39 8.579e-10 19-46 
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BL00048 6.39 8.934e-10 94-121 








BL00048 6.39 9.526e-10 103-130 








BL00048 6.39 1.675e-09 101-128 








BL00048 6.39 1.900e-09 73-100 








BL00048 6.39 3.250e-09 81-108 








BL00048 6.39 3.475e-09 111-138 








BL00048 6.39 3.700e-09 82-109 








BL00048 6.39 3.700e-09 96-123 








BL00048 6.39 4.263e-09 99-126 








BL00048 6.39 5.163e-09 107-134 








BL00048 6.39 5.275e-09 67-94 








BL00048 6.39 5.275e-09 80-107 








BL00048 6.39 5.388e-09 49-76 








BL00048 6.39 6.738e-09 116-143 








BL00048 6.39 8.650e-09 52-79 








BL00048 6.39 8.763e-09 18-45 








BL00048 6.39 9.100e-09 21-48 






< 


BL00048 6.39 9.550e-09 76-103 








BL00048 6.39 9.550e-09 100-127 








BL00048 6.39 9.663e-09 102-129 








T-*T r\r\r\ a ct f <*> /"» t f\f\r\~ f~\0 T7 1 A /} 

BL00048 6.39 1.000e-08 77-104 


1331 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290B 13.17 7.750e-09 74 r 92 


1332 


BL00280 


Pancreatic trypsin inhibitor (Kunitz) 
family proteins. 


BL00280 24.61 2.154e-28 83-127 


1332 


PR00759 


BASIC PROTEASE (KUNITZ-TYPE) 
INHIBITOR FAMILY SIGNATURE 


PR00759B 11.26 3.323e-10 101- 
112 PR00759C 14.15 2.125e-09 
111-127 


1333 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 4.162e-34 8-47 


1333 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDL 


PD00066 13.92 2.800e-14 273-286 
PD00066 13.92 2.800e-14 301-314 
PD00066 13.92 2.800e-14 329-342 
PD00066 13.92 2.800e-14 357-370 
PD00066 13.92 5.500e-13 245-258 
PD00066 13.92 1.783e-l 1 217-230 
PD00066 13.92 8.043e-ll 189-202 


1333 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 6.850e-13 257-274 
BL00028 16.07 2.957e-12 201-218 
BL00028 16.07 4.522e-12 341-358 
BL00028 16.07 5.304e-12 285-302 
BL00028 16.07 1.346e-ll 369-386 
BL00028 16.07 2.038e-ll 229-246 
BL00028 16.07 4.900e-10 313-330 


1333 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 2.286e-13 198- 
212 PR00048A 10.52 2.929e-13 
254-268 PR00048A 10.52 2.929e- 
13 282-296 PR00048A 10.52 
5.500e-l3 366-3 80 rKUUU4ol3 o.uz 
6.400e-l3 354-364 PR00048A 
10.52 8.941e-12 226-240 
PR00048B 6.02 1.000e-ll 326-336 
PR00048B 6.02 5.846e-ll 242-252 
PR00048A 10.52 6.211e-ll 310- 
324 PR00048A 10.52 8.579e-ll 
338-352 PR00048B6.02 9.308e-ll 
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270-280 PR00048B 6.02 7.750e-10 
298-308 PR00048B 6.02 8.3 13e-10 
214-224 


1336 


PR00806 


VINCULIN SIGNATURE 


PR00806B 4.28 8.920e-09 333-347 
PR00806B 4.28 9.640e-09 332-346 


1337 


PR00806 


VINCULIN SIGNATURE 


PR00806B 4.28 8.920e-09 333-347 
PR00806B 4.28 9.640e-09 332-346 


1340 


BL01221 


PMP-22 / EMP / MP20 family proteins. 


BL01221B 13.29 6.745e-09 54-68 


1343 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 8.200e-12 208- 
231 


1348 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DM00179 13.97 1.947e-10 316-326 


1352 


DM00973 


3 kw RESISTANCE BENOMYL 
YLL028W CYCLOHEXIMIDE. 


DM00973A 21.17 2.306e-13 66- 
103 


1353 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019B 11.36 8.200e-10 190- 
204 PR00019A 11.19 7.667e-09 
193-207 


1356 


PD02327 


GLYCOPROTEIN ANTIGEN 
PRECURSOR IMMUNOGLO. 


PD02327B 19.84 2.091e-09 144- 
166 


1356 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DM00179 13.97 7.652e-09 291-301 


1357 


PF01130 


CD36 family. 


PF01130D 26.65 1.955e-23 337- 
391 PF01130C 22.91 1.851e-18 
255-305 PF01130B 21.40 1.537e- 
15 200-246 


1359 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 7.480e-10 160- 
183 BL00290B 13.17 2.875e-09 
226-244 


1366 


PR00007 


COMPLEMENT C1Q DOMAIN 
SIGNATURE 


PR00007A 19.33 6.936e-13 274- 
301 PR00007C 15.60 9.250e-13 
349-371 PR00007B 14.16 9.372e- 
13 301-321 PR00007D9.64 
5.500e-ll 381-392 


1366 


BL01113 


CI q domain proteins. 


BL01113B 18.26 1.675e-24 280- 
316 BL01113A 17.99 4.194e-15 
200-227 BL01113D7.47 3.250e-13 
383-393 BL01113A 17.99 3.919e- 
13 191-218 BL01113C 13.18 
9.294e-12 349-369 BL01113A 
17.99 5.500e-ll 185-212 
BL01113A 17.99 6.727e-ll 182- 
209 BL01113A 17.99 8.773e-ll 
203-230 BL01113A 17.99 3.681e- 
10 188-215 BL01113A 17.99 
6.936e-10 176-203 BL01113A 
17.99 7.319e-10 194-221 
BL01113A 17.99 4.635e-09 209- 
236 BL01113A 17.99 5.500e-09 
179-206 


1366 


BL00420 


Speract receptor repeat proteins domain 
proteins. 


BL00420A 20.42 3.250e-12 185- 
214 BL00420A 20.42 2.623e-10 
203-232 BL00420A 20.42 2.385e- 
09 191-220 BL00420A 20.42 
6.400e-09 182-211 BL00420A 
20.42 7.646e-09 188-217 
BL00420A 20.42 8.200e-09 173- 
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202 


1366 


PR00524 


^^™^# !■* W' B^™^ F ^P" ^P""* JIM ^BW ^» — — > - - ■ -V BBBfBB ^B> ^^BBh- ^BBBfc k 

CHOLECYSTOKININ TYPE A 
RECEPTOR SIGNATURE 


PR00524F 5.36 1.766e-09 200-214 
PR00524F 5.36 8.564e-09 188-202 


1367 


PR00794 


PANCREATIC RIBONUCLEASE 
FAMILY SIGNATURE 


PR00794D 17.96 8.800e-17 94-117 
PR00794A 14.47 6.897e-16 46-66 
PR00794B 16.95 5.655e-14 66-86 


1367 


BL00127 


Pancreatic nbonuclease family proteins. 


BL00127B 26.57 9.357e-29 46-91 
BL00127C 31.49 9.763e-22 82-126 
BL00127A 12.84 8.200e-10 31-41 


1368 


BL00067 


3-nydroxyacyl-CoA dehydrogenase 
, proteins. 


BL00067A 21.37 9.053e-10 53-84 


1 O/CO 

lioo 


t"»t r\r\^'~i'H 

BL00677 


D-ammo acid oxidases proteins. 


BL00677A 6.96 3.821e-09 55-68 


1368 


BL00982 


Bacterial-type phytoene dehydrogenase 
proteins. 


BL00982A 18.41 4.000e-09 57-89 


1369 


BL00067 


3 -hydr oxyacyl -Co A dehydrogenas e 
proteins. 


BL00067A 21.37 9.053e-10 53-84 


1369 


BL00677 


D-amino acid oxidases proteins. 


BL00677A 6.96 3.821e-09 55-68 


1369 


BL00982 


Bacterial-type phytoene dehydrogenase 
proteins. 


BL00982A 18.41 4.000e-09 57-89 


1371 


PR00762 


CHLORIDE CHANNEL SIGNATURE 


PR00762C 9.29 5.200e-20 203-223 
PR00762A 14.22 8.364e-19 108- 
126 PR00762D 11.29 9.379e-19 
440-461 PR00762B 12.12 7.750e- 
16 139-159 PR00762E 12.07 
2.286e-15 475-492 PR00762F 
15.12 6.559e-15 493-513 
PR00762G 14.13 1.692e-ll 531- 
546 


1371 


PD00866 


GLYCOPROTEIN PROTEIN SPIKE 
E2 PRECURSOR PEPLOMER. 


PD00866L 3.73 3.951e-09 9-19 


1371 


PD00120 


REPEAT PRECURSOR 
GLYCOPROTEIN EG. 


PD00120A 12.94 4.750e-09 6-19 


1372 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 4.349e-18 100- 
122 


1372 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 2.800e-13 131- 
171 


1372 


PR00642 


EDGl ORPHAN RECEPTOR 
SIGNATURE 


PR00642D 9.70 5.065e-12 89-104 


1372 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237C 15.69 6.400e-09 145- 
168 


1374 


BL01022 


PTR2 family proton/oligopeptide 
symporters proteins. 


BL01022B 22,19 2.241e-15 74-120 
BL01022A 11.58 2.212e-12 44-63 
BL01022C 16.62 4.919e-10 160- 
184 


1374 


PR00308 


TYPE I ANTIFREEZE PROTEIN 
SIGNATURE 


PR00308C 3.83 2.169e-09 20-30 


1375 


PR00705 


PAPAIN CYSTEINE PROTEASE (CI) 
FAMILY SIGNATURE 


PR00705A 10.55 4.000e-21 132- 
148 PR00705B 10.22 2.385e-10 
276-287 


1375 


BL00139 


Eukaryotic thiol (cysteine) proteases 
cysteine proteins. 


BL00139D 9.24 1.818e-18 295-312 
BL00139A 10.29 1.000e-14 132- 
142 BL0O139C9.23 2.8O0e-10 
275-285 


1375 


PR00704 


CALPAIN CYSTEINE PROTEASE 
(C2) FAMILY SIGNATURE 


PR00704C 11.88 6.162e-09 132- 
149 


1376 


PR00237 


RHODOPSIN-LIKE GPCR 


PR00237C 15.69 3.333e-12 111- 
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SUPERFAMILY SIGNATURE 


134 PR00237E 13.03 5.667e-ll 
193-217 PR00237F 13.57 1.474e- 
09 397-422 PR00237B 13.50 
4.750e-09 66-88 


1 inc. 
13/0 


BL00237 


G-protem coupled receptors proteins. 


BL00237A 27.68 9.625e-19 97-137 
BL00237C 13.19 8.846e-09 392- 
419 BL00237B5.28 9.182e-09 
201-213 


1378 


BL00649 


G-protein coupled receptors family 2 
proteins. 


BL00649C 17.82 5.773e-10 325- 
351 


YD /O 




aECKEl IN -LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00249C 17.08 4.130e-09 327- 
351 




JrJJUlo/o 


AN 1 ICjEN MELANOMA- 
ASSOCIATED MULTIGENE 
FAMILY TUM. 


PD01876A 12.01 2.964e-26 1-42 
PD01876C 21.73 2.343e-25 99-152 
PD01876B 14.18 2.500e-12 62-96 


1382 


DM01724 


kw ALLERGEN POLLEN CIM1 HOL- 

r t 
LI. 


DM01724 8.14 3.789e-10 291-311 


1382 


PR00213 


MYELIN P0 PROTEIN SIGNATURE 


PR00213E 5.51 8.969e-09 263-288 


1382 


PD01849 


CHORION CLASS PRECURSOR 
EGGSHELL REPEAT MULTIGEN. 


PD01849C 11.22 9.625e-09 265- 
293 


1388 


BL00170 


Cyclophilin-type peptidyl-prolyl cis- 
trans isomerase signatur. 


BL00170B 20.97 1.000e-40 88-128 
BL00170C 18.49 1.409e-37 135- 
180 BL00170A 17.08 6.850e-22 
58-85 


1388 


PR00153 


CYCLOPHILIN PEPTIDYL-PROLYL 
CIS-TRANS ISOMERASE 
SIGNATURE 


PR00153C 11.01 2.800e-18 136- 
152 PR00153B 11.57 8.500e-17 
93-106 PR00153D 11.99 4.000e-16 
151-164 PR00153A 12.98 4.789e- 
16 64-80 PR00153E 9.10 2.980e- 
10 164-180 


i o r» a 

1390 


BL00237 


G-protem coupled receptors proteins. 


BL00237C 13.19 6.571e-16 147- 
174 BL00237A 27.68 2.976e- 11 
13-53 BL00237B 5.28 2.421e-10 
102-114 


1390 


PR00529 


GONADOTROPHIN RELEASING 
HORMONE RECEPTOR 
SIGNATURE 


PR00529D 23.59 7.380e-13 2-18 
PR00529H 10.07 1.000e-10 176- 
197 PR00529F 5.98 5.138e-10 75- 
90 


1390 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237F 13.57 7.429e- 15 152- 
177 PR00237E 13.03 3.250e-14 
94-118 PR00237D 8.94 7.000e-09 
58-80 PR00237G 19.63 8.1 16e-09 
192-219 




rJJUZ44o 


IRANbCRlPTlON PROTEIN DNA- 
BINDIN. 


PD02448A 9.37 9.195e-09 34-73 


1392 


BL00216 


Sugar transport proteins. 


BL00216B 27.64 3.732e-ll 196- 
246 


1393 


PR00237 


RHODOPSIN-LIKE GPCR 

5sT TPFRPA1V/TTT V QTfTMATT IP R 


PR00237C 15.69 9.333e-12 143- 
loo 


1393 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 3.200e-10 129- 
169 


1393 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 2.429e-12 116- 
138 PR00245B 1038 7.525e-ll 
216-231 PR00245E 12.40 2.019e- 
10 330-345 PR00245C7.84 
7.286e- 10 277-293 | 
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1397 


PR00014 


FIBRONECTIN TYPE III REPEAT 
SIGNATURE 


PR00014C 15.44 1.783e-09 585- 
604 PR00014A8.22 3.045e-09 
747-757 PR00014C 15.44 6.087e- 
09 683-702 


1397 


BL00790 


Receptor tyrosine lcinase class V 
proteins. 


BL00790I 20.01 1.750e-12 1016- 
1047 BL007901 20.01 6.125e-12 
694-725 BL00790I 20.01 6.679e- 
09 596-627 


1397 


PR00096 


GLUTAMINE 
AMIDOTRANSFERASE 

ni TT1T7TJ T? A A KIT ~\T C\ T/~<"K T A T>T TT> T~> 

SUPERFAMILY SIGNATURE 


PR00096B 9.72 9.827e-09 1063- 
1075 


1401 


PD00866 


GLYCOPROTEIN PROTEIN SPIKE 

r~> TITI T7 /^T TT> O /~\T> T»T"7T»T All mn 

E2 PRECURSOR PEPLOMER. 


PD00866L 3.73 8.820e-09 51-61 


1402 


PR00250 


FUNGAL PHEROMONE MATING 
FACTOR STE2 GPCR SIGNATURE 


PR00250G 9.58 9.746e-09 21-43 


1404 


PD01652 


RECEPTOR CELL NK 
GLYCOPROTEIN IMMUNOGLOB, 


PD01652A 15.35 6.625e-10 24-60 
PD01652B 8.50 1.836e-09 14-66 
PD01652B 8.50 4.021e-09 111-163 


1409 


BL00240 


Receptor tyrosine kinase class III 
proteins. 


BL00240B 24.70 8.250e-12 268- 
292 


1409 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DM00179 13.97 3.842e-10 312-322 


1409 


PD02870 


RECEPTOR INTERLEUKIN- 1 
PRECURSOR. 


PD02870B 18.83 5.500e-10 421- 
454 PD02870D 15.74 6.266e-09 
182-217 


1409 


t»t* i 

PR00021 


SMALL PROLINE-RICH PROTEIN 
SIGNATURE 


PR00021A 4.31 8.405e-09 496-509 


1414 


tit /"\rv> 1 ^ 

BL00215 


Mitochondrial energy transfer proteins. 


BL00215A 15.82 3.455e-18 46-71 
BL00215B 10.44 8.714e-12 183- 
196 BL00215A 15.82 7.319e-ll 
143-168 


1414 


t»t> nnnT7 
rKUl)y2 / 


A TVT7XTTX.TT? 'XTT T/""1T TATTTM^ 

ADENINE NUCLEOTIDE 

TO A XTCT (~~\f~^ A Tf~"*"n> 1 PT/^M A TT TO tj 

IKAJNolAJOAl UK 1 MUJNAIUKL 


PR00927E 14.93 6.040e-09 74-96 


1414 


BL00439 


Acyltransferases ChoActase / COT / 
CPT family proteins. 


BL00439A 9.40 8.279e-09 318-335 


1416 


PD01270 


RECEPTOR FC 
IMMUNOGLOBULIN AFFIN. 


PD01270D 24.66 9.286e-12 12-48 


1417 


BL00518 


Zinc finger, C3HC4 type (RING 
finger), proteins. 


BL00518 12.23 2.714e-10 42-51 


1417 


BL01282 


BIR repeat proteins. 


BL01282B 30.49 5.114e-09 23-62 


1418 


BL00604 


Synaptophysin / synaptoporin proteins. 


BL00604A 9.13 7.035e-29 41-96 
BL00604E 8.32 4.759e-24 201-243 
BL00604C 14.66 8.807e-17 134- 
166 BL00604B9.95 5.154e-16 
104-134 BL00604D 12.28 5.100e- 
14 166-201 


1418 


PR00220 


SYNAPTOPHYSIN/SYNAPTOPORIN 
FAMILY SIGNATURE 


PR00220A 10.93 1.353e-21 38-61 
PR00220E 3.46 4.150e-18 216-235 
PR00220D 8,32 4.575e-15 149-173 
PR00220C 1 1.05 5.557e-14 117- 
142 PR00220B 15.48 2.703e-13 
62-88 


1421 


BL01219 


Ammonium transporters proteins. 


BL01219D 11.63 2.957e-10 217- 
241 BL01219F 15.24 8.809e-09 
289-314 


1421 


PR00342 


RHESUS BLOOD GROUP PROTEIN 


PR00342G8.18 1.458e-19 220-239 
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SIGNATURE 


PR00342B 11.09 8.657e-13 61-79 
PR00342D 8.46 2.857e-12 128-145 
PR00342H 7.61 6.927e-l 1 250-273 
PR00342C 10.10 4.770e- 10 90-108 
PR00342E 14.49 5.950e-10 151- 
175 PR00342F7.02 1.556e-09 
185-201 PR00342L 7.61 9.600e-09 
352-378 


1423 


PR00794 


PANCREATIC RIBONUCLEASE 
FAMILY SIGNATURE 


PR00794C 17.32 8.07 le- 14 96-115 
PR00794B 16.95 6.455e-13 7 1 -9 1 
PR00794A 14.47 9.538e-13 51-71 
PR00794D 17.96 9 368e-10 117- 
140 


1423 


BL00127 


Pancreatic nbonuclease family proteins, 


BL00127B 26.57 2.895e-23 51-96 
BLU0127C 3 1 .49 3.045e-l 1 105- 
149 BL00127A 12.84 2.161e-09 
36-46 


1426 


PR00830 


ENDOPEPTIDASE LA (LON) 

ODD TXIP nn Z^T'E? A on /01/C\ 

SERINE PRO 1 EASE (SI 6) 
SIGNATURE 


PR00830A8.41 1.143e-ll 165-185 


1426 


BL0U1 13 


Adenylate kinase proteins. 


BLUU113A 12.74 4.4oze-U9 4 / 1- 
488 


1426 


BL00674 


AAA-protein family proteins. 


BL00674B 4.46 8.905e-O9 467-489 


1427 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019B 11.36 9.500e-ll 74-88 
PR00019A 11.19 5.696e-10 77-91 
PR00019B 1 1.36 6.400e-10 25-39 
PR00019B 11.36 5.320e-09 49-63 


1427 


PR00014 


FIBRONECTIN TYPE III REPEAT 
SIGNATURE 


PR00014C 15.44 8.043e-09 382- 
401 


1437 


PD01719 


PRECURSOR GLYCOPROTEIN 
SIGNAL RE. 


PD01719A 12.89 6.850e-15 889- 
917 PD01719A 12.89 6.727e-13 
643-671 PD01719A 12.89 1.913e- 
12 832-860 PD01719A 12.89 
4.261e-12 70U729 


t A A*\ 

1442 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


TlT» /\r\f\ 1 A A 11 1 A 1 OAA. 11 

PR00019A 1L19 l.900e-ll 106- 
120 


1443 


BL00353 


HMG1/2 proteins. 


BL00353B ll. 47 l.643e-32 72-122 


1 A A -i 

1443 


PR00886 


HIGH MOBILITY GROUP 
(HMG1/HMG2) PROTEIN 

OI/~*KT A Tl TDT7 


PR00886A 10.08 7. 218e-18 37-60 

nr> t\r\o 0£. o c\ oo 1 tc a„ in n 01 

PROO806B 9.8b l .750e-12 0U-0 1 

rKUtJoooU 1 1 .84 y.U 1 oe- 1 / IU4- 


1443 


BL00649 


G-protein coupled receptors family 2 
proteins. 


BL00649C 17.82 6.226e-ll 286- 
j i / 


1443 


PR00249 


SECRETIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00249C 17.08 1 .783e-09 288- 
312 PR00249D 13.09 5.143e-09 
327-353 


1446 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 2.373e-09 781-814 


1 A A Q 

144o 




r> D f^i T 1 XT t? DirU DDATCIM Q 

rKULINb-KlCH rRUlblN 3. 


DMUU21!) o. /oUe-Uy 4U/-^^D 


1448 


PF00761 


Polyomavirus coat protein. 


PF00761A 12.61 8.925e-09 1-26 


1449 


PR00180 


CELLULAR RETIN ALDEHYDE- 
BINDING PROTEIN SIGNATURE 


PR00180B 16.42 7.146e-0°236- 
261 


1452 


PR00926 


MITOCHONDRIAL CARRIER 
PROTEIN SIGNATURE 


PR00926F 17.75 4.706e-24 132- 
155 PR00926E 1 1 .70 7.000e-19 
89-108 PR00926D 10.53 9.308e-17 
41-60 PR00926F 17.75 9.859e-10 
35-58 
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1452 


BL00215 


Mitochondrial energy transfer proteins. 


BL00215A 15.82 8.235e-10 33-58 
BL00215B 10.44 6.400e-09 79-92 
BL00215B 10.44 8.200e-09 174- 
187 


1452 


PR00927 


ADENINE NUCLEOTIDE 
TRANSLOCATOR 1 SIGNATURE 


PR00927E 14.93 7.652e-22 67-89 
PR00927G 11.07 3.500e-15 178- 
194 PR00927F 11.79 6.21 le- 14 
128-145 PR00927D 11.02 4.000e- 
13 26-40 PR00927B 14.66 2.597e- 
12 164-186 PR00927E 14.93 
7.818e-ll 162-184 PR00927A7.98 
9.667e-09 28-41 


1456 


BL00239 


Receptor tyrosine kinase class II 
proteins. 


BL00239B 25.15 1.000e-40 1037- 
1085 BL00239E 17.14 1.000e-40 
1154-1204 BL00239F 28.15 
1.750e-31 1208-1253 BL00239C 
18.75 1.000e-24 1099-1122 
BL00239D 16.81 2.884e-15 1124- 
1150 


1456 


BL00107 


Protein kinases ATP-binding region 
proteins. 


BL00107B 13.31 1.900e-14 1182- 
1198 BL00107A 18.39 9.217e-14 
1112-1143 


1456 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109D 17.04 4.316e-23 1183- 
1206 PR00109B 12.27 4.857e-19 
1112-1131 PR00109E 14.41 
5.500e-16 1227-1250 PR00109C 
12.85 4.000e-13 1164-1175 
PR00109A 15.00 8.962e-ll 1066- 
1080 


1456 


BL00790 


Receptor tyrosine kinase class V 
proteins. 


BL00790O 7.68 4.532e-18 1161- 
1194 BL00790K9.30 5.091e-16 
1010-1064 BL00790H 13.42 
1.466e-15 830-856 BL00790M 
8.74 9.203e-12 1103-1125 
BL00790Q 15.61 5.200e-ll 1219- 
1268 BL00790P 12.33 8.057e-10 
1194-1219 


1456 


BL00240 


Receptor tyrosine kinase class III 
proteins. 


BL00240F 17.74 3.875e-26 1153- 
1201 BL00240C 22.58 4.414e-16 
978-1027 BL00240E 11.56 5. 061e- 
15 1098-1136 BL00240G 28.45 
8.909e-10 1200-1253 


1456 


PR00014 


FIBRONECTIN TYPE III REPEAT 
SIGNATURE 


PR00014D 12.04 2.350e-09 893- 
908 PR00014B 14.77 7.000e-09 
842-853 


1456 


BL50001 


Src homology 2 (SH2) domain proteins 
profile. 


BL50001B 17.40 9.640e-09 1109- 
1130 


1457 


PF01307 


Plant virus coat protein. 


PF01307A 21.19 3.025e-09 321- 
351 


1462 


BL00546 


Matrixins cysteine switch. 


BL00546C 16.41 1.643e-33 232- 
264 BL00546B20.il 8.895e-29 
18-62 BL00546A 19.62 7.667e-26 
91-121 BL00546B 20.11 7.720e-19 
180-224 BL00546D 10.34 l.OOOe- 
15 273-287 BL00546E 10.23 
4.529e-15 309-330 
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1462 


BL00024 


Hemopexin domain proteins. 


BL00024D 17.28 7.375e-34 232- 
264 BL00024C 22.98 5.091e-32 
17-66 BL00024B 21.53 3.143e-23 
130-164 BL00024C 22.98 8.732e- 
22 179-228 BL00024E 7.58 
5.500e-15 273-287 BL00024F 
11.30 1.900e-14 309-330 
BL00024A 11.49 9.100e-13 111- 
122 


1462 

* 


BL00142 


Neutral zinc metallopeptidases, zinc- 
binding region proteins. 


BL00142 8.38 1.000e-12 238-249 


1462 


PR00138 


MATRIXIN SIGNATURE 


PR00138D 16.56 6.500e-30 238- 
264 PR00138C 16.41 5.846e-16 
18-47 PR00138A 15.14 7.136e-16 
111-125 PRO0138E6.01 7.000e-15 
273-287 PR00138B 15.82 3. 824e- 
11 156-172 


1462 


PR00480 


ASTACIN FAMILY SIGNATURE 


PR00480B 15.41 6.045e-10 233- 
252 


1475 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354Z 9.06 9.566e-17 216-265 
DM01354Z 9.06 2.534e-15 161-210 


1478 


PR00185 


EUKARYOTIC PORIN SIGNATURE 


PR00185D 16.54 1.136e-ll 111- 
129 


1482 


PR00449 


TRANSFORMING PROTEIN P21 
RAS SIGNATURE 


PR00449E 13.50 4.971e-ll 464- 
487 


1484 


PR00879 


FISH ACETYLCHOLINESTERASE 
SIGNATURE 


PR00879A 6.28 1.000e-08 39-45 


1487 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354Q 9.23 1.000e-40 283- 
338 DM01354T9.48 4.882e-39 
389-432 DM01354R8.50 8.380e- 
28 338-368 DM01354P9.18 
2.957e-26 249-283 DM01354S 
11.61 1.000e-18 368-389 


1489 


PD02327 


GLYCOPROTEIN ANTIGEN 
PRECURSOR IMMUNOGLO. 


PD02327C 15.47 3.538e-09 60-75 


1491 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354B 14.71 6.426e-14 271- 
299 


1493 


PR00502 


MUTT DOMAIN SIGNATURE 


PR00502B 15.92 4.643e-09 348- | 
364 


1494 


PF00997 


Kappa casein. 


PF00997D 9.95 8.306e-09 117-152 


1495 


DM01162 


BARLEY YELLOW DWARF VIRUS 
RNA-DIRECTED RNA 
POLYMERASE. 


DM01 162B 11.50 9.743e-09 104- 
113 


1497 


DM0003 1 


IMMUNOGLOBULIN V REGION. 


DM00031B 15.41 9.757e-10 79- 
113 


1498 


PR00069 


ALDO-KETO REDUCTASE 
SIGNATURE 


PR00069D 19.36 1.659e-19 260- 
290 


1498 


BL00798 


Aldo/keto reductase family proteins. 


BL00798F 23.30 5.125e-25 314- 
363 BL00798E 20.32 7.4Dze-24 
256-294 BL00798A 14.97 3.000e- 
10 207-222 


1499 


BL00826 


MARCKS family proteins. 


BL00826B 12.51 2.116e-09 545- 
597 


1499 


BL00970 


Nuclear transition protein 2 proteins. 


BL00970C 14.80 5.538e-09 567- 
605 


1499 


DM00303 


6 LEA 1 1-MER REPEAT REPEAT. 


DM00303 A 13.20 8.327e-09 574- 
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624 


1499 

■■ 


PD01457 


RIBOSOMAL PROTEIN 40S ZINC- 
FINGER METAL. 


PD01457A 16.51 7.708e~ll 561- 
606 PD01457A 16.51 1.409e-10 
565-610 PD01457A 16.51 5.745e- 
10 559-604 PD01457A 16.51 
2.397e-09 558-603 PD01457A 
16.51 3.483e-09 567-612 
PD01457A 16.51 4.103e-09 560- 
605 PD01457A 16.51 8.060e-09 
563-608 PD01457A 16.51 8.293e- 
09 557-602 PD01457A 16.51 
8.914e-09 562-607 


1499 


BL00348 


p53 tumor antigen proteins. 


BL00348F 23.19 5.571e-09 543- 
586 BL00348F 23.19 6.429e-09 
583-626 BL00348F 23.19 9.286e- 
09 560-603 


1501 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354X 13.86 6.230e-15 3-42 


1502 


BL01210 


Caveolins proteins. 


BL01210A 17.61 9.438e-14 132- 
171 


1503 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 9.571e-10 65-80 


1503 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 1.161e-10 53-86 
DM00215 19.43 8.393e-10 52-85 
DM00215 19.43 4.661e-09 47-80 
DM00215 19.43 5.729e-09 46-79 


1506 


DM01354 


lew TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354Z 9.06 8.548e-13 66-115 


1508 


PD01941 


TRANSMEMBRANE 
COTRANSPORTER SYMP. 


PD01941E 15.92 8.714e-23 198- 
245 PD01941F28.52 5.154e-ll 
496-55 1 


1510 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354N 13.17 1.000e-40 148- 
193 DM01354V 12.97 1.000e-40 
274-321 DM01354V 12.97 6.932e- 
40 218-265 DM01354O 8.73 
1.882e-19 199-245 DM01354W 
12.64 2.427e-13 265-285 
DM01354W 12.64 1.300e-ll 321- 
341 


1511 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 4.913e-19 230-269 


1511 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 4.600e-13 343-360 
BL00028 16.07 6.087e-12 371-388 


1511 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 3.250e-16 340- 
354 PR00048A 10.52 3.348e-10 
368-382 PR00048B6.02 4.938e-10 
384-394 


1511 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 3.077e-10 331-344 
PD00066 13.92 6.700e-09 359-372 


1517 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 8.875e-15 449-466 
BL00028 16.07 6.824e-14 309-326 
BL00028 16.07 7.353e-14 505-522 
BL00028 16.07 2.800e-13 1038- 
1055 BL00028 16.07 8.650e-13 
337-354 BL00028 16.07 1.783e-12 
926-943 BL00028 16.07 4.913e-12 
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982-999 BL00028 16.07 5.304e-12 
898-915 BL00028 16.07 5.304e-12 
1094-1111 BL00028 16.07 8.435e- 
12 421-438 BL00028 16.07 
3.077e-ll 393-410 BL00028 16.07 
3.423e-ll 954-971 BL00028 16.07 
3.769e-ll 365-382 BL00028 16.07 
4.462e-ll 870-887 BL00028 16.07 
5.154e-ll 281-298 BL00028 16.07 
4.000e-10 477-494 BL00028 16.07 
7.000e-10 1010-1027 BL00028 
16.07 7.000e-10 1066-1083 


1517 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 9.400e-30 76-115 
PD01066 19.43 7.525e-15 675-714 
PD01066 19.43 1.943e-09 716-755 


1517 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 5.500e-16 446- 
460 PR00048A 10.52 9.182e-15 
306-320 PR00048A 10.52 9.182e- 
15 1035-1049 PR00048A 10.52 
5. 500e-14 502-516 PR00048A 
10.52 7.750e-14 923-937 
PR00048A 10.52 1.643e-13 895- 
909 PR00048A 10.52 4.214e-13 
1091-1105 PR00048A 10.52 
4.857e-13 334-348 PR00048A 
10.52 4.706e-12 362-376 
PR00048B 6.02 5.000e-12 434-444 
PR00048A 10.52 6.294e-12 474- 
488 PR00048A 10.52 9.471e-12 
951-965 PR00048A 10.52 3.842e- 
11 979-993 PR00048A 10.52 
4.789e-ll 418-432 PR00048B 6.02 
9.308e-ll 1023-1033 PR00048B 
6.02 1.563e-10 294-304 PR00048B 
6.02 1.563e-10 883-893 PR00048B 
6.02 2. 125e-10 911-921 PR00048B 
6.02 2.688e-10 350-360 PR00048B 
6.02 8.3 13e-10 939-949 PR00048B 
6.02 9.438e-10 490-500 PR00048B 
6.02 1.000e-09 995-1005 
PR00048B 6.02 3.368e-09 462-472 
PR00048A 10.52 3.880e-09 390- 
404 PR00048B 6.02 4.3 16e-09 
1079-1089 PR00048A 10.52 
4.600e-09 867-881 


1517 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 2.800e-14 325-338 
PD00066 13.92 4.600e-14 437-450 
PD00066 13.92 5.200e-14 998- 
1011 PD00066 13.92 6.400e-14 
493-506 PD00066 13.92 8.800e-14 
942-955 PD00066 13.92 5.500e-13 
1026-1039 PD00066 13.92 6.000e- 
13 970-983 PD00066 13.92 
6.500e-13 297-310 PD00066 13.92 
6.500e-13 886-899 PD00066 13.92 
7.000e-13 465-478 PD00066 13.92 
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7.000e-13 914-927 PD00066 13,92 
9.000e-13 353-366 PD00066 13.92 

c\ c r\r\ „ 101 n o o 1 aac t>"Piaaa^£ 

9.500e-13 10o2-109o rDUUUoo 
13.92 4.808e-10 409-422 PD00066 
13.92 4.000e-09 381-394 PD00066 


1517 


BL00479 


Phorbol esters / diacyl glycerol binding 
domain proteins. 


BL00479A 19.86 7.319e-09 1025- 
1048 


1518 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


t^t /\AinA a on on T i r o« 1 o A o •'n 

BL00290A 20.89 7.iDoC-13 4Z-OD 
BL00290B 13.17 4.500e-12 98-116 


1519 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354K9.35 7.706e-28 71-103 
DM01354L 11.73 8.250e-14 103- 
130 


1521 


BL00250 


TGF-beta family proteins. 


BL00250A 21.24 2.500e-31 640- 
676 BL00250B 27.37 2.000e-27 
705-741 


1521 


BL01273 


CoA transferases proteins. 


BL01273C 12.!)4 l.UUUe-4U 13U- 
170 BL01273D 19.11 9.471e-28 
206-250 BL01273B 14.85 9.830e- 
20 81-115 B.L012/3A 
2.2 8 6e- 16 62-75 


1521 


PR00669 


INHIBIN ALPHA CHAIN 
SIGNATURE 


PR00669E lo. /.U/ee-li o4U- 
658 


1523 


BL00427 


Disintegrins proteins. 


T-»T i\r\ a 1 o no A 1 nn« i f /ice ■CIA 

BL00427 13.93 4.100e-lD 43S-D 1U 


1523 


PR00289 


DISINTEGRIN SIGNATURE 


PR00289A 13.62 3.700e-18 469- 
489 PR002o9J3 1 1. fy 3.143e-lZ 
498-511 


1523 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


nn/iAAl 1 T~\ 1 A AO A QOA« AO 

PR00011D 14.03 y.o/ue-uy o/O- 
694 


1524 


BL00383 


Tyrosine specific protein phosphatases 

• 

proteins. 


BL0O383E 10.35 8.683e-12 186- 

1 AT 

197 


1524 


PR00700 


PROTEIN TYROSINE 
PHOSPHATASE SIGNATURE 


PR00700D 12.47 3.100e-09 183- 

OAT 


1525 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237G 19.63 7.000e-17 35-62 


1525 


BL00237 


G-protein coupled receptors proteins. 


BL00237D 11.23 3.500e-14 45-62 


1525 


PR00424 


ADENOSINE RECEPTOR 
SIGNATURE 


PR00424F 8.50 2.688e-12 35-46 


1525 


PR00554 


ADENOSINE A2B RECEPTOR 
SIGNATURE 


PR00554F 8.86 1.750e-16 24-37 
PR00554G 14.18 7,288e-12 69-89 


1528 


BL00018 


EF-hand calcium-binding domain 
proteins. 


BL00018 7.41 9.217e-09 338-351 


1530 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354N 13.17 1.000e-40 36-81 

DM01354O 8.73 1.000e-40 87-133 

DM01354Q 9.23 1.000e-40 167- 

222 DM0 1354T 9.48 1.000e-40 

273-316 DM01354V 12.97 l.OOOe- 

40 382-429 DM01 354Y 10.69 
1 AAA.-. ac\ aqq ^ifi ni\/rn 1 

l.(JUue-4U 4:7 SO 3 o JJ1V1W 1 j j4A 

13.86 5.219e-39 449-488 
DM01354P 9.18 2.723e-37 133-167 
DM01354R 8.50 7.395e-32 222- 
252 DM01354W 12.64 3.000e-25 
429-449 DM01354S 11.61 9.550e- 
23 252-273 DM01354U 12.24 
2.350e-20 362-382 
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1530 


DM00895 


7 kw REVERSE TRANSCRIPTASE 
RNA POLYMERASE. 


DM00895E 15.72 3.889e-ll 149- 
174 


1532 


PR00036 


LACI BACTERIAL REGULATORY 
PROTEIN HTH SIGNATURE 


PR00036B 10.57 1.000e-08 282- 
293 


1533 


BL01160 


Kinesin light chain repeat proteins. 


BL01160D 10.17 7.077e-09 218- 
247 


1533 


PD00126 


PROTEIN REPEAT DOMAIN TPR 
NUCLEA. 


PD00126A 22.53 1.000e-08 485- 
506 


1535 


PR00671 


INHIBIN BETA B CHAIN 
SIGNATURE 


PR00671C 4.18 9.690e-09 44-64 


1538 


BL00107 


Protein kinases ATP-binding region 
proteins. 


BL00107A 18.39 9.217e-14 89-120 
BL00107B 13.31 6.786e-ll 159- 
175 


1538 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 6.766e-09 89-108 


1539 


PR00502 


MUTT DOMAIN SIGNATURE 


PR00502B 15.92 7.000e-09 131- 
147 


1540 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 1.786e-10 813-828 


1541 


DM01418 


352 FIBRILLAR COLLAGEN 
C ARBOXYL-TERMINA L. 


DM01418A 20.83 1.000e-40 631- 
679 DM01418C 20.48 L000e-40 
800-842 DM0 141 SB 22.51 4.103e- 
38 729-771 


1541 


BL00420 


Speract receptor repeat proteins domain 
proteins. 


BL00420A 20.42 2.180e-10 549- 
578 


1541 


BL01113 


Clq domain proteins. 


BL01113A 17.99 1.957e-10 555- 
582 BL01113A17.99 2.149e-10 
549-576 BL01113A 17.99 7.3 19e- 
10 552-579 


1542 


BL00279 


Membrane attack complex components 
/ perforin proteins. 


BL00279E37.il 9.053e-12 901- 
949 


1542 


PR00907 


THROMBOMODULIN SIGNATURE 


PR00907B 11.29 8.274e-ll 1016- 
1033 


1542 


PR00764 


COMPLEMENT C9 SIGNATURE 


PR00764F 16.89 3.836e-10 918- 
939 


1542 


DM00864 


EGF-LIKE DOMAIN. 


DM00864B 11.34 6.910e-10 1025- 
1044 


1542 


BL01187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01187B 12.04 4.150e-14 1020- 
1036 BL01187B 12.04 2.575e-09 
927-943 


1542 


BL01177 


Anaphylatoxin domain proteins. 


BL01177E 20.64 7.632e-09 1018- 
1045 


1542 


BL00799 


Granulins proteins. 


BL00799B 11.02 9.679e-09 1011- 
1047 


1542 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010A 11.79 5.154e-ll 912- 
924 PR00010C 11.16 7.545e-10 
1025-1036 PR00010C 11.16 
9.786e-09 932-943 


1543 


DM0003 1 


IMMUNOGLOBULIN V REGION. 


DM0003 IB 15,41 /.U3be-(Jy /y- 
113 


1544 


BL00972 


Ubiquitin carboxyl-terminal hydrolases 
family 2 proteins. 


BL00972A 11.93 6.318e-19 478- 
496 BL00972D 22.55 7.968e-16 
762-787 BL00972B 9.45 1.600e-12 
559-569 


1544 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 8.008e-13 235-250 
PR00049D 0.00 7.375e-12 239-254 
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PR00049D 0.00 5.916e-l 1 242-257 
PR00049D 0.00 6.748e-ll 236-251 
PR00049D 0.00 9.395e-l 1 240-255 
PR00049D 0.00 1.286e-10 233-248 
PR00049D 0.00 8.929e-10 241-256 
PR00049D 0.00 2.678e-09 243-258 
PR00049D 0.00 4.051e-09 237-252 
PR00049D 0.00 4.051e-09 238-253 
PR00049D 0.00 4.051e-09 244-259 


1544 


PR0021 1 


GLUTELIN SIGNATURE 


PR0021 IB 0.86 7.500e-09 238-259 


1544 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 5.339e-10 222-255 
DM00215 19.43 7.268e-10 226-259 
DM00215 19.43 2.525e-09 220-253 
DM00215 19.43 9.695e-09 221-254 


1544 


BL00048 


Protamine PI proteins. 


BL00048 6.39 9.888e-09 259-286 


1546 


PR00416 


EUKARYOTIC DNA 
TOPOISOMERASE I SIGNATURE 


PR00416B 15.58 5.000e-09 389- 
409 


1546 


BL00048 


Protamine PI proteins. 


BL00048 6.39 8.200e-09 510-537 


1546 


PR00138 


MATRIXIN SIGNATURE 


PR00138E 6.01 8.429e-09 146-160 


1546 


BL00024 


Hemopexin domain proteins. 


BL00024E 7.58 8.694e-09 146-160 


1546 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019B 1 1.36 4.857e-12 317- 
331 PR00019A 11.19 8.333e-09 
133-147 PR00019B 11.36 8.920e- 
09 269-283 


1547 


DM01595 


kw ALLANTOICASE SPACIF7.09C. 


DM01595B 10.54 2.625e-15 143- 
171 DM01595I8.912.714e-15 
457-475 DM01595D 10.94 3.274e- 
11 456-481 


1549 


PR00192 


F-ACTIN CAPPING PROTEIN BETA 
SUBUNIT SIGNATURE 


PR00192A 8.23 1.474e-27 47-68 


1549 


BL0023 1 


F-actin capping protein beta subunit 
proteins. 


BL00231A 8.59 1.829e-24 47-93 


1551 


BL00708 


Prolyl endopeptidase family serine 
proteins. 


BL00708B 24.91 7.197e-12 439- 
470 


1551 


PF0093O 


Dipeptidyl peptidase IV (DPP IV) N~ 
terminal region. 


PF00930I 15.96 6.373e-17 481-509 
PF00930H 20.16 2.482e-13 402- 
445 PF00930J8.78 1.000e-ll 533- 
554 PF00930G 21.30 9.613e-09 
362-400 


1552 


BL00878 


Orn/DAP/Arg decarboxylases family 2 
pyridoxal-P attachment si. 


BL00878F 19.67 7.231e-20 171- 
194 BL00878C 17.74 3.676e-ll 
38-58 


1552 


PR00163 


RUBREDOXIN SIGNATURE 


PR00163B 14.45 8.200e-09 143- 
160 


1555 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354V 12.97 1.000e-40 705- 
752 DM01354U 12.24 6.700e-22 
685-705 DM0 1354T 9.48 8.355e- 
15 596-639 DM01354W 12.64 
9. 122e-13 752-772 


1557 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 

* 


DM01354N 13.17 1.000e-40 158- 
203 DM01354O 8.73 1.000e-40 
209-255 DM01354Q9.23 l.OOOe- 
40 289-344 DM01354T 9.48 
1.000e-40 398-441 DM01354V 
12.97 1.000e-40 507-554 
DM01354P 9.18 5.213e-37 255-289 
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DM01354R 8.50 6.760e-28 344- 
374 DM01354L 11.73 1.200e-26 
101-128 DM01354S 11.61 7.300e- 
26 377-398 DM01354M 12.50 
6.094e-24 128-158 DM01354U 
12.24 6.700e-22 487-507 
DM01354K9.35 1.22Se-17 69-101 
DM01354W 12.64 l.UUUe-lo jj4- 
574 


1557 


DM00895 


7 kw REVERSE TRANSCRIPTASE 
RNA POLYMERASE. 


DM00895E 15.72 7.828e-10 271- 
296 


1558 


PR00834 


HTRA/DEGQ PROTEASE FAMILY 
SIGNATURE 


PR00834F 10.91 2.946e-09 194- 
207 


1558 


BL00741 


Guanine-nucleotide dissociation ; 
stimulators CDC24 family sign. 


BL00741B 14.27 2.962e-09 1104- 
1 127 


1558 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 4.814e-09 1318- 
1333 PR00049D0.00 5.729e-09 
259-274 


1558 


PR00554 


ADENOSINE A2B RECEPTOR 
SIGNATURE 


PR00554B 12.52 8.855e-09 917- 
926 


1560 


PF01140 


Matrix protein (MA), pi 5. 


PF01140D 15.54 3.700e-10 977- 
1012 


1565 


BL01226 


Hydroxymethylglutaryl-coenzyme A 
synthase proteins. 


BL01226A 13.79 1.000e-40 50-89 
BL01226C 13.51 1.000e-40 127- 
167 BL01226H 17.74 1.000e-40 
478-526 BL01226G 15.76 3.483e- 
32 384-413 BL01226B 13.35 
1.818e-31 95-127 BL01226b y./o 
8.714e-23 345-363 BL01226E 
13.74 7.716e-22 304-345 
BL01226D 11.60 5.680e-18 226- 
262 BL01226D 11. oU o.y4Ue-lU 
174-210 


1568 


BL00189 


2-oxo acid dehydrogenases 
acyltransferase component lipoyl bi. 


BL00189A 19.58 4.414e-13 103- 
138 


1574 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 4.661e-09 254-269 


1574 


PR00456 


RIBOSOMAL PROTEIN P2 
SIGNATURE 


PR00456E 3.06 6.241e-09 184-199 


1577 


PR00756 


MEMBRANE ALANYL 
DIPEPTIDASE (Ml) FAMILY 
SIGNATURE 


PR00756B 14.06 2.364e-13 232- 
248 PR00756C 11.60 7.300e-12 
310-321 


1579 


BL00048 


Protamine PI proteins. 


BL00048 6.39 3.605e-10 636-663 
BL00048 6.39 5.500e-10 630-657 
BL00048 6.39 1.225e-09 628-655 
BL00048 6.39 4.263e-09 648-675 
BL00048 6.39 5.388e-09 627-654 
BL00048 6.39 8.988e-09 626-653 


1580 


BL00284 


Serpins proteins. 


234 BL00284A 15.64 1.000e-18 
63-87 BL00284D 16.34 1.500e-14 
407-434 BL00284B 17.99 1.273e- 
12 162-183 


1581 


BL01052 


Calponin family repeat proteins. 


BL01052B 15.31 3.308e-ll 110- 
136 


1581 


PR00888 


SMOOTH MUSCLE 


PR00888C 12.27 2.141e-09 110- 
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PROTEIN/CALPONIN FAMILY 
SIGNATURE 


126 


1582 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019B 11.36 6.500e-ll 140- 
154 PR00019B 11.36 7.750e-10 
92-106 PR00019A 11.19 5.000e-09 
95-109 


1582 


PR00423 


CELL DIVISION PROTEIN FTSZ 
SIGNATURE 


PR00423E 7.36 9.550e-09 505-527 


1587 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 2.500e-10 277-294 


1587 


BL00518 


Zinc finger, C3HC4 type (RING 
finger), proteins. 


BL00518 12.23 8.286e-10 186-195 


1588 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354R8.50 7.750e-29 192- 
222 DM01354S 11.61 7.300e-26 
222-243 


1589 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354U 12.24 5.891e-19 62-82 
DM01 354V 12.97 L175e-09 82- 
129 


1590 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354Z 9.06 5.065e-13 167-216 


1591 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354R8.50 1.840e-18 141- 
171 


1591 


BL00300 


SRP54~type proteins GTP-binding 
domain proteins. 


BL00300C 25.57 7.100e-09 648- 
702 


1593 


BL00573 


Pyridine nucleotide- disulphide 
oxidoreductases class-II activ. 


BL00573A 9.65 7.273e-10 750-768 


1593 


PR00411 


PYRIDINE NUCLEOTIDE 
DISULPHIDE REDUCTASE CLASS-I 
SIGNATURE 


PR00411A 15.95 4.569e-09 747- 
770 


1593 


PR00350 


VITAMIN D RECEPTOR 
SIGNATURE 


PR00350E 11.55 5.546e-09 13-33 


1593 


PR00368 


FAD-DEPENDENT PYRIDINE 
NUCLEOTIDE REDUCTASE 
SIGNATURE 


PR00368A 17.76 7.589e-09 747- 
770 


1593 


BL01113 


Clq domain proteins. 


BL01113A 17.99 9.827e-09 484- 
511 


1593 


PR00308 


TYPE I ANTIFREEZE PROTEIN 
SIGNATURE 


PR00308C 3.83 8.892e-10 373-383 
PR00308C 3.83 1.701e-09 375-385 
PR00308A 5.90 4.282e-09 752-767 
PR00308A 5.90 6.824e-09 372-387 
PR00308C 3.83 L000e-08 324-334 


1594 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354X 13.86 6.595e-36 13-52 
DM01354Y 10.69 3.455e-09 62- 
102 


1595 


PR00756 


MEMBRANE ALANYL 
DIPEPTIDASE (Ml) FAMILY 
SIGNATURE 


PR00756D 10.58 7.000e-19 187- 
203 PR00756E 11.91 3.700c-15 
206-219 PR00756C 1 1.60 4.000e- 
12 151-162 


1595 


BL00142 


Neutral zinc metallopeptidases, zinc- 
binding region proteins. 


BL00142 8.38 2.286e-09 187-198 


1596 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354I 15.55 1.000e-40 119- 
160 DM01354V 12.97 1.209e-14 
401-448 DM01354H 18.00 5.280e- 
10 79-119 


1600 


BL00484 


Thyroglobulin type-1 repeat proteins 
proteins. 


BL00484C 17.01 3.854e-ll 96-111 
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1601 


PR00342 


RHESUS BLOOD GROUP PROTEIN 
SIGNATURE 


PR00342A 9.71 1.000e-23 12-31 


1608 


PD01941 


TRANSMEMBRANE 
COTRANSPORTER SYMP. 


PD01941E 15.92 1.000e-40 135- 
182 PD01941F 28.52 9.229e~ 17 
254-309 


1609 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354W 12.64 9.375e-16 123- 
143 


1614 


BL00420 


Speract receptor repeat proteins domain 
proteins. 


BL00420B 22.67 5.808e-26 49-104 
BL00420C 11.90 2.500e-10 134- 
145 


1614 


PR00258 


SPERACT RECEPTOR SIGNATURE 


PR00258B 9.63 7.947e-ll 64-76 
PR00258C 9.05 3.204e-10 79-90 
PR00258E 13.33 3.333e-10 133- 
146 PR00258D 14.41 3.286e-09 
110-125 


1616 


BL00326 


Tropomyosins proteins. 


BL00326D 8.76 1.505e-09 377-418 


1616 


PR00261 


LOW DENSITY LIPOPROTEIN 
(LDL) RECEPTOR SIGNATURE 


PR00261D 12.47 3.338e-10 43-65 
PR00261E 11.08 4.575e-10 84-106 
PR00261C 11.37 5.050e-10 43-65 
PR00261F 11.57 5.920e-10 43-65 
PR00261A 11.02 6.769e-10 43-65 
PR00261A 11.02 9.192e-10 84-106 
PR00261D 12.47 2.667e-09 84-106 
PR00261F 11.57 4.304e-09 84-106 
PR00261C 11.37 5,500e-09 84-106 
PR00261B 14.12 5.714e-09 84-106 
PR00261B 14.12 6464e-09 43-65 


1616 


BL01209 


LDL-receptor class A (LDLRA) 
domain proteins. 


BL01209 9.31 6.464e-09 93-106 


1616 


BL00412 


Neuromodulin (GAP-43) proteins. 

- 


BL00412D 16.54 5.613e-13 306- 
357 BL00412D 16.54 8.821e-12 

301- 352 BL00412D 16.54 1.098e- 
10 307-358 BL00412D 16.54 
9.609e-10 304-355 BL00412D 
16.54 1.184e-09 308-359 
BL00412D 16.54 1.918e-09 309- 
360 BL00412D 16.54 2.745e-09 

302- 353 BL00412D 16.54 5.684e- 
09 299-350 BL00412D 16.54 
8.347e-09 303-354 


1616 


PF01140 


Matrix protein (MA), pi 5. 


PF01140D 15.54 8.640e-09 319- 
354 


1616 


BL00422 


Granins proteins. 


BL00422C 16.18 8.085e-10 326- 
354 BL00422C 16.18 8.468e-10 
330-358 BL00422C 16.18 8.851e- 
10 323-351 BL00422C 16.18 
4.529e-09 325-353 BL00422C 
16.18 9.47 le-09 318-346 


1618 


PR00681 


RIBOSOMAL PROTEIN SI 
SIGNATURE 


-¥-»-«-<» /■\/"\/'"oitoo1 n om„ Art yl 1 1 All ' 

PR00681I 8.81 9. 897e-09 412-431 


1619 


BL00134 


Serine proteases, trypsin family, 
histidine proteins. 


BL00134A 11.96 3.813e-15 55-72 


1619 


PR00722 


CHYMOTRYPSIN SERINE 
PROTEASE FAMILY (SI) 
SIGNATURE 


PR00722A 12.27 8.448e-14 56-72 


1619 


BL00021 


Kringle domain proteins. 


BL00021B 13.33 1.763e-13 55-73 
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1619 


PR00839 


V8 SERINE PROTEASE FAMILY 
SIGNATURE 


PR00839B 1 1.20 4.945e-09 55-73 


1620 


BL00125 


Serine/threonine specific protein 
phosphatases proteins. 


BL00125C 19.97 1.000e-40 86-133 
BL00125D 33.11 9.679e-38 154- 
209 


1620 


PR00114 


SERINE/THREONINE 
PHOSPHATASE FAMILY 
SIGNATURE 


PR00114E 17.75 2.500e-27 106- 
134 PR00114D 12.34 9.591e-26 
77-104 PR00114F 17.51 3.400e-17 
170-191 PR00114G 17.20 3.676e- 
17 198-215 


1625 


BL01103 

* 


Aspartate-semialdehyde dehydrogenase 
proteins. 


BL01103F 16.78 3.625e-24 256- 
282 BL01103C 15.07 6.727e-20 
91-113 BL01103B 20.29 5.846e-16 
51-73 BL01103E 11.30 4.115e-14 
238-251 BL01103D7.29 1.000e-09 
155-166 


1626 


PD02410 


SYNTHETASE BETA CHAIN 
LIGASE PHENYL ALANYL-TRNA. 


PD02410E 10.29 2.241 e- 13 83-97 


1628 


BL00044 


Bacterial regulatory proteins, lysR 
family proteins. 


BL00044 19.03 7.188e~l 1 21-55 


1631 


BL00370 


PEP-utilizing enzymes phosphorylation 
site proteins proteins. 


BL00370C 11.73 9.182e-17 96-110 


1632 


PR00101 


ASPARTATE 

CARBAMOYLTRANSFERASE 
SIGNATURE 


PR00101C 12.35 9.328e-10 410- 
428 


1632 


BL00097 


Aspartate and ornithine 
carbamoyltransferases proteins. 


BL00097C 16.87 1.621e-09 411- 
425 


1634 


BL00530 


Ribonuclease T2 family histidine 
proteins 1. 


BL00530C 11.43 1.000e-17 101- 
113 


1643 


BL00050 


Ribosomal protein L23 proteins. 


BL00050A 23.71 3.659e-18 217- 
250 


1645 


PD01719 


PRECURSOR GLYCOPROTEIN 
SIGNAL RE. 


PD01719A 12.89 5.571e-14 479- 
507 PD0 17 19A 12.89 2.000e-09 
666-694 PD01719A 12.89 4.889e- 
09 606-634 PD01719A 12.89 
8.1 lle-09 419-447 


1647 


BL00564 


Argininosuccinate synthase proteins. 


BL00564D 22.43 2.723e-22 157- 
194 BL00564A 19.93 6.582e-ll 
37-74 


1651 


PR00453 


VON WILLEBRAND FACTOR TYPE 
A DOMAIN SIGNATURE 


PR00453A 12.79 9.719e-13 88-106 
PR00453B 14.65 1.818e-12 125- 
140 PR00453C 12.26 3.769e-10 
246-255 


1652 


DM00031 


IMMUNOGLOBULIN V REGION. 


DM0003 IB 15.41 9.471e-27 84- 
118 DM00031 A 16.80 7.055e-25 
206-254 DM00031A 16.80 5.368e- 
18 20-68 DM00031B 15.41 
8.519e-09 267-301 


1653 


PR00173 


GLUTAMATE-ASPARTATE 
SYMPORTER SIGNATURE 


PR00173F 10.44 8.967e-09 8-28 


1654 


BL01032 


Protein phosphatase 2C proteins. 


BL01032C 6.14 1.000e-10 118-128 


1657 


BL00523 


Sulfatases proteins. 


BL00523A 13.36 2.200e-13 47-64 


1658 


BL00194 


Thioredoxin family proteins. 


BL00194 12.16 4.857e-16 482-495 
BL00194 12.16 3.813e-15 347-360 


1658 


PR00421 


THIOREDOXIN FAMILY 
SIGNATURE 


PR00421B 11.40 3.348e-12 489- 
499 PR00421B 11.40 9.379e-ll 
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10 346-355 PR00421A 10.15 

A o^lo lO/lQI AQfi PRODA? 1 (P 
4.oD /e- 1U 4o l-*fyu rrvUUH-zi^ 

13.60 5.714e-l 0 533-545 


1662 


PF00850 


Histone deacetylase family. 


r»T?nAO^AT7 Q 08 1 7£6f» 01 1960- 

JrrUOoDUli o.oo l./O0e-^l l^o^- 

1295 PF00850F 15.70 1.889e-10 
1307-1340 PF00850G 22.75 
3.724e-10 1346-1388 


1666 


BL00317 


WAP-type Tour-disulfide core* domain 
proteins. 


BL00317B 14.58 7.231e-22 92-114 


1666 


PR00003 


4-DISULPHIDE CORE SIGNATURE 


PR00003C7.69 6.885e-ll 98-108 
PR00003D 8.10 4.971e-09 108-117 


1666 


BL00313 


Seminal vesicle protein I repeat 
proteins proteins. 


BL00313B 8.87 1.000e-40 31-68 
BL00313C 11.19 l.UUUe-io iuj- 
118 BL00313B 8.87 5.337e-17 25- 

s-r\ t->t rxni 1 id Q C7 7 ^Q^^ 1 f. 1 Q_ 

5o BLUU313r> 0.0/ i.yiue-u **o- 
80 BL00313B 8.87 9.413e~14 37- 
74 BL00313B 8.87 4.522e-09 49- 
qa tit nn^i ^ A in fin S SOOp-OQ 1 -9 

BL00313C 11.19 9.060e-09 30-45 


1667 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PFiHl Ofifi 1 0 A^K 5 108-147 


1667 


BL01277 


Ribonuclease PH proteins. 


dt ni 977 A 17 ^0 8 81 Re-09 64-102 


1672 


DM00372 


CARCINOEMBRYONIC ANTIGEN 
PRECURSOR AMINO-TERM1NAL 
DOMAIN. 


DM00372B 20.31 8.920e-15 363- 
/ins r\N/fnfn79R ?n 11 1 ^?Qe-12 

68-113 


1673 


PF00992 


Troponin. 


T>T?onOQO A 1 £ fi7 8 RS0f=-1 0 1 76- 
01 1 


1673 


BL00093 


N-4 cytosine-specific DNA methylases 

■ 

proteins. 


BL00093A 9.77 4.329e-09 718-733 


1673 


PR00948 


ELICITIN SIGNATURE 


PR00948B 15.44 5.361e-08 707- 


1 1673 


PR00558 


ALPHA-2A ADRbNbKCjlC 
RECEPTOR SIGNATURE 


PROfKSar fiSArfi 719e-08 429-449 


1676 


BL00962 


Ribosomal protein S2 proteins. 


m 00Qfi9R 16 15 6 S99e-14 53-107 


1678 


PD00015 


GLYCOPROTEIN PRECURSOR 
CELL SI. 


PD00015A 8.90 6.400e-09 35-43 


1678 


PR00795 


RYANODINE RECEPTOR 
SIGNATURE 


PR00795C 6.30 8.252e-09 107-132 


1683 


BL00262 


Insulin family proteins. 


dt n09^9 A 19 1 R9Qp-0Q in-4-S 


1684 


BL00061 


Short-chain dehydrogenases/reductases 
family proteins. 


BL00061B 25.79 8.105e-19 133- 

1^1 
1/1 


1684 


PR00080 


ALCOHOL DEHYDROGENASE 
SUPERFAMILY SIGNATURE 


PR00080C 17.16 2.125e-10 153- 

1/3 


hl684 


PR00081 


GLUCOSE/RIBITOL 
DEHYDROGENASE FAMILY 
SIGNATURE 


PR00081A 10.53 7.805e-ll 5-23 
PR00081C 15.13 1.771e-09 127- 


1 1685 


BL00983 


Ly-o / u-.rAiv aomam proiems. 


RT 00983C 12 69 6 885e-09 197- 
213 


M686 


BL00107 


Protein kinases ATP-binding region 
proteins. 


BL00107A 18.39 7.158e-18 678- 
709 


1686 


PR00653 


ACTIVIN TYPE II RECEPTOR 
SIGNATURE 


PR00653E 15.68 5.114e-16 871- 
891 PR00653D 13.25 7.195e-12 
697-719 


1686 


BL00790 


Receptor tyrosine kinase class Y 


BL00790N 13.25 7.505e-09 691- 
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proteins. 


718 


1686 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 8.031e-09 678- 
697 


1687 


BL00649 


G-protein coupled receptors family 2 
proteins. 


BL00649A 24.05 2.333e-22 85-113 


1687 


PR00491 


VASOACTIVE INTESTINAL 
PEPTIDE RECEPTOR SIGNATURE 


PR00491A 11.52 9.509e-10 109- 
121 


1688 


BL50002 


Src homology 3 (SH3) domain proteins 
profile. 


BL50002A 14.19 2.500e-18 154- 
173 


1688 


PR00401 


SH2 DOMAIN SIGNATURE 


PR00401A 14.00 5.765e-15 214- 
229 


1688 


BL50001 


Src homology 2 (SH2) domain proteins 
profile. 


BL50001A 10.81 6.250e-ll 154- 
167 


1688 


PR00452 


SH3 DOMAIN SIGNATURE 


PR00452B 11.65 5.765e-14 164- 
180 PR00452A 10.48 8.500e-12 
150-161 PR00452C 10.78 l.OOOe- 
10 181-191 


1688 


PR00499 


NEUTROPHIL CYTOSOL FACTOR 2 
SIGNATURE 


PR00499D 10.18 7.279e-09 152- 
173 


1693 


DM01551 


kw OSTEOINDUCTIVE YOPM 
MEMBRANE OUTER. 


DM01551A 15.63 7.265e-09 559- 
589 


1693 


PR00019 

• 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 2.957e-10 614- 
628 PR00019B 11.36 4.600e-10 
527-541 PR00019B 11.36 1.720e- 
09 611-625 PR00019A 11.19 
3.000e-09 660-674 PR00019A 
ll.l9 7.667e-09 530-544 


1698 


PR00528 


GLUCOCORTICOID RECEPTOR 
SIGNATURE 


PR00528C 14.75 9.438e-09 210- 
231 


1705 


BL00733 


Ribosomal protein S26e proteins. 


BL00733B 12.04 4.661e-25 165- 
198 


1706 


PD00078 


REPEAT PROTEIN ANK NUCLEAR 
ANKYR. 


PD00078B 13.14 6.500e-ll 412- 
425 


1706 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019B 11.36 7.000e-ll 773- 
787 PR00019A 11.19 7.000e-09 
776-790 


1706 


PF00791 


Domain present in ZO-1 and Unc5-like 
netrin receptors. 


PF00791B 28.49 6.242e-13 386- 
441 PF00791C 20.98 7.088e-09 
436-475 PF00791B 28.49 7.936e- 

j* am mm* mm -4 /~\ 

09 455-510 


1706 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 1.000e-ll 419- 
435 PF00023B 14.20 9.591e-09 
415-425 


1708 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 3.000e-13 251-264 
PD00066 13.92 3.571e-12 223-236 


1708 


PR00498 


NEUTROPHIL CYTOSOL FACTOR 1 
SIGNATURE 


PR00498A 12.01 5.765e-09 353- 
362 


1708 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048B 6.02 1.563e-10 220-230 
PK.U0U4oA 1U.DZ z.jODe-iU zoz- 
246 PR00048A 10.52 5.320e-09 
50-64 PR00048A 10.52 8.200e-09 
204-218 


1708 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 5.154e-ll 235-252 
BL00028 16.07 3.400e-10 207-224 
BL00028 16.07 9.743e-09 53-70 


1711 


PF00550 


Phosphopantetheine attachment site 


PF00550C 13.05 7.188e-10 235- 
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proteins. 


252 


1712 


BL00484 


Thyroglobulin type-1 repeat proteins 
proteins. 


BL00484B 9.04 4.400e-12 289-303 


1712 


BL00222 


Insulin-like growth factor binding 
proteins. 


BL00222B 11.09 7.300e-17 65-81 
BL00222A 11.34 9.438e-ll 45-56 


1715 


PF00075 


RNase H. 


PF00075C 11.58 6.786e-13 60-72 


1717 


BL00477 


Alpha-2-macroglobulin family 
thiolester region proteins. 


BL00477G 19.43 3.739e-34 984- 
1016 BL00477A 13.50 6.000e-32 
174-203 BL00477J 19.04 9.625e- 
31 1301-1332 BL00477F 17.34 
1.931e-30 840-870 BL00477L 
23.51 1.964e-30 1498-1531 
BL00477K 17.42 6.400e-24 1443- 
1467 BL00477E 17.53 7.000e-22 
810-831 BL004771 18.76 6.750e- 
20 1088-1115 BL00477C 15.70 
6.667e-17 291-308 BL00477B9.05 
7.207e-15 264-277 BL00477H 9.07 
1.333e-14 1057-1069 BL00477D 
12.73 2.059e-14 784-794 


1717 


BL01177 


Anaphylatoxin domain proteins. 


BL01 177E 20.64 9.438e-12 845- 
872 


1718 


BL00232 


Cadherins extracellular repeat proteins 
domain proteins. 


BL00232B 32.79 8.594e-35 151- 
199 BL00232B 32.79 5.579e-22 
417-465 BL00232A 27.72 l.OOOe- 
20 57-90 BL00232C 10.65 3.613e- 
14 415-433 BL00232B 32.79 
4.872e-ll 534-582 


1718 


PR00205 


CADHERIN SIGNATURE 


PR00205B 11.39 4.545e-15 415- 
433 PR00205A 14.73 5.600e-09 
183-199 


1718 


BL00422 


Granins proteins. 


BL00422C 16.18 3.647e-09 226- 
254 BL00422C 16. 18 9.294e-09 
220-248 


1718 


PF01140 


Matrix protein (MA), pi 5. 


PF01140D 15.^4 2.256e-09 221- 
256 PF01140D 15.54 3.198e-09 
217-252 PF01140D 15.54 9.79 le- 
09 220-255 


1722 


BL00086 


Cytochrome P450 cysteine heme-iron 
ligand proteins. 


BL00086 20.87 8.714e-20 469-501 


1722 


PR00463 


E-CLASS P450 GROUP I 
SIGNATURE 


PR00463H 12.41 5.821e-10 469- 
480 


1722 


PR00408 


MITOCHONDRIAL P450 
SIGNATURE 


PR00408F 11.33 6.000e-10 388- 
407 


1722 


PR00464 


E-CLASS P450 GROUP II 
SIGNATURE 


PR004641 14.64 1.771e-13 479-503 
PR00464E 18.28 9.617e-ll 389- 
410 PR00464C 18.84 1.000e-10 
330-359 PR00464H13.32 3.596e- 
09 46o-4oU 


1722 


PR00465 


E-CLASS P450 GROUP IV 
SIGNATURE 


PR00465C 21.23 5.125e-25 332- 
359 PR00465H 17.76 5.500e-22 
479-498 PR00465G 13.06 1.281e- 
17 463-480 PR00465D 14.64 
2.607e-17 390-407 PR00465F 
13.37 6.000e-15 439-458 
PR00465E 15.03 9.217e-15 423- 
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438 PR00465B 16.84 4.908e-09 
113-137 


1722 


PR00385 


P450 SUPERFAMILY SIGNATURE 


PR00385D 13.11 1.667e-09 470- 
480 PR00385C 16.94 4.682e-09 
395-407 PR00385A 14.97 5.865e- 
09 341-359 PR00385E 12.66 
7.000e-09 479-491 


1725 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DM00179 13.97 7.158e-10 107-117 


1727 


PR00220 


SYNAPTOPHYSIN/SYNAPTOPORIN 
FAMILY SIGNATURE 


PR00220B 15.48 2.800e~31 47-73 
PR00220C 11.05 9.000e-31 100- 
125 PR00220D 8.32 3.842e-30 
132-156 PR00220A 10.93 1.857e- 
28 23-46 PR00220E 3.46 4.273e- 
24 199-218 


1727 


BL00604 


Synaptophysin / synaptoporin proteins. 


BL00604A 9.13 1.000e-40 26-81 
BL00604C 14.66 1.000e-40 117- 
149 BL00604D 12.28 1.000e-40 
149-184 BL00604E8.32 1.000e-40 
184-226 BL00604F 5.96 1.000e-40 
231-276 BL00604B 9.95 2.895e-37 
87-117 BL00604F5.96 1.122e-09 
263-308 


1727 


PR00761 


BINDIN PRECURSOR SIGNATURE 


PR00761A 5.81 3.662e-09 287-304 


1727 


BL00291 


Prion protein. 


BL00291A 4.49 8.138e-09 236-271 


1728 


BL00795 


Involucrin proteins. 


BL00795C 17.06 6.700e-09 159- 
204 


1729 


PR00747 


GLYCOSYL HYDROLASE FAMILY 
47 SIGNATURE 


PR00747E 15.13 8.269e-18 225- 
243 PR00747H 12.76 5.610e-17 
326-347 PR00747C 12.06 4.797e- 
11 145-164 


1730 


PR00209 


ALPHA/BETA GLIADIN FAMILY 
SIGNATURE 


PR00209C 4.56 3.893e-09 77-91 


1731 


PR00244 


NEUROKININ RECEPTOR 
SIGNATURE 


PR00244H 13.12 9.357e-26 294- 
313 PR00244G 11.10 2.286e-15 
268-280 PR00244F 10.47 3.132e- 
15 204-218 PR00244A 10.34 
3.368e-15 29-41 PR00244B 12.45 
1.000e-13 55-65 PR00244D 6.84 
L964e-13 125-136 PR00244C 
13.44 3.118e-12 108-119 
PR00244E 5.93 8.875e-12 183-193 


1731 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237G 19.63 9.053c-19 275- 
302 PR00237F 13.57 5.800e-18 
234-259 PR00237B 13.50 9.400e- 
17 68-90 PR00237A 11.48 5.765e- 
14 35-60 PR00237D 8.94 2.350e- 
11 133-155 PR00237E 13.03 
L600e-10 184-208 


1731 


BL00237 


G-protein coupled receptors proteins. 


BL00237C 13.19 2.688e-19 229- 
256 BL00237D 11.23 4.429e-13 
285-302 BL00237B 5.28 5.909e-09 
192-204 


1732 


PD02870 


RECEPTOR INTERLEUKIN- 1 
PRECURSOR. 


PD02870D 15.74 8.755e-09 223- 
258 


1733 


BL00523 


Sulfatases proteins. 


BL00523A 13.36 8.800e-13 44-61 
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1734 


PR00464 


E-CLASS P450 GROUP II 
SIGNATURE 


PR00464A 20.47 9.719e-13 167- 
188 


1735 


BL00414 


Profilin proteins. 


BL00414D 15.59 5.909e-10 1037- 
1064 


1735 


PR00392 


PROFILIN SIGNATURE 


PR00392C 18.98 5.592e-10 986- 
1007 PR00392D 12.00 7.652e-09 
1008-1023 


1738 


PR00003 


4-DISULPHIDE CORE SIGNATURE 


PR00003C 7.69 9.357e-09 73-83 


1741 


PR00915 


LUTEOVIRUS GROUP 1 COAT 
PROTEIN SIGNATURE 


PR00915G 15.24 6.875e-10 303- 
325 


1745 


PR00837 


ALLERGEN V5/TPX-1 FAMILY 
SIGNATURE 


PR00837C 17.21 6.143e-20 126- 
143 PR00837A 14.77 1.973e-13 
57-76 


1745 


BL01009 


Extracellular proteins SCP/Tpx- 
1/Ag5/PR-1/Sc7 proteins. 


BL01009D 14.19 4.300e-20 127- 
148 BL01009A 13.75 6.586e-13 
57-75 


1745 


PR00838 


VENOM ALLERGEN 5 SIGNATURE 


PR00838G 16.07 2.033e-17 125- 
145 PR00838D 8.73 4.214e-09 57- 
76 


1748 


PR00920 


SPUMAVIRUS ASPARTIC 
PROTEASE (A9) SIGNATURE 


PR00920C 13.24 7.310e-09 212- 
234 


1749 


PR00672 


INHIBIN BETA C CHAIN 
SIGNATURE 


PR00672D 10.52 6.400e-09 96-113 


1749 


PD01572 


PHOTOSYSTEM II REACTION 
CENTRE T PROTEIN PHOTOS. 


PD01572 8.77 6.917e-09 544-574 


1751 


BL00713 


Sodium: dicarboxylate symporter family 
proteins. 


BL00713A 20.35 2.227e-35 314- 
356 BL00713B 13.04 8.000e-22 
375-395 


1751 


PR00173 


GLUTAMATE-ASPARTATE 
SYMPORTER SIGNATURE 


PR00173C 13.89 7.353e-25 327- 
353 PR00173B 9.62 9.053e-13 19- 
40 


1752 


PR00219 


SYNAPTOBREVIN SIGNATURE 


PR00219C 9.04 7.750e-09 1-21 


1753 


PR00379 


INTEIN SIGNATURE 


PR00379A 16.17 9.780e-09 393- 
407 


1757 


PF00783 


Inositol polyphosphate phosphatase, 
catalytic domain proteins homologu. 


PF00783A 11.91 3.400e-14 539- 
549 


1758 


PR00169 


POTASSIUM CHANNEL 
SIGNATURE 


PR00169B 14.43 2.800e-30 239- 
268 PR00169C 16.31 3.109e-ll 
307-331 


1762 


BL00456 


Sodiumrsolute symporter family 
proteins. 


BL00456A 22.59 1.000e-40 16-71 
BL00456C 24,55 1.000e-40 154- 
209 BL00456B 18.94 7.158e-28 
92-122 BL00456D 6.92 2.862e-09 
450-460 


1763 


DM01253 


BNR MOTIF REPEAT. 


DM01253A 11.72 7.785e-09 454- 
470 


1763 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 2.200e-12 86-100 
PR00019B 11.36 6.400e-10 83-97 
PR00019A 11,19 2.333e-09 278- 
292 PR00019B 11.36 4.960e-09 
275-289 PR00019A 11.19 9.333e- 
09 62-76 


1764 


PD01719 


PRECURSOR GLYCOPROTEIN 
SIGNAL RE. 


PD01719A 12.89 5.714e-14 421- 
449 


1765 


PR00824 


HEPATIC LIPASE SIGNATURE 


PR00824A 7.81 7.214e-22 6-25 


1768 


BL00979 


G-protein coupled receptors family 3 


BL00979A 19.66 L000e-40 77-125 
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proteins. 




1769 


PF00168 


C2 domain proteins. 


PF00168C 27.49 4.000e-12 410- 
436 


1769 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 5.345e-12 292- 
306 PR00360B 13.61 1.000e-ll 
427-441 PR00360A 14.59 2.895e- 
09 398-41 1 


1769 


PR00399 


SYNAPTOTAGMIN SIGNATURE 


PR00399B 14.27 8.425e-12 385- 
399 PR00399D 14.48 7.796e-09 
328-339 


1771 


BL00312 


Glycophorin A proteins. 


BL00312B 9.22 5.781e-10 717-746 


1771 


BL00446 


RNA polymerases D / 30 to 40 Kd 
subunits proteins. 


BL00446C 18.90 9.087e-09 1326- 
1353 


1771 


PR00122 


VACUOLAR ATP SYNTHASE 16 KD 
SUBUNIT SIGNATURE 


PR00122D 9.97 9.330e-09 716-740 


1771 


PR00205 


CADHERIN SIGNATURE 


PR00205B 11.39 5.745e-10 225- 
243 PR00205B 11,39 4.966e-09 
333-351 PR00205B 11.39 9.390e- 
09 441-459 


1771 


BL00232 


Cadherins extracellular repeat proteins 
domain proteins. 


BL00232B 32.79 9.308e-18 227- 
275 BL00232B 32.79 9.206e-17 
443-491 BL00232B 32.79 3.407e- 
11 118-166 BL00232C 10.65 
4. 115e-10 225-243 BL00232C 
10.65 4.462e-10 546-564 
BL00232C 10.65 7.404e-10 333- 
351 BL00232C 10.65 9.842e-09 
441-459 


1773 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354Z 9.06 1.000e-16 102-151 


1774 


PR00259 


TRANSMEMBRANE FOUR FAMILY 
SIGNATURE 


PR00259C 16.40 5.313e-17 513- 
542 PR00259B 14.81 4.000e-14 
486-513 PR00259D 13.50 2.340e- 
13 676-703 


1774 


BL00421 


Transmembrane 4 family proteins. 


BL00421B 17.62 1.000e-27 492- 
531 BL00421E 20.97 6.21 le-13 
673-703 


1775 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245C 7.84 8.258e-17 319-335 
PR00245A 18.03 9.265 e- 16 52-74 
PR00245B 10.38 6.143e-14 258- 
273 


1775 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 5.974e-12 83-123 
BL00237A 27.68 5.974e-12 171- 
211 


1775 


PR00534 


MELANOCORTIN RECEPTOR 
FAMILY SIGNATURE 


PR00534A 11.49 6. 1 23 e- 10 44-57 


1775 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237E 13.03 9.333e-ll 280- 
304 PR00237A 11.48 5.935e-09 

in a A 

19-44 


1778 


BL00417 


Synaptobrevin proteins. 


BL00417B 18.48 2.414e-19 141- 
195 BL00417A7.74 6.704e-13 
113-141 


1778 


PR00219 


SYNAPTOBREVIN SIGNATURE 


PR00219A 8.98 9.156e-13 118-138 


1780 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290B 13.17 2.385e-15 255- 
273 BL00290A 20.89 1.529e-14 
34-57 BL00290A 20.89 6.684e-13 
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198-221 


1782 


BL00211 


ABC transporters family proteins. 


BL00211B 13.37 2.385e-17 558- 
590 


1782 


BL00674 


AAA-protein family proteins. 


BL00674B 4.46 5.622e-09 448-470 


1782 


PR00671 


INHIBIN BETA B CHAIN 
SIGNATURE 


PR00671B 4.29 9.408e-09 396-416 


1782 


PR00300 


ATP-DEPENDENT CLP PROTEASE 
ATP-BINDING SUBUNIT 
SIGNATURE 


PR00300A 9.56 9.649e-09 451-470 


1783 


PR00176 


SODIUM/NEUROTRANSMITTER 
SYMPORTER SIGNATURE 


PR00176C 10.84 8.615e-25 283- 
310 PR00176A 16.82 5.909e-24 
211-233 PR00176B7.31 4.600e-23 
240-260 PR00176D 9.02 6.143e-21 
412-430 PR00176F 10.73 L333e- 
20 548-568 PR00176E 11.41 
2.227e-20 494-515 PR00176G 
12.48 5,034e-17 630-651 
PR00176H 15.27 7.429e-17 670- 
691 


1783 


BL00610 


Sodium: neurotransmitter symporter 
family proteins. 


BL00610A 17.73 1.000e-40 211- 
261 BL00610B 23.65 1.000e-40 
274-324 BL00610D 20.97 l.OOOe- 
40 451-504 BL00610F 29.02 
1.000e-40 641-696 BL00610E 
20.34 9.100e-37 544-587 
BL00610G 12.89 6.087e-22 700- 
723 BL00610C 12.94 4.282e-15 
385-437 


1784 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243I 31.77 7.592e-09 799-842 


1785 


BL00349 


CTF/NF-I proteins. 


BL00349A 10.07 1.000e-40 649- 
695 BL00349C9.33 1.000e-40 
723-766 BL00349E 10.79 9.833e- 
36 793-836 BL00349B 10.51 

I. 205e-34 695-723 BL00349D 

II. 70 1.000e-33 766-793 


1786 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 9.100e-ll 243- 
257 PR00019B 11.36 9.100e-10 
431_445 PR00019A 11.19 2.000e- 
09 434-448 PR00019A 11.19 
4.667e-09 482-496 PR00019B 
11.36 6.040e-09 643-657 
PR00019B 11.36 6.400e-09 597- 
611 PR00019B 11.36 9.640e-09 
479-493 


1789 


PR00702 


ACRIFLAVIN RESISTANCE 
PROTEIN FAMILY SIGNATURE 


PR00702G 9.63 4.214e-22 74-98 
PR00702F 12.14 1.474e- 19 49-73 
PR00702H 12.94 7.955e-17 153- 
171 PR007021 17. o5 7.o5/e-lo 
216-231 


1790 


DM00914 


BINDING-PROTEIN-DEPENDENT 
TRANSPORT SYSTEMS INNER 
MEMBRANE. 


DM00914C 21.08 8.457e-35 121- 
168 DM00914B 22,92 5.750e-30 
45-86 


1790 


PR00797 


STREPTOPAIN (C10) CYSTEINE 
PROTEASE FAMILY SIGNATURE 


PR00797F 12.40 9.804e-09 41-63 


1791 


BL00942 


glpT family of transporters proteins. 


BL00942E 21.14 9.700e-40 104- 
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145 BL00942D 18.76 1.273e~31 
55-92 BL00942C 14.04 2.688e-21 
13-33 BL00942F 15.07 7.632e-21 
162-180 


1792 


BL00523 


Sulfatases proteins. 


BL00523C 12.64 5.800e-12 593- 
604 


1796 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 7.261e-10 315- 
329 PR00019B 11.36 4.600e-09 
312-326 


1798 


PR00303 


PREPROTEIN TRANSLOCASE 
SECY SUBUNIT SIGNATURE 


PR00303E 14.69 1.429e-22 98-122 
PR00303D 16.15 8.650e-22 72-98 
PR00303C 15.93 9.036e-19 34-58 


1798 


BL00755 


Protein secY proteins. 


BL00755C 14.21 2.500e-18 89-105 


1798 


PR00825 


VESPID VENOM ALLERGEN 
PHOSPHOLIPASE Al SIGNATURE 


PR00825B 14.81 9.738e-09 234- 
255 


1800 


PR00081 


GLUCOSE/RIBITOL 
DEHYDROGENASE FAMILY 
SIGNATURE 


PR00081A 10.53 1.439e-ll 179- 
197 


1800 


PR00669 


INHIBIN ALPHA CHAIN 
SIGNATURE 


PR00669B 8.27 2.976e-09 80-97 


1801 


BL01022 


PTR2 family proton/oligopeptide 
symporters proteins. 


BL01022E 23.51 1.173e-12 474- 
510 BL01022D9.42 3.455e-ll 12- 
25 


1802 


BL01187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01187B 12.04 2.350e-14 489- 
505 


1802 


PR00792 


PEPSIN (Al) ASPARTIC PROTEASE 
FAMILY SIGNATURE 


PR00792B 12.78 7.750e-14 331- 
345 PR00792C9.10 1.000e-12 
380-392 


1802 


BL00141 


Eukaryotic and viral aspartyl proteases 
proteins. 


BL00141C 9.74 1.000e-ll 336-346 
BL00141D 6.28 3.700e-ll 380-390 


1802 


BL01177 


Anaphylatoxin domain proteins. 


BL01177C 17.39 8.780e-10 483- 
502 


1804 


PR00165 


ANION EXCHANGER SIGNATURE 


PR00165F 10.39 9.667e-12 32-51 


1804 


BL00219 


Anion exchangers family proteins. 


BL00219L 18.71 1.000e-40 126- 
165 BL00219N 10.66 6.164e-31 
194-238 BL00219M9.98 3.100e- 
17 148-194 BL00219P 19.59 
6.123e-13 247-274 BL00219H 
10.06 4.555e-ll 11-59 BL00219K 
12.73 8.780e-10 84-126 


1805 


BL00972 


Ubiquitin carboxyl-terminal hydrolases 
family 2 proteins. 


BL00972A 11.93 3.118e-16 786- 
804 BL00972D 22.55 1.500e-14 
1281-1306 BL00972E 20.72 
6.850e-13 1308-1330 BL00972B 
9.45 7.923e-10 882-892 


1810 


BL00216 


Sugar transport proteins. 


BL00216B 27.64 8.531e-13 736- 
786 


1810 


DM00973 


3 kw RESISTANCE BENOMYL 
YLL028W CYCLOHEXIMIDE. 


DM00973A 21.17 1.391e-l 1 708- 
745 


1813 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDL 


PD00066 13.92 9.308e-15 288-301 


1813 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048B 6.02 7.188e-10 285-295 
PR00048A 10.52 8.043e-10 269- 
283 PR00048A 10.52 3.880e-09 
297-3 1 1 


1813 


BL00028 


Zinc finger, C2H2 type, domain 


BL00028 16.07 1.692e-ll 515-532 
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proteins. 


BL00028 16.07 4.000e-10 487-504 
BL00028 16.07 1.257e-09 773-790 
BL00028 16.07 5.371e-09 272-289 


1813 


PF01059 


NADH-ubiquinone oxidoreductase 
chain 4, amino terminus. 


PF01059A 13.84 6.192e-09 19-53 


1815 


BL00134 


Serine proteases, trypsin family, 
histidine proteins. 


BL00134B 15.99 8.560e-17 316- 
340 BL00134A 11.96 4.214e-16 
53-70 


1815 


BL00495 


Apple domain proteins. 


BL00495M 8.50 5.071e-16 226-261 
BL00495N 11.04 4.438e-15 308- 
343 


1815 


PR00722 


CHYMOTRYPSIN SERINE 
PROTEASE FAMILY (SI) 
SIGNATURE 


PR00722A 12.27 4.414e-14 54-70 
PR00722C 10.87 9.47 le- 13 315- 
328 


1815 


BL00021 


Kringle domain proteins. 


BL00021B 13.33 6.580e-16 53-71 
BL00021C 22.21 2.125e-10 235- 
257 


1815 


BL01253 


Type I fibronectm domain proteins. 


BL01253G 11.34 4.316e-12 315- 
329 BL01253E 16.01 9.899e-ll 
224-261 BL01253D4.84 5.920e-09 
53-67 


1815 


PR00839 


V8 SERINE PROTEASE FAMILY 
SIGNATURE 


PR00839B 11.20 6.425e-09 53-71 


1816 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354H 18.00 5.618e-12 134- 
174 


1817 


BL00859 


GTP cyclohydrolase I proteins. 


BL00859B 13.15 9.057e-09 72-113 


1819 


PR00700 


PROTEIN TYROSINE 
PHOSPHATASE SIGNATURE 


PR00700D 12.47 2.421e-20 1431- 
1450 PR00700C 13.17 5.765e-17 
1390-1408 PR00700F 11.18 
7.429e-12 1478-1489 


1819 


PR00213 


MYELIN P0 PROTEIN SIGNATURE 


PR00213E 5.51 1.656e-09 1186- 
1211 


1819 


BL00383 


Tyrosine specific protein phosphatases 
proteins. 


BL00383E 10.35 1.000e-14 1434- 
1445 BL00383D 11.92 1.692e-14 
1394-1407 BL00383A 13.34 
6.850e-14 1280-1295 BL00383F 
15.51 4.240e-13 1472-1488 
BL00383C 10.10 6.625e-10 1318- 
1329 BL00383B 7.61 4.000e-09 
1305-1314 


1819 


PR00014 


FIBRONECTIN TYPE III REPEAT 
SIGNATURE 


PR00014C 15.44 1.450e-10 688- 
707 PR00014C 15.44 6.400e-10 
866-885 PR00014C 15.44 2.565e- 
09 510-529 PR00014C 15.44 
5.69oe-09 1056-1075 


1821 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245C 7.84 4.073e-15 361-377 
PR00245E 12.40 8.286e-12 414- 
429 


1821 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237G 19.63 5.814e-09 395- 
422 


1822 


PF00242 


DNA polymerase (viral) N-terminal 
domain proteins. 


PF00242F 12.18 8.522e-09 322-344 



* Results include in order: accession number subtype; raw score; p-value; position of signature in 
amino acid sequence 
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912 


IPB001398A 


Macrophage migration inhibitory factor 
family 


23.60 6.73e-29 124-163 


912 


IPB001398B 


Macrophage migration inhibitory factor 
family 


19.16 2.06e-20 164-190 


912 


IPB001398C 


Macrophage migration inhibitory factor 
family 


26.46 6.96e-ll 192-238 


913 


IPB000033A 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


21.82 4.35e-09 514-536 


913 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 4.48e-l 0 817-827 


913 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 7.10e-10 574-584 


913 


IPB000118B 


Granulin 


7.94 9.20e-09 472-510 


913 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 1.96e- 11 484-499 


913 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 3.84e-14 696-711 


913 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 3.86e-13 569-584 


913 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 6.54e-17 812-827 


913 


IPB000561 


EGF-hke domain 


4.89 3.57e-09 821-829 


913 


IPB001862A 


Membrane attack complex 
components/perfonn/complement C9 


12.54 8.02e-10 175-190 


913 


IPB001881B 


Calcium-bmdmg EGF-hke domain 


12.28 4.79e-ll 696-707 


913 


IPB001881B 


Calcium-binding EGF-hke domain 


12.28 5.20e-12 812-823 


913 


IPB001881B 


Calcium-binding EGF-hke domain 


12.28 8.58e-ll 527-538 


913 


IPB001881B 


Calcium-binding EGF-hke domain 


12.2S 8.71e-10 569-580 


913 


IPB001881B 


Calcium-bmdmg EGF-hke domain 


12.28 9.53e-ll 484-495 


913 


IPB002861B 


Reeler domain 


10.50 4.48e-09 124-152 


913 


IPB002861B 


Reeler domain 


10.50 8.98e-09 181-209 


913 


IPB002861C 


Reeler domain 


23.17 6.48e-09 111-165 


913 


IPB003367A 


Thrombospondm type 3 repeat 


11.78 5.13e-09 782-802 


913 


IPB003367A 


Thrombospondm type 3 repeat 


11.78 5.83e-10 577-597 


913 


IPB003S86D 


Extracellular domain in mdogen 


13.91 3.49e-09 569-588 


913 


IPB003886D 


n . ill j • ■ . j 

Extracellular domain in mdogen 


13.91 4.78e-13 696-715 


913 1 


IPB003886D 


T—l j "tit 1 • • • 1 

Extracellular domain m mdogen 


13.91 9.41e-10 812-831 


913 


PR00010C 


Type II EGF-like signature III 


6.98 3.63e-09 574-584 


913 


PR00010C 


Type II EGF-hke signature III 


6.98 8.02e-09 701-711 


913 


PR00010C 


Type II EGF-like signature III 


6.9S 9.47e-13 817-827 


913 


PR00907B 


Thrombomodulin signature II 


11.50 9.43e-09 808-824 


913 


PR00907G 


Thrombomodulin signature VII 


10.43 8.09e-10 812-838 


913 


PR00907G 


Thrombomodulin signature VII 


10.43 8. 85e- 10 696-722 


913 


PR00907G 


Thrombomodulin signature VII 


10.43 9.27e-09 569-595 


913 


PR01303D 


Plasmodium circumsporozoite protein 
signature IV 


10.57 2.00e-09 187-204 


914 


IPB001398A 


Macrophage migration inhibitory factor 
family 


23.60 9.76e-26 2-41 


914 


IPB001398B 


Macrophage migration inhibitory factor 
family 


19.16 6.31e-18 42-68 


914 


IPB001398C 


Macrophage migration inhibitory factor 
family 


26.46 5. 14e-12 70-116 


915 


IPB001398A 


Macrophage migration inhibitory factor 
family 


23.60 3.25e-20 2-41 
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915 


IPB001398B 


Macrophage migration inhibitory factor 
family 


19.16 2.06e-20 55-81 


915 


IPB001398C 


Macrophage migration inhibitory factor 
family 


26.46 7.39e-36 83-129 


916 


IPB000001B 


Kringle 


18.04 8. 09e-19 286-310 


916 


IPB000001C 


Kringle 


20.41 2.23e-22 331-351 


916 


IPB000001D 


Kringle 


11.31 3. 61e-17 469-485 


916 


IPB000001E 


Kringle 


16.02 8.58e-13 544-557 


916 


IPB000001F 


Kringle 


10.14 7.65e-ll 567-577 


916 


IPB000001G 


Kringle 


29.29 2.80e-16 588-616 


916 


IPB000001H 


Kringle 


12.24 3.45e-13 628-638 


916 


IPB000177M 


Apple domain 


9.18 4.82e-10 545-579 


916 


IPB000177N 


Apple domain 


10.17 3.25e-20 618-652 


916 


IPB000177O 


Apple domain 


14.39 6.13e-17 653-681 


916 


IPB000561 


EGF-like domain 


4.89 4.21e-09 267-275 


916 


IPB000562 


Type II fibronectin collagen-binding 
domain 


10.39 9.05e-20 132-148 


916 


IPB001254A 


Serine proteases, trypsin family 


9.98 6.09e-14 469-485 


916 


IPB001254B 


Serine proteases, trypsin family 


15.01 3. 50e-17 626-649 


916 


IPB001254C 


Serine proteases, trypsin family 


16.54 1.00e-18 656-675 


916 


IPB001774D 


Delta serrate ligand 


19.23 5.82e-09 152-198 


916 


IPB001881A 


Calcium-binding EGF-like domain 


8.72 1.00e-08 245-254 


916 


IPB001881A 


Calcium-binding EGF-like domain 


8.72 8.20e-09 164-173 


916 


PR00010A 


Type II EGF-like signature I 


12.91 6.54e-10 160-171 


916 


PR00011B 


Type III EGF-like signature II 


-d -n- a, j-— », ^ > yK y\ y— \ 4 *»^r y^\ ^ (• f 

13.08 6.42e-09 179-197 


916 


PR00011B 


Tvpe III EGF-like signature II 


M y^ y^ y-v y^ y~i \ y^\ ^* y^V 

13.08 9.38e-09 260-278 


916 


PR00013A 


Fibronectin type II repeat signature I 


12.70 4.60e-10 105-114 


916 


PR00013B 


Fibronectin type II repeat signature II 


11.78 5.25e-10 116-128 


916 


PR00013C 


Fibronectin type II repeat signature III 


12.27 9.47e-19 132-147 


916 


PR00018A 


Kringle domain signature I 


12.23 8.84e-12 286-301 


916 


PR00018B 


Kringle domain signature II 


16.62 1.75e-09 302-314 


916 


PR00018C 


Kringle domain signature III 


16.77 7.00e-21 331-351 


916 


PR00018D 


Kringle domain signature IV 


12.83 9.04e-10 356-367 


916 


PR00722A 


Chymotrypsin serine protease family (SI) 
signature I 


12.06 9.36e-14 470-485 


916 


PR00722C 


Chymotrypsin serine protease family (SI) 
signature III 


10.74 8.62e-14 625-637 


916 


PR01236A 


Tumour necrosis factor beta (lymphotoxin- 
alpha) signature I 


4.92 8.22e-09 22-38 


917 


IPB000034A 


Laminin B 


y^. m a h y^ -r^ /— v -| y^i .4 -4 4 4 

22.21 3.69e-10 419-454 


917 


IPB000034A 


Laminin B 


22.21 7.22e-13 1046-1081 


917 


IPB000034A 


Laminin B 


22.21 9.42e-10 291-326 


917 


IPB000034B 


Laminin B 


21.22 8,38e-ll 808-846 


917 


IPB000034B 


Laminin B 


2 1 .22 9.07e- 1 9 443-48 1 


917 


IPB000034C 


Laminin B 


r-. y^ yj y^ ^ y—i j| y^ ftm f\ 

VZ.91 1.12e-10 542-560 


917 


IPB000034C 


Laminin B 


12.97 2.29e-09 805-823 


917 


IPB000034C 


Laminin B 


12.97 3. 10e-10 492-510 


917 


IPB000034C 


Laminin B 


12.97 5.09e-09 897-915 


917 


IPB000034C 


Laminin B 


12.97 5.29e-12 1067-1085 


917 


IPB000034C 


Laminin B 


12.97 8. 17e-14 955-973 


917 


IPB000034C 


Laminin B 


12.97 8.43e-ll 853-871 


917 


IPB000034C 


Laminin B 


12.97 8.62e-12 1114-1132 


917 


IPB001134C 


Netrin, C-terminus 


17.82 9.77e-09 1014-1028 


917 


IPB001774C 


Delta serrate ligand 


18.25 9.27e-09 1105-1147 
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1 WW _r^ ____ *_i *_i 1 4mtm THb 

Kesults* 


917 1 


IPBOOI886A 


; : — \ . 

Laminin N-terminal (Domain VI) 


53 ^ 1 Q AAe* 1 C\ 1 1 Q 1 Ofk 


917 


IPB001886B 


Laminin N-terminal (Domain VI) 


1 K 1f\ A 531 «_ OA 9£.Q 9Q1 

1 j.zU 4.o LQ-Ah LOy-Ay 1 


_fV -1 ^— t 

917 


IPBOOI886C 


Laminin N-terminal (Domain VI) 


Z4 -j^r z. i^.e- 1 -..j / /-^fio 


917 


tt%tn a A 1 _f_ O V" _f_ 

IPB001886C 


Laminin N-terminal (Domain VI) 


ia ka £ ni 0 11 1 nno 1 has 


917 1 


IPB001886C 


Laminin N-terminal (Domain VI) 




917 


IPBOOI886C 


Laminin N-terminal (Domain VI) 


7/t </1 7 f\Ao 1 1 Q^.^ QQA 

/.U4e-ii yDj-yy^. 


917 


IPBOOI886C 


Lammin N-terminal (Domain VI) 


O/t Q / .lo 17 AAH A7Q 
Z4.34 5.U-_Je-lZ 


917 1 


IPBOOI886C 


Laminin N-terminal (Domain VI) 


7/1 Q 41o 17 11 _ 1^1 
Z4.D4 o.4ie-JZ jlZ-jjl 


917 


IPB001886C 


Laminin N-terminal (Domain VI) 




917 


TTIT^AAI OO^Tt 
IPB001886D 


Laminin N-terminal (Domain VI) 


17 ci 0 IQo 11 SQ1 01 1 

iz.ji o.-jye-i^ 07J-71 1 


917 


TTin AA1 0 O T". 
IPBOOI880D 


Laminin N-terminal (Domain VI) 


10 ci c Q&o no ins 19*. 


917 


IPB001S86D 


Laminin N -terminal (Domain VI) 


T> ci 7 Trip 1 A 1 f.PK 1 091 


917 


IPB001886D 


Laminin N-terminal (Domain VI) 


n ci Q f,At> OO 171 1Q1 

iz.ji y.O'.e-zu 0 /j-j? 1 


917 


TTt T_ f\ f\ 1 OO _f T"' 

IPB001886E 


Laminin N-terminal (Domain VI) 


1/. on 1 1 1 A1C A^A 


917 


IPB001886E 


Laminin N-terminal (Domain VI) 


L\).y\J L.Z\)&- lUDJ"l-Ol 


917 


T"r»T_ /\ _T\ 1 oo^n 

IPB001886E 


Laminin N-terminal (Domain VI) 


If. Qf\ 1 H9p 1 1 AQH-^Hf. 

lu.yu _5.uze-ii 4_7u-_»uo 


917 


TT»T»AA1 00/T7 

IPB001886E 


Laminin N-terminal (Domain VI) 


in on & nn^ 10 s^i s^7 
lu.yu o.uue-iz o_ 1-00/ 


917 


IPBOOI886E 


Laminin N-terminal (Domain VI) 


1 n on £ 1 /i «_ 1 c ^An ^^f. 


917 


IPBG01886E 


Lammin N -terminal (Domain VI) 


mon/^ii.- 11 1110-110S 


917 


IPB001886E 


_____ __ _ _ _____ T _* - _*_____■___. __ - — m 1 ' V _ — _ — _h 1 

Lammin N -terminal (Domain VI) 


inon7Ai»_ 11 nn lOr. 

lU.!?- 1 /.4_!>e-ll jIU-jZD 


917 


IPB001886E 


Laminin N-terminal (Domain VI) 


1 n on o n7p> 1 7 qoi_£1 Q 


917 


TAT4AA1 OO A 

IPB003884C 


Factor I membrane attack complex 


11 nn 1 1^ no A1 1 aoq 


917 


rnnAAi 00 A j — 1 

IPB003884C 


Factor I membrane attack complex 


11 nn 1 57/a no en 


917 


IPB003888L 


i—ir • 1 J * XT j - - 

FY-nch domain N-termmus 


01 1Q Q oo Q in 110 ICC 

zi. iy _._ze-iu 5z>y-Dc>j 


917 


PR00011A 


rp _ TTT T7 y^tTT 1 1*1 ______ _-___L- T 

Type III EGF-hke signature 1 


1 a nc 0 A7o 1 a /iq^ ' 

14. UO Z.UZe- 14 .oD-DU_» l 


917 


PR00011A 


Type III EGF-like signature I 


1 a nc 1 171 n^n 1 n7Q 
14. UD z. /oe-i/ lU0U-lU/<5 


917 


PROOOllA 


— n ttt "f/^iir? i • i _ _ _ ' a t 

Type III EGF-hke signature 1 


1 /I AC 11C... 11 /fll /!C1 

14. Uj d.djQ-il 4J3-4D1 


917 


PR00011A 


rp TTT ~f~* i — 1 T7 1*1 „ ' j „_ T 

Type III EGF-like signature 1 


1 /I /.C A Ala. 11 7QQ C 1 £ 

14. UD 4.4le-l-3 /yo-oio 


917 


PR0001 lA 


r_~"* TTT T_ /"^*T7 1*1^ * _ __. _i__ _____ T 

Type III EGF-like signature 1 


ia nc /I 70*_ 1 t< C1C cci 
14. UD 4. /Ze-lO jdj-jDj 


917 


PR00011A 


*TP ___. TTT T"? /'"'*T 7 "1 1 1 _, _^ * _ _^_ _ . a_ _„ _ T 

Type III EGF-like signature 1 


ia nc 7 aria 1C SA/^ S3».A 
14. UD /.oUe-iD O40-504 


917 


T\T» AAA1 1 A 

PROOOllA 


rr~i TTT T~? /^T"? 121 - — _ n _Z p. ■ . _^4*_ _*«_-*. X 

Type III EGF-like signature 1 


i/i nc 7 cn«_» no 101 


_TV 7 ^7 

917 


PROOOllA 


rpi TTT Tl /~^T7 1 * 1 ^ _ __ ! _____ , _- __ T 

Type III EGF-like signature 1 


1 /i nc c o/io 1^11 n7 1 1 o*\ 

14. UD o.Z4e-lO 1 1 U /- 1 l-SO j 


917 


PR00011B 


T>/pe ill EOF -like signature 11 


11 nc a i7p ia as*, ^ni 

Ij.Uo 4. 1 /e-14 4o3-JU-_> 


917 


T_T_ /"..n /"\ 1 in 

PROOOllB 


Type III EGF-like signature 11 


1 1 nc a 1^11 n7 110^ 

lJt.Uo 4._D3e-lJ 1HJ/-11ZJ 


1 ^7 

917 


PROOOllB 


HP _ TTT T"? /~> T? 1 _ 1 _ -_ __•_--»-__+-■„--_ TT 

Type III EGF-like signature 11 


11 no A 17 S3A£ Q/.A 
l_».Uo 4. _ 0e- 1 / -40-OO4 


917 


PROOOllB 


Type 111 EGF-like signature 11 


1 1 nQ .. A/.1- 1 O 7QC c 1 ^_ 


917 


PROOOllB 


np TTT 1. S~iY7 1 : 1 , „ * ___ _x_ ,-»-_. I T 

Type 111 EGF-like signature 11 


11 nc & o#.^ in ah a^i 
lo.uo o.yoe-iu 4j d-hd 1 


r\ 1 ^7 

917 


PROOOllB 


rp TTT T7/TF7 1 1 . T __ _ ' ,, » , , .. . , TT 

Type 111 EGF-like signature 11 


1 1 no 7 71o 1C CQC CCI 


917 


PROOOllB 


m _ TTT T7/^*T7 U 1 _ _ . . T , , ■ ... j . ■ .._-, TT 

Type III EGF-like signature 11 


11 no 0 nrio ia 1 ni<;n 1 n7C 

io.Uo o.UUe-14 lUOU-lU/o 


7 ^7 

917 


PR00011C 


rr-> TTT T7/^T_* 1-J1--. n.nnntm-a TTT 

Type III EGF-liKe signature ill 


7C 7Q 1 1 c~ HQ A^*. AR1 

zd. /y 1. 1 je-uy h-jj-^oj 


917 


PROOOllC 


T~i _ TTT l~> /""i T*? 1 _ 1 n«i-r_.r.4'iii4A TTT 

lype III EGF-like signature ill 


OC 7Q < 71o 1 n AAn Af.0 
ZD. /y O.Zie-lU 44U-40cS 


917 


PROOOllC 


rp TTT T? Z" 1 T7 1 * 1 _* -___,_.+. TTT 

Type III EGF-like signature ill 


OC 7Q O 97o 10 C17 CA^ 

zd. /y y.sze- iz _ 1 /-_4j 


917 


PR00011D 


Type III EGF-like signature IV 


12.12 1.00e-16 1107-1125 


917 


PR00011D 


m ttt T~* 1 _! 1 . _ " _,_- —_*., -.--._____ T~\ 7" 

Type III EGF-like signature IV 


lz.lz z.oze-14 o4o-oo4 


917 


PR00011D 


Type III EGF-like signature IV 


12.12 3.00e-13 535-553 


917 


TYn AAA1 1 T^X 

PR0001 ID 


T^e III EGF-like signature IV 


1010c f\r\t* 11 AC 4 ; ^ni 

1Z.1Z D.UUe-l_> 4oD-DU_» 


917 


PR00011D 


Type III EGF-like signature IV 


12.12 5.50e-14 1060-1078 


917 


PR00011D 


Type III EGF-like signature IV 


12.12 6.23e- 11 433-451 


917 


PROOOllD 


Type III EGF-like signature IV 


12.12 7.98e-12 798-816 


918 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 1.00e-09 304-314 


918 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 1.00e-10 132-142 
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918 


IPB000033B 


Low-density lipoprotein (Idl) receptor, 
YWTD repeat 


7.05 4. 1 9e- 1 0 344-354 


918 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


mrntm f\ mm* mm mmm) A i-s j— v f~*\ A S~% f~\ A 

7.05 5.74e-09 384-394 


918 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 7.97e-10 91-101 


918 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 3.12e-15 379-394 


918 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 3.37e-14 299-314 


918 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 3.45e-13 46-61 


918 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 5.76e-15 339-354 


918 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 7.55e-l3 127-142 


918 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 8.77e-13 86-101 


918 


IPB000561 


EGF-like domain 


4.89 5.50e-09 55-oi 


918 


IPB000859 


CUB domain 


19.99 7.67e-l/ /zl-/Do 


918 


IPB001881B 


Calcium-bmdmg EGF-like domain 


10 0 0 1 /I 11 IOT 1QO 

12. 2o 1.4/e-il i2/-loo 


918 


IPB001881B 


Calcium-bmding EGF-like domain 


1000 1 11 TOO O 1 A 

12. 2o 1.47e-ll zyy-olU 


918 


IPB001881B 


Calcium-binding EGF-like domain 


IO OO -C AAa 1/1 TTO Tflfl 

12.2o 5.uUe-i4 3 /y-oyu 


918 


IPB001881B 


Calcium-bmding EGF-like domain 


IO OO C O^^ 11 O/C f>7 


918 


IPB001881B 


Calcium-binding EGF-like domain 


1^ 00 0 „ 10 000 oca 
12.28 7.00e-13 33y-33U 


918 


IPB001881B 


Calcium-bmdmg EGF-like domain 


IOOO 0^71« 1 A /I ^ <0 


918 


_ _ i — mm Mmm w »w 

IPB002557B 


Chitin binding domain 


10 /T/l O AO« Art /TCI ££LC 


918 


IPB003367A 


Thrombospondm type 3 repeat 


11.78 7.62e-14 j 4- /4 


918 


IPB003886D 


Extracellular domain m mdogen 


IO A 1 1 C/l A 1 t A £Z /C< 

13. yi 1.54e-13 4o-OJ> 


918 


IPB003886D 


~- — « <• n i i ■ • • 1 

Extracellular domain in mdogen 


IO A 1 O 1 1 « 11 TTfl OCO 

13.91 2.13e-ll 339-335 


918 


IPB003886D 


n i i • * * i 

Extracellular domain m mdogen 


A 1 "~T _ AA T7A TrtO 

13.91 9.17e-09 379-398 


918 


PR00010C 


Type II EGF-like signature III 


6.98 4.48e-10 132-142 


918 


PR00010C 


mm* ~Y ~Y • « 1 | » 1*1 * 1 TTTT 

Type II EGF-like signature III 


r\o C £1 . AO 1 A A 11 /I 

6.98 5.61e-09 3U4-314 


918 


PR00010C 


Type II EGF-like signature III 


6.98 5.83e-09 384-394 


918 


PR00010C 


Type II EGF-like signature III 


6.98 6.05e-09 51-61 


918 


PR00010C 


Type II EGF-like signature III 


zT rtO O A^— AA A1 1 A1 

6.98 8.02e-09 91-101 


918 


PR00907B 


Thrombomodulin signature II 


11.50 9.16e-10 215-231 


918 


PR00907G 


Thrombomodulin signature VII 


10.43 6.84e-10 46-72 


918 


PR00907G 


Thrombomodulin signature VII 


10.43 7. 51e-10 379-405 


919 


IPB001415A 


Parathyroid hormone family 


18.60 2.50e-37 80-114 


919 


IPB001415B 


Parathyroid hormone family 


26.47 1.00e-40 133-181 


920 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 5.32e-ll 144-154 


920 ' 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 6.76e-ll 103-113 


920 


IPBO0O033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 8.92e-ll 17-27 


920 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 3.05e-13 139-154 


920 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 5.26e-14 12-27 


920 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 9.59e-13 98-1 13 


920 


IPB000561 


EGF-like domain 


4.89 1.00e-09 252-260 1 
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920 


IPB000561 


EGF-like domain 


A QQ 1 A^o AO £TI 

4.oy i.uue-uy o/-/ j 


920 


IPB001881B 


Calcium-binding EGF-like domain 


TO OO 1 Ala. 1A 10 T3. 


920 


IPB001881B 


Calcium-binding EGF-hke domain 


10 oo c si** 1A IIO l^A 
IZ.Zo D.D/e-iU iDy-LOV 


920 


IPB001881B 


Calcium-binding EGF-hke domain 


1 O OO *7 QfZa 1ft QQ 1 AO 


920 


PR00010C 


m T~r / ii < 1*1 " i TTT 

Type II EGF-hke signature III 


r no 1 aq i m 1 1 Q 


920 


PR00010C 


Type II EGF-hke signature III 


/c no n cq q no i AA 1 


920 


PR00011A 


Type III EGF-hke signature I 


14. VD O.DUe-Uy JJl-JH-y 


920 


PR00011A 


f ii TTT "I — 1 /TT™ 1 1*1 * 1 _ T 

Type III EGF-hke signature I 


14. U3 O.Uye-14 zoo-jUO 


920 


PR00011A 


» ■ < -t -fT T~l T™' "1*1 * j_ T 

Type III EGF-hke signature I 


I4.uz> /.yue-iz j /o-Jb'^ 


920 


PR00011B 


Type III EGF-hke signature II 


1^ AO 1 AA<= 1^ 000 ^A^k 


920 


PR00011B 


Type III EGF-like signature II 


13.08 3.36e- 12 376-394 


920 


PR00011B 


Type III EGF-hke signature II 


13.0b 3.5ye-0y 331-J4y 


920 


PR00011D 


Type III EGF-like signature IV 


12.12 2.50e-13 376-394 


920 


PR00011D 


Type III EGF-like signature IV 


12.12 2.8oe-12 288-306 


920 


PR00011D 


Type III EGF-like signature IV 


12.12 4.05e-ll 331-349 


920 


PR00764F 


Complement C9 signature VI 


15.74 5.10e-09 236-256 


920 


PR00907B 


Thrombomodulin signature II 


11.50 8.32e-10 54-70 


921 


IPB000827A 


Small cytokines (inter crine/chemokine), C- 
C subfamily 


8.72 5.24e-ll 35-45 


922 


IPB000834A 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (Ml 4) family 


18.14 1.77e-ll 167-188 


922 


IPB000834B 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (M14) family 


13.51 5.50e-14 201-215 


922 


IPB000834C 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (M14) family 


17.20 6.66e-12 270-286 


922 


IPB000834D 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (Ml 4) family 


18.95 3.25e-23 297-323 


922 


IPB000834E 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (M14) family 


9.80 3.33e-13 325-339 


922 


IPB00O834F 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (Ml 4) family 


12.40 9.18e-09 382-394 


922 


IPB000834G 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (M14) family 


14.46 6.00e-15 415-430 


922 


PR00765B 


Carboxypeptidase A metalloprotease (M14) 
family signature II 


14.48 8.04e-15 197-211 


922 


PR00765C 


Carboxypeptidase A metalloprotease (M14) 
family signature III 


10.88 5.50e-09 277-285 


922 


PR00765D 


Carboxypeptidase A metalloprotease (Ml 4) 
family signature IV 


14.06 4.91e-14 330-343 


923 


IPB002181A 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


18.44 7.00e-ll 64-80 


923 


IPB002181B 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


20.16 4.86e-32 93-129 


923 


IPB002181C 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


15.87 6.88e-13 134-146 


923 


IPB002181D 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


29.18 1.23e-18 153-193 


923 


IPB002181E 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


27.75 9.71e-23 205-237 


927 


IPB000774B 


Domain amino terminal to FKBP-type 
peptidyl-prolyl isomerase 


11.42 1.29e-12 54-81 


927 


IPB000774C 


Domain amino terminal to FKBP-type 
peptidyl-prolyl isomerase 


10.28 7.46e-19 84-113 


927 


IPB001179 


FKBP-type peptidyl-prolyl cis-trans 


22.74 4.60e-26 80-113 
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isomerase (PPIase) 




932 


IPB001359H 


Synapsin 


22.58 6.66e-10 176-226 


936 


PR00436A 


InterIeukin-8 signature I 


15.20 8.23e-09 3-26 


937 


IPB000308B 


14-3-3 proteins 


12.24 2.33e-40 77-1 16 


937 


IPB000308C 


14-3-3 proteins 


18.31 1.00e-36 143-192 


937 


IPB000308D 


14-3-3 proteins 


12.57 1.69e-35 215-269 


937 


PR00305A 


14-3-3 protein zeta signature I 


9.70 9.18e-33 77-106 


937 


PR00305B 


14-3-3 protein zeta signature II 


11.03 4.52e-22 126-150 


937 


PR00305C 


14-3-3 protein zeta signature III 


9.89 5.50e-28 159-181 


937 


PR00305D 


14-3-3 protein zeta signature IV 


17.40 4.81e-27 194-220 


937 


PR00305F 


14-3-3 protein zeta signature VI 


15.40 1.00e-36 248-277 


938 


IPB002048 


EF-hand family 


7.91 5.24e-10 123-135 


939 


PR01353A 


Glucagon family receptor signature I 


8.87 4.86e-10 5-18 


940 


IPB001875B 


Death effector domain 


13.21 1.00e-08 500-509 


940 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 9.65e-09 182-219 


940 


IPB003530A 


Long hematopoietin receptor, soluble alpha 
chains 


23.64 8.76e-09 339-393 


941 


IPB000483 


Leucine rich repeat C-termmal domain 


ll. 18 8.1 le-14 242-256 


941 


PR00019B 


Leucine-rich repeat signature II 


11.42 8.91e-09 135-148 


941 


PR00364D 


Disease resistance protein signature IV 


14.89 4.60e-09 84-100 


944 


IPB000215A 


Serpins 


13.01 9.31e-15 200-223 


944 


IPB000215B 


Serpins 


9.87 4.79e-12 296-308 


944 


IPB000215C 


Serpins 


13.90 5.18e-ll 323-337 


944 


IPB000215D 


Serpins 


15.35 9.27e-ll 430-456 


944 


IPB000215E 


Serpins 


15.36 1.00e-15 508-532 


944 


IPB000895A 


Transthyretin precursor 


12.60 4.04e-09 39-58 


944 


PR00743B 


Glycosyl hydrolase family 36 signature II 


14.95 8.83e-09 240-260 


945 


IPB002905C 


N2,N2-dimethylguanosine tRNA 
methyltransferase 


19.39 5.00e-ll 115-136 


947 


IPB000306 


FYVE Zn-fmger, rabphilin/VPS27/FAB 1 
type 


8.96 3.25e-18 552-564 


947 


IPB001841 


RING finger 


10.69 3.37e-09 559-568 


950 


IPB001304A 


C-type lectin domain 


17.98 5.50e-ll 117-141 


950 


PR00356D 


Type II antifreeze protein signature IV 


12.93 7.41e-09 174-190 


951 


IPB001304A 


C-type lectin domain 


17.98 2.29c- 15 202-226 


951 


PR00356A 


Type II antifreeze protein signature I 


13.29 6.70e-10 197-209 


951 


PR0O356B 


Type II antifreeze protein signature II 


12.71 4.63e-l0 209-226 


953 


IPB001359H 


Synapsin 


22.58 4.43e-10 16-66 


953 


PR00910A 


Luteovirus ORF6 protein signature I 


2.74 3.46e-09 107-119 


954 


PR00901H 


PheromoneB alpha- 1 receptor signature 
VIII 


14.75 4.05e-09 56-66 


955 


PR00436A 


Interleukin-8 signature I 


15.20 4.39e-09 36-59 


956 


IPB003406B 


Core-2/I-Branching enzyme 


11.63 9.31e-37 94-128 


956 


IPB003406C 


Core-2/I-Branching enzyme 


11.36 7.26e-37 145-184 


956 


IPB003406D 


Core-2/I-Branching enzyme 


23.29 1.82e-33 191-230 


956 


IPB003406E 


Core-2/I-Branchmg enzyme 


22.75 3.87e-29 251-292 


958 


IPB000130 


Neutral zinc metallopeptidases, zinc- 
binding region 


5.86 6.54e-ll 236-246 


958 


IPB001818A 


Matrixin 


14.60 7.35e-28 88-117 


958 


IPB0O1818B 


Matrixin 


26.48 7.75e-39 136-177 


958 


IPB001818C 


Matrixin 


24.38 1.00e-40 180-225 


958 


IPB0O1818D 


Matrixin 


14.91 5.34e-30 230-261 


958 


IPB001818E 


Matrixin 


8.86 4.67e- 13 269-282 



L 
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958 


IPB001818F 


Matrixin 


ll. 19 o.85e-22 iUD-iZD 


958 


IPB001818G 


Matrixin 


14. /I z.ole-lz J42-3j4 


958 


IPB001818H 


Matrixin 


15.46 2.89e-2o 40c-434 


958 


IPB002870E 


Reprolysin family propeptide 


11.90 5.22e-10 238-250 


958 


PR00138A 


Matrixin signature I 


12.54 6.79e-16 108-121 


958 


PR00138B 


Matrixin signature II 


14.84 1.56e-17 155-170 


958 


PR00138C 


Matrixin signature III 


20.07 5.74e-28 178-206 


958 


PR00138D 


Matrixin signature IV 


14.57 7.86e-27 236-261 


958 


PR00138E 


Matrixin signature V 


7.10 4.96e-13 269-282 


958 


PR00480B 


Astacin family signature II 


14.35 4.32e-ll 231-249 


958 


PR00997G 


Fragilysin metallopeptidase (M10C) 
enterotoxin signature VII 


11.64 8.53e-09 242-258 


962 


IPB000953 


Chromo domain 


14.73 2.29e-17 63-81 


962 


IPB003606E 


N-terminal to some SET domains 


9.57 2.89e-12 226-235 


962 


IPB003606G 


N-terminal to some SET domains 


13.63 4.00e-10 262-290 


962 


PR00504C 


Chromodomain signature III 


12.01 1.97e-10 72-84 


965 


IPB001729 


Surfactant associated polypeptide SP-C 
palmitoylation sites 


9.43 9. 1 le-09 19-52 


967 


IPB000276A 


Rhodopsin-like GPCR superfarnily 


11.56 9.18e-09 118-129 


967 


IPB000276D 


Rhodopsin-like GPCR superfarnily 


9.40 9.22e-10 229-245 


967 


IPB001133A 


NADH-ubiquinone oxidoreductase chain 4L 


13,73 5.50e-10 89-119 


967 


PR00237A 


Rhodopsin-like GPCR superfarnily 
signature I 


9.8 1 8.3 le-09 26-50 


967 


PR00245B 


Olfactory receptor signature II 


-f o o C~T ~ f\C\ 1 1/11 

13.73 3.57e-09 129-141 


967 


PR00245D 


Olfactory receptor signature IV 


9.34 2.59e-13 183-192 


967 


PR00245E 


Olfactory receptor signature V 


8.96 5.15e-io z3U-z41 


967 


PR00534A 


Melanocortin receptor family signature 1 


12.77 2.83e-09 51-63 


968 


IPB002328A 


Zinc-containmg alcohol dehydrogenase 


20.74 6.65e-09 179-210 


968 


IPB002328B 


Zinc-containmg alcohol dehydrogenase 


14.71 9.03e-12 223-24o 


968 


IPB002364 


Quinone oxidoreductase/zeta-crystallin 


20.78 6.67e-12 223-250 


970 


IPB001017A 


Dehydrogenase, El component 


11.25 7.35e-13 126-138 


970 


IPB001017B 


Dehydrogenase, El component 


25.91 2.64e-27 152-184 


970 


IPB001017C 


Dehydrogenase, El component 


15.29 7.00e-22 200-219 


970 


IPB001017D 


Dehydrogenase, El component 


18.59 1.45e-40 256-294 


970 


IPB001017E 


Dehydrogenase, El component 


10.22 3.65e-15 312-325 


970 


IPB001017F 


Dehydrogenase, El component 


17.40 9.31e-23 340-360 


975 


PR01281D 


Corticotropin releasing factor type 2 
receptor signature IV 


9.32 8.52e-09 388-403 


977 


PR00457A 


Animal haem peroxidase signature I 


15.27 6.79e-12 162-173 


977 


PR00457B 


Animal haem peroxidase signature II 


12.43 1.26e- 15 216-231 


977 


PR00457C 


Animal haem peroxidase signature III 


18.81 1.51e-14 366-384 


977 


PR00457D 


• 1 1 * 1 * i 7~ 

Animal haem peroxidase signature IV 


18.35 5.50e-22 384-404 


977 


PR00457E 


Animal haem peroxidase signature V 


19.97 4.91e-26 409-435 


977 


PR00457F 


Animal haem peroxidase signature VI 


14.42 6.05e-10 461-471 


977 


PR00457G 


Animal haem peroxidase signature VII 


14. 17 3.25e-18 590-610 


977 


PR00457H 


Animal haem peroxidase signature VIII 


14.82 6.19e-13 684-698 


978 


IPB00127I 


Mammalian defensin 


1 O 07 6 1^0^ 19^ 


985 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 8.63e-10 145-160 


985 


IPB002174A 


Furin-like cysteine rich region 


30.51 9.03e-10 26-57 


987 


PR00019B 


Leucine-rich repeat signature II 


11.42 8.33e-10 124-137 


987 


PR00019B 


Leucine-rich repeat signature II 


11.42 8.33e-10 316-329 


988 


IPB000392A 


NifH/frxC family 


9.08 2.98e-ll 107-122 


988 


IPB0008O8A 


Mrp family 


23.51 8.67e-37 101-145 
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no o 

988 


IPB000808B 


Mrp family 


15.70 7.48e-3o 191-227 


988 


IPB000808C 


Mrp family 


23.57 8.77e-29 250-285 


988 


IPB000808D 


Mrp ramily 


28.05 6.40e-24 297-337 


988 


TTinA A 1 A O /I /-I 

IPB001984C 


ATP-dependent serine proteases, Lon 
family 


12.76 9.82e~09 103-139 


Ann 

988 


IPB003348A 


Amon-transporting ATPase 


OA r\£Z "~! If ^ 1 f 1 A^ 1 A O 

20.06 7.75e-15 106-143 


988 


PR00091A 


Nitrogenase component II signature I 


7.81 4.38e-09 109-122 


A A Ci 

988 


PR00094A 


Adenylate kinase signature I 


A /"O A O AA 1 AT 1 AA 

9.62 9.86e-09 107-120 


988 


PR00364A 


Disease resistance protein signature I 


8.29 7.59e-10 105-120 


989 


IPB001007B 


TtT'11 1 "1 A A A^"i A 

von Willebrand factor, type C repeat 


10.03 4.79e-09 259-268 


989 


IPB001846B 


von Willebrand factor type D domain 


10.86 1.82e-12 489-503 


/V /""I /»x 

989 


IPB002919B 


Trypsin Inhibitor-like cysteine rich domain 


21.14 6.45e-10 321-343 


989 


PR00163B 


Rubredoxin signature II 


15.17 9.25e-09 215-231 


992 


IPB000353B 


Class II histocompatibility antigen, beta 
chain, beta-1 domain 


19.16 8.48e-10 133-182 


992 


IPB003006A 


Immunoglobulin and major 
histocompatibility complex domain 


17.51 1.38e-ll 154-176 


994 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 6.48e-ll 86-123 


994 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 8.04e-ll 267-304 


995 


IPB001283A 


Extracellular proteins SCP/Tpx-l/Ag5/PR- 
1/Sc7 


16.26 6.67e-10 29-48 


995 


IPB001283B 


Extracellular proteins SCP/Tpx-l/Ag5/PR- 
1/Sc7 


13.09 2.33e- 12 57-72 


995 


IPB001283E 


Extracellular proteins SCP/Tpx-l/Ag5/PR- 
1/Sc7 


13.34 5.50e-20 127-147 


995 


IPB001283F 


Extracellular proteins SCP/Tpx-l/Ag5/PR- 
1/Sc7 


1A ylA /T /10_ 11 1/TA 1*7A 

10.49 o.48e-ll 160-170 


995 


IPB003106E 


TT 1 * j. J 1 " * 

Homeobox associated leucine zipper 


13.73 6.90e-09 25-5^ 


995 


PR00837A 


Allergen V5/Tpx-1 family signature I 


14.69 1.56e-14 57-75 


995 


PR00837C 


Allergen V5/Tpx-1 family signature III 


16.31 5.50e-20 126-142 


995 


T*T* AA A/^ WTV 

PR00837D 


Allergen V5/Tpx-1 family signature IV 


11.13 3.70e-ll 160-173 


995 


PR00838D 


Venom allergen 5 signature IV 


8.59 3,57e-09 57-75 


995 


PR00838G 


Venom allergen 5 signature VII 


15.39 8.60e-18 125-144 


1000 


IPB000353B 


Class II histocompatibility antigen, beta 
chain, beta-1 domain 


19.16 3.25e-13 21-70 


1000 


IPB001003B 


MHC Class II, alpha chain, alpha- 1 domain 


14.72 8.38e-ll 29-72 


1000 


IPB003006A 


Immunoglobulin and major 
histocompatibility complex domain 


17.51 3.63e-ll 42-64 


1000 


IPB003006B 


T 1 1 1 * 1 • 

Immunoglobulin and major 

1 * A J # 1 "1*1 -1 I • 

histocompatibility complex domain 


20.23 8.31e-17 78-115 


1002 


IPB000306 


FYVE Zn-finger, rabphihn/VPS27/FABl 
type 


8.96 9.74e-09 181-193 


1002 


IPB000822 


Zinc finger, C2H2 type 


14.67 1.60e-16 213-238 


1002 


IPB000822 


Zinc finger, C2H2 type 


1 /l A AO 1 O ^AT OA<^ 

14.67 2.38e-18 297-322 


1 AAO 


lrB0u0o22 


Zinc tmger, K^ztiz type 


14. o/ Z.DUe-iy jZj-oDU 


1002 


IPB000822 


Zinc finger, C2H2 type 


14.67 7.23e-18 185-210 


1002 


IPB000822 


Zinc finger, C2H2 type 


14.67 7.23e-18 241-266 


1002 


IPB000822 


Zinc finger, C2H2 type 


14.67 7.75e-20 269-294 


1002 


IPB001275 


DM DNA binding domain 


19.17 1.85e-10 201-240 


1002 


IPB001275 


DM DNA binding domain 


19.17 4.65e-10 313-352 


1002 


IPB001275 


DM DNA binding domain 


19.17 5.50e-10 257-296 


1002 


1PB001275 


DM DNA binding domain 


19.17 6.24e-12 285-324 



T 
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1002 


IPB001275 


DM DNA binding domain 


iy.17 o.ooe-lU IZy-Zoo 


1002 


PR00048A 


C2H2-type zinc finger signature I 


y.y4 l.UUe-(jy Ii\)-zzd 


1002 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.3oe-U9 1o2-i9D 


1002 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.41e-IU JoU-^o3 


1002 


PR00048A 


C2H2-type zinc finger signature I 


9.94 l.Doe~Iz z3o-Zj1 


1002 


PR00048A 


C2H2-type zinc finger signature I 


9.94 I.y3e~ll zy4-3U/ 


1002 


PR00048A 


C2H2-type zinc finger signature I 


9.94 Z.Dl)e-U9 oZZ-JJD 


1002 


PR00048A 


C2H2-type zinc finger signature I 


9.94 o. / 9e-io zoo-z /y 


1002 


PR00048B 


C2H2-type zinc finger signature II 


c o Ar\„ aa 1 no on*7 


1002 


PR00048B 


C2H2-type zinc finger signature 11 


C CI c cn n iaoia no 
D.DUe-iU JLVJ-D iy 


1002 


PR00048B 


C2H2-type zinc finger signature II 


D.DZ y.UUe-Uy OJO-D^t/ 


1002 


PR00048B 


C2H2-type zinc finger signature II 


5.52 y.3oe-IU zoZ-zyi 


1003 


IPB000822 


Zinc finger, C2H2 type 


14. o/ l.UUe-1 / 4o/-4yz 


1003 


IPB000822 


Zmc finger, C2H2 type 


1 /I /in O CA« OA CO! C/1 0 


1003 


IPB000822 


Zmc finger, C2H2 type 


14. o/ z.oUe-io zoz-zs / 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.0/ Z.yje-l/ 439-404 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.0/ 4.Z/e-Zl 3o^-4Uo 


1003 


IPB000822 


Zmc finger, C2H2 type 


14.0/ 4.oUe-lo oo-5-oe5i$ 


1003 


IPB000822 


Zmc finger, C2H2 type 


14. o/ 4.y4e-lJ OOJ-OOU 


1003 


IPB000822 


Zmc finger, C2H2 type 


14. o/ j.uye-zi 411-430 


1003 


IPB000822 


Zmc finger, C2H2 type 


1/1 £.1 C Ola 01 ,1 O^C COA 

14.0/ D.yie-zi 4yDozu 


1003 


IPB000822 


Zmc ringer, C2H2 type 


14. 0/ O.zDe-iy 3Z/-3DZ 


1003 


IPB000822 


Zmc finger, C2H2 type 


1 /I /C"7 nC n OA C7Q 

14.6/ O.Zje-zU j /9-0U4 


1003 


IPB000822 


Zmc finger, C2H2 type 


1 /I iC7 £i C /I /n. lO ICC OQA 

14.6/ O.04e-io 3_>3-3oU 


1003 


IPB000822 


Zmc finger, C2H2 type 


1 A CI £L Trta 1*7 OQQ l*)/l 

14.67 6.79e-17 299-3Z4 


1003 


IPB000822 


Zinc finger, C2H2 type 


1 /I /T "7 O OA. OO C.C\"~I /TQ O 

14.67 8.20e-22 oU7-o3z 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 8.71e-17 551-5 /o 


1003 


IPB001275 


DM DNA binding domain 


19.17 3.69e-lz 3 /1-41U 


1003 


IPB001275 


DM DNA binding domain 


10 1*7/1 01 - 11 1QQ A *i 0 

19.1/ 4.zie-ii oyy-43o 


1003 


IPB001275 


DM DNA binding domain 


1A 1*7 A *>1a 11 COO 

19.1/ 4.Z le- 1 1 4o3-5zZ 


1003 


IPB001275 


DM DNA binding domain 


1A 1*7 A Ofln 1A r lQ C*7Q 

19.1/ 4. ©ye- iu j3yo /o 


1003 


IPB001275 


DM DNA binding domain 


1 A 1 *7 C CHa. 1 1 CQC ^1/1 

19. 1 / 5.5 /e-13 5y5-o34 


1003 


IPB001275 


DM DNA binding domain 


iy.1 / 6.35e-lU 4jj-4y4 


1003 


IPB001275 


DM DNA binding domain 


1 A 1*7*7 C^T^w 11 C1 1 <^A 

19.17 7.56e-ll 511-55U 


1003 


|J- -1-1 i — 1 .A «*V .fl 

IPB001275 


DM DNA binding domain 


in 1*7 O /1*)« in I^O 1QO 

19.1/ o.4Ze-lU 343-3 oz 


1003 


IPB001275 


DM DNA binding domain 


1 rv 1*7 n f\i<~L in /Cc 1 /To A 

19.17 9.U3e-10 051-oyu 


1003 


IPB001275 


DM DNA binding domain 


1 a 1 *7 a nn« AAOQ*7 1*)^ 

19.1/ 9.09e-uy Zo/-3zo 


1003 


IPB001275 


DM DNA binding domain 


1A 1*7A l/l-. 1*3 C/C*7 /CA/C 

19.1 / 9. 14e-13 5o/-oUo 


1003 


IPB001275 


DM DNA binding domain 


ia 1*7 n in 01 c 1C/1 
19. 1 / y.64e-lU 3 15-3-54 


1003 


PR00048A 


C2H2-type zmc finger signature I 


y.y4 i,uue-Uo 540-501 


1003 


PR00048A 


C2H2-type zmc ringer signature I 


n n/i 1 oifl in ocn OTO 

y.94 l.oze-iu z5y-z/z 


1003 


PR00048A 


i»"^y^X"X > y^ * * » X" 

C2H2-type zmc finger signature I 


nn/i 0 c\/Z ^ 10 /l^/l zl'7'7 
9.94 Z.Uoe- Iz 4o4-4 / / 


1003 


PR00048A 


4 -w- -m- > * J X" 

C2H2-type zmc fmger signature I 


0 c\A O fio Q 1 1 £ir\A ^1*7 

9.94 Z.93e-13 0(J4-01 / 


1003 


PR00048A 


C2H2-type zmc finger signature I 


n A/1 O *)C_ AA TO/1 O'lO' 

9.94 3.z5e-U9 324-33 / 


1003 


PR00048A 


C2H2-type zmc finger signature I 


n n a a o*7„ in coa coo 

9.94 4.27e-10 520-533 , 


1003 


PR00048A 


C2H2-type zinc finger signature I 




1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 5.15e-14 576-589 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.54e-14 408-421 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.73e- 10 436-449 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 7.23e-14 492-505 


1003 


PR00048A 


C2H2~type zinc finger signature I 


9.94 7.35e-12 352-365 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 8.88e-09 632-645 


1003 


PR00048A | C2H2-type zinc finger signature I 


9.94 9.53e-ll 660-673 
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-i s~\ A A 

1003 


PR00048B 


C2H2-type zmc fmger signature II 


5.52 2.5Ue-0y o/o-oo5 


1003 


PR00048B 


C2H2-type zinc finger signature II 


c co o ci ^ in cao cn 

5.52 J.57e-10 50o-517 


1003 


PR00048B 


C2H2-type zmc finger signature II 


5.52 3.57e-10 536-545 


1003 


PR00048B 


C2H2-type zinc finger signature II 


5.52 3.57e-10 620-o2y 


1003 


PR00048B 


C2H2-type zinc finger signature II 


c co /i r\r\ ^ r\ci cr\o /tai 

5.52 4.00e-0y 592-601 


1003 


PR00048B 


C2H2-type zmc finger signature II 


5.52 4.86e-10 312-321 


1003 


PR00048B 


C2H2-type zmc finger signature II 


5.52 4.8oe-10 396-405 


1003 


PR00048B 


C2H2-type zmc finger signature II 


5.52 6.l4e-l0 340-349 


1003 


PR00048B 


C2H2-type zmc finger signature II 


5.52 7.75e-l 1 424-433 


1003 


PR00048B 


C2H2-type zinc finger signature II 


5.52 S.00e-09 368-377 


1003 


PR00048B 


C2H2-type zinc finger signature II 


5.52 9.00e-09 564-573 


1007 


IPB001781 


LIM domain 


H.42 l.00e-09 536-546 


1007 


IPB001781 


LIM domain 


11.42 8.11e-12 469-479 


1007 


IPB001781 


LIM domain 


l l. 42 9.59e-ll 4H-421 


1010 


IPB000822 


Zinc finger, C2H2 type 


14.67 2.88e-09 156-181 


1011 


PR00456F 


Ribosomal protein P2 signature VI 


5.70 6.76e-12 76-87 


1016 


IPB002087A 


Anti-proliferative protein 


15.38 5.68e-13 14-28 


1016 


IPB002087B 


Antiproliferative protein 


19.44 6.88e-20 122-155 


1016 


IPB002087C 


Anti-proliferative protein 


13.79 1.00e-23 180-200 


1016 


PR00310A 


Anti-proliferative protein BTG1 family 
signature I 


10.74 2.45e-28 16-40 


1016 


PR00310B 


Anti-proliferative protein BTG1 family 
signature II 


10.25 6.14e-24 123-152 


1016 


PR00310C 


Anti-proliferative protein BTG1 family 
signature III 


12.94 8.41e-36 153-182 


1016 


PR00310D 


Anti-proliferative protein BTG1 family 
signature IV 


9.99 4.71e-38 183-212 


1016 


PR00310E 


Anti-proliferative protein BTG1 family 
signature V 


14.23 2.06e-24 229-248 


1017 


IPB000306 


FYVE Zn-fmger, rabphilmA^PS27/FAB 1 
type 


8.96 9.74e-09 491-503 


1017 


IPB000822 


Zmc finger, C2H2 type 


14.67 1.64e-17 411-436 


1017 


IPB000822 


nr ■ a*~\^\ T T/^ * 

Zmc finger, C2H2 type 


1 A y~j 1 >1 1 -7 C /I o 

14.67 1.64e-17 523-548 


1017 


IPB000822 


Zmc finger, C2H2 type 


14.67 3.00e-24 383-408 


1017 


IPB000822 


Zmc finger, C2H2 type 


14.67 3.25e-20 355-380 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 4.27e-21 467-492 


1017 


IPB000822 


1— ^ ■ c+ Af~"i?\ T T^% > 

Zmc fmger, C2H2 type 


1/1 /I O _ 11 11 f 0>1A 

14.67 4.86e-ll 215-240 


1017 


IPB000822 


Zinc finger, C2H2 type 


1 A /TO yi /"V ^ _ 1 /I 1*71 /-S /"\ .'f 

14.67 4.94e-14 271-296 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 5.50e-09 121-146 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 6.79e-17 439-464 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 8.50e-20 327-352 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 8.62e-18 495-520 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 8.71e-10 149-174 


1017 


IPB001275 


DM DNA binding domain 


19.17 5.24e-ll 343-382 


1017 


IPB001275 


DM DNA binding domain 


ir\ n r oo_ irv joo r* ^ 

19. 17 5.38e-10 483-522 


1017 


IPB001275 


DM DNA binding domain 


19.17 6.35e-09 315-354 


1017 


IPB001275 


JJM JJJNA bin dmg domain 


iy. i / o.uoe-uy 4z/-4oo 


1017 


IPB001275 


DM DNA binding domain 


19.17 8.18e-09 455-494 


1017 


IPB001275 


DM DNA binding domain 


19.17 8.78e-10 511-550 


1017 


IPB001275 


DM DNA binding domain 


19.17 9.61e-ll 371-410 


1017 


IPB001909 


KRAB box 


17.37 2.50e-33 44-78 


1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.00e-13 408-421 


1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 5.09e-10 324-337 


1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 5.24e-12 436-449 
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1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.54e-14 380-393 


1017 


PR00048A 


C2H2-type zinc finger signature I 


v xx a /—t xx i t**s A /x xx r* r\ f-* 

9.94 673e-10 492-505 


1017 


PR00048A 


C2H2-type zinc finger signature I 


XX X"X A *r" r—t XX <i xx A .f A A Kt^f 

9.94 6.79e-13 464-477 


1017 


PR00048A 


C2H2-type zinc finger signature I 


XX XX A r—t XX XX 1 XX XX r* XX, XX X** f* 

9.94 7.88e-12 352-365 


1017 


PR00048A 


C2H2-type zinc finger signature I 


X*\ /X, A XX XX XX /X XX X" XX XX /") -t 

9.94 8.13e-09 268-281 


1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 9.10e-15 520-533 


1017 


PR00048B 


C2H2-type zmc finger signature II 


f* x - xx xx x— r\ r\ xx xx A xx xx A xx 1 

5.52 2.50c-09 340-349 


1017 


PR00048B 


C2H2-type zinc finger signature II 


7— t— r*. xx xxi x» 't -i xx xx y- A f\ £T 

5.52 3.25e-ll 396-405 


1017 


PR00048B 


C2H2-type zinc finger signature II 


x— x- 1 xx xx xx, /- *i -1 /"\ FJ C 1 

5.52 3.25e-l l 508-517 


1017 


PR00048B 


C2H2-type zmc finger signature II 


5.52 7.00e-ll 368-377 


1017 


PR00048B 


C2H2-type zinc finger signature II 


t xx xx. xx x** 1 /"\ ^ C\ /~\ AC) f\ 

5.52 9.36e-10 480-489 


1020 


IPB001781 


LIM domain 


11.42 1.69e-10 160-170 


1020 


IPB001781 


LIM domain 


-t -t A XX 4 XX X* 1 XX XX H XX XX XX XX 

11.42 1.95e-12 219-229 


1020 


IPB001781 


LIM domain 


11.42 5.26e-12 99-109 


1022 


PR01217G 


Proline rich extensin signature VII 


A XX XX X*" -X** -f XX XX 1 /X XX -1 ™" XT 

4.02 6.61e-09 130-155 


1024 


IPB000822 


Zinc finger, C2H2 type 


14.67 1.00e-16 410-435 


1024 


IPB000822 


Zinc fmger, C2H2 type 


-1 4 X~*~f /X XX X^ *f XX X\ XX x^* *X ^ -1 

14.67 3.25e-19 326-351 


1024 


IPB000822 


Zinc finger, C2H2 type 


-* i s~ ^—i X^N X' XX XX -1 4 XX XX XX 

14.67 3.25e-20 214-239 


1024 


IPB000822 


Zinc finger, C2H2 type 


J XW XX X^X^T i n XX >« XX XX X"r1 

14.67 3.77e-18 242-267 


1024 


IPB000822 


Zinc finger, C2H2 type 


^ J j y~« /~» xx Xx xx xx >| xx x^ 

14.67 4.00e-19 382-407 


1024 


IPB000822 


Zinc finger, C2H2 type 


4 j y xt ji x^ xx x*\ xx x^ /x xx xx x» 

14.67 4.75c-20 270-295 


1024 


IPB000822 


Zinc finger, C2H2 type 


-mm j ^ — ■— f j j— v 4 «f J «X XX (— \ *X XX 'X 

14.67 4.94e-14 298-323 


1024 


IPB000822 


Zinc finger, C2H2 type 


■4 J v** ^"w r*+ x* XX /x xx *t x - xx 1 r~% rx 

14.67 5.50e-09 158-183 


1024 


IPB000822 


Zinc finger, C2H2 type 


14.67 6.00e-13 186-211 


1024 


IPB000822 


Zinc finger, C2H2 type 


-n\ A XX X\ XT -f XX X - >< XX XT XX 

14.67 8.07e-17 354-379 


1024 


IPB001275 


DM DNA binding domain 


"M XX -*t XT •< X" XX X^ XX XX XX t~~% A XX mmm f 

19.17 1.68e-09 398-437 


1024 


IPB001275 


DM DNA binding domain 


4 xx -w xX XX i mmm t 4 XX ^ *f A -X X - xx 

19.17 3.07e-10 314-353 


1024 


IPB001275 


DM DNA binding domain 


■4 y^v -4 XT X* XX X— ♦ -| 'i XX XT XX /I XX XX 

19.17 5.37e-ll 370-409 


1024 


IPB001275 


DM DNA binding domain 


-4 XX. -4 >— « *m «T 4 1 A ^ /"O XX XX XT 

19. 17 5.74e-10 258-297 


1024 


IPB001275 


DM DNA binding domain 


«4 XX -4 X-f X** *t /*\ -f /X X\ XX XX 4 -1 

19.17 6.10e-12 202-241 


1024 


IPB001275 


DM DNA binding domain 


i a -1 *"T n *1 x>* -f xx f> A r-\ ^ o "f 

19.17 8.12e-12 342-381 


1024 


IPB001275 


DM DNA binding domain 


*• xx 4 x^ xx jf xx xx xx <*x /x xx >r" xx 

19.17 9.66e-09 230-269 


1024 


IPB001502E 


Endonuclease III 


XX 1 j^* /— 7 jX >f r"X ("X XX X~ XX XX r — 7 /~\ 

23.53 7.04e-09 252-279 


1024 


IPB001909 


KRABbox 


XT X 1 * ^-f j*X XX XX XX XX *\ XT X» -f 

17.37 9.00e-33 17-51 s 


1024 


IPB002867C 


Cysteme-nch domain (C6HC) 


*mj /X x y X% XX XT /X XX /X XX> XX /X /I x\ 

19.46 3.37e-09 323-340 ; 


1024 


IPB002867C 


Cysteme-nch domain (C6HC) 


« /\ X y XX XX X» /X XX XX X" XT XX O ^1 

19.46 9.05e-09 267-284 i 


1024 


PR00048A 


C2H2-type zinc finger signature I 


XX XX A •*! XX XX /"% XX -1 -1 XX XX .1 

9.94 1.82e-10 211-224 i 


1024 


PR00048A 


C2H2-type zinc finger signature I 


XX XX ^ -4 X\ XX •% p* XX ^ 1 XX x - " yl 

9.94 1.90e-15 351-364 


1024 


PR00048A 


C2H2-type zinc finger signature I 


XX XX M XX XX XX -4 XX ^\ WX\ XX XX ^X 

9.94 2.23e-10 379-392 


1024 


PR00048A 


C2H2-type zinc ringer signature I 


XX XX A XX XX XX XX yx XX *»X XX j^" 

9.94 6.29e-12 323-336 


1024 


PR00048A 


C2H2-type zinc finger signature I 


xx\ xx A ^-r A xx -i x^ xx t^" mmw y xx /-\ xx 

9.94 7.43e-13 267-280 ! 


1024 


PR00048A 


C2H2-type zinc finger signature I 


xx xx a i m * m 9 r+ x* -4 xx xx xx r* *x /x 

9.94 7.55e-10 295-308 


1024 


PR00048A 


C2H2-type zinc finger signature I 


9.94 8.58e-ll 407-420 


1024 


PR00048A 


C2H2-type zinc finger signature I 


9.94 8.62e-14 239-252 


1024 


PR00048B 


C2H2-type zinc finger signature II 


5.52 5.50e-10 227-236 


1024 


PR00048B 


C2H2-type zinc finger signature II 


m— — *mm j*— XX XX XX A XX XX yf /X XX 

5.52 6.00e-09 423-432 


1024 


PR00048B 


C2H2-type zinc finger signature II 


5.52 8.07e-10 283-292 


1024 


PR00048B 


C2H2-type zinc finger signature II 


5.52 8.07e-10 339-348 


1024 


PR00048B 


C2H2-type zinc finger signature II 


5.52 8.07e- 10 395-404 


1025 


PR00966H 


Potyvirus nuclear inclusion A cysteine 
protease (C4) signature VIII 


13.43 9.45e-09 215-234 


1026 


IPB000315A 


B-box zinc finger superfamily 


24.51 4.67e-12 19-42 


1026 


IPB001293A 


TRAF-type zinc finger 


26.83 8.34e-10 10-54 



WO 03/054152 PCT/US02/39555 



Table 3B 
375 



SEQ 

ID 

NO: 


Database 
entry ID 


Description 


Results* 


1026 


IPB003649A 


B-Box C-terminal domain 


13.56 7.92e-12 26-39 


1026 


PR00194D 


Tropomyosin signature IV 


9.54 7.21e-10 150-173 


1026 


PR01511F 


Kvl.4 voltage-gated K+ channel signature 
VI 


C t f\ A C A f\f\ 1/11 1 CC 

5.19 4.54e-09 141-155 


1029 


IPB000006 


Vertebrate metallothionein, family 1 


10 At /T 1 O „ 1 1/; 

13,41 6.13e-10 91-136 


1029 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 6.23e-09 71-116 


1029 


IPB000118A 


Granulin 


O r"\ t? O O A f\ C\ 1/11 1 /*" O 

8.25 8.84e-09 141-163 


1029 


IPB000359B 


Cystine-knot domain 


■\ f\ SIS' f\ A A Art 101 1 C\C\ 

19.26 9.44e-09 181-199 


1029 


IPB000726B 


Glycoside hydrolase family 19 


5.39 2.54e-09 66-76 


1029 


IPB000967E 


Zinc finger NF-X1 type 


21.88 6.54e-09 105-140 


1029 


IPB000967E 


Zinc finger NF-X1 type 


21. oo 7.o3e-10 lo3-z/3 


1029 


IPB000967E 


Zinc finger NF-X1 type 


21.00 y.U4e-U9 130-1 /o 


1029 


IPB001169K 


Integrin beta, C-terminus 


O o /i c /T ^ nn 1 AT 1 /t O 

27.45 6.yoe-Uy lU/-14y 


1029 


IPB001169K 


Integrin beta, C-terminus 


27.45 7.93e-09 56-9o 


1029 


IPB001762A 


Disintegrin 


AT £. A 1- ! ^ AA IOC OO C 

23.93 6.47e-U9 1oj-2zj 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


10 A A O O /I ^ 1A 1 AO 1QC 

12.44 2.o4e-l(J 102-133 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


10 /1/t *> AO^ 1 O 1 AO 

12.44 3.0oe-13 164-197 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


1 O /I /I O 0;C» 1 A ^ £1 Af~l 

12.44 3.25e-10 6o-y9 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


lO A A O O C _ 1AHO 1 OC 

12.44 3.25e-10 92-lio 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


1 O /I A O /I £f _ 11 IIO 

12.44 3.46e-ll 112-145 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


1 O /I /l O c~ O ~ 11 1 /Trt OAO 

12.44 3.68e-ll 169-202 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


1 r\ a a O or»« 10 100 ICC 

12.44 3.89e-12 122-155 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


1 o a a a r\r\~ 1/11 cr\ 1 AO 

12.44 4.00e-14 159-192 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


-I /■> y4 yf yl OH™ AA 1 ^7/1 OAT 

12.44 4.39e-09 174-20/ 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


10 /1/1 /I /Z A ^ 11 OA/T OOft 

12.44 4.64e-ll 206-239 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


1*1 A A C 1 r» _ 1 A 1 AO 1 /I A 

12.44 5.19e-10 10/-14(J 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 5.65e-09 91-124 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


•to /i >i a /i— l/i o^T inn 

12.44 6.04e-14 76-109 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


1 n >i ^ ^ 1 n 1001 11/i 

12.44 6.19e-13 81-114 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.71e-09 8-41 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.88e-15 117-150 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


1 ;-v /« J r-t nn_ OO OO 1 O C /I 

12.44 7.00e-09 221-254 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


lO A A *~7 1 O „ AA /T—7 1 AA 

12.44 7.10e-09 6/-1UU 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


10 /i/i t /( o„ 00000 ic c 

12.44 7.48e-09 222-255 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


1 O A A O 1 /C^ AA 01 iC O/j Q 

12.44 o.ioe-uy 21o-24y 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


IO A A O 0.0— IO 1 A/C OOO 

12.44 o.3-5e-iz iyo-22y 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


IO A A O O O _ 10*71 1 A/I 

12,44 8.38e-13 71-104 


1029 


IPB002494A 


-_- — ■ 1*1 1 ^\ i 

Keratin, high sulfur B2 protein 


IO /I /t O O /I _ AA 1 AO OO A 

12.44 o.o4e-(jy iy /-23U 


1029 


IPB002494A 


■w- w _ ■ 1*1 4 _T% | k y% j * 

Keratin, high sulfur B2 protein 


1 O A A n AO — A A O iZ 1 1 A 

12.44 9.03e-09 00- 1 iy 


1029 


IPB002494A 


Keratm, high sulfur B2 protein 


IO A A AOO« 1A ~\ G A 1 OO 

12.44 9.28e-10 154-lo / 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


1 ^ A A (~\ OO^. AA C*7 AA 

12.44 9.32e-09 57-90 


1029 


IPB002494B 


Keratm, high sulfur B2 protein 


•t/\,CO 1 IO— A A OA O A 

10.58 1.13e-09 70-84 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 2.01e-09 136-150 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 3.15e-10 22-36 


1029 


IPB002494B 


Keratm, high sulfur B2 protein 


10.58 5.94e-09 215-229 


1029 


TTJDAA1 AClA'D 

iPB00z4y4rJ 


Jveratm, nign sunur r>z proiein 


1 n co £ 90e-09 105-119 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 6.58e-09 163-177 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 7.85e-10 210-224 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 8.23e-09 234-248 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 8.66e-10 95-109 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 9.62e-09 121-135 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.10e-14 31-74 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 l.lle-17 69-112 
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1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A C 1 T7 a 1 1 T7 TO 

14.4o l.z/e-Il Z/-/U 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 l.z/e-11 jy-lU2 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.39e-lJ iyy-z42 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 /l^: 1 11 11/1 1 -^'7 

14.46 l.o4e-ll J.14-1D/ 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 /l/C 1 1 C 1 A/1 1 A 1 

14.46 l./ze-iO 1U4-14/ 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1/1/^1 ^70^ 1 n OH/1 1AH 

14.46 1. /ve- 1U zU4-z4 / 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


■\ a A£. o ooa n inn i A ^ 
14.46 z.zze-iz lUU-l-^J 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 /1/C O no 11< 175 

14.46 Z.jDe-Uy Ioj-1 /o 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


i/i /i^: o /i^^ no 177 oon 
14.46 z.43e-uy 1 / /-zzu 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 i1<C O OAa 10 IK KO 

14.46 2.o0e-lo 11j-1Do 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 /I O 1 iC- 1 n 1 /CO 7A< 

14.46 3.16e-iy 162-zUj 


1029 


IPB002494C 


Keratm, high sulfur B2 protein 


i/i /ii£T^ ia« i/i ion i 
14.46 3.20e-14 120-163 


1029 


IPB002494C 


Keratm, high sulfur B2 protein 


1 A A £. i /ir. -j'j r»yi i 07 

14.46 3.45e-li y4-13/ 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 /I 1 '-7/-S „ 1 C 1 /C7 11 A 

14.46 3.7ye-lj 16/-210 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 /i /i /" o p\ /i „ rvn oc no 

14.46 3.94e-0y e5-12o 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 A £. A no ^ 1 1 O/C £ZC\ 

14.46 4.U3e-13 z6-6y 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A £. A /lO^ 10 1 n7 

14.46 4.4ze-13 64- 1U/ 


1029 


IPB002494C 


Keratm, high sulfur B2 protein 


1/1 /I A ti„ i o nr ioq 

14.46 4.72e-13 y3-13o 


1029 


IPB002494C 


Keratm, high sulfur B2 protein 


1/1 A£L A 01« K 0/1 177 

14.46 4.o3e-lj o4-lz/ 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A /!/* C AA„ 1 y| 1 77 71 C 

14.46 5.00e-14 l/z-zl3 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


i /i /i c ^ ^7 „ nn i o o ii <c 

14.46 5.37e-Uy lo3-zz6 


1029 


IPB002494C 


-WW i * 1*1 1 /** 1^*4 f\ 2 * 

Keratin, high sulfur B2 protein 


i/i /izrcoo^7f\ inn 
14.46 5.3oe-zU 2\)y-ZjZ 


1029 


IPB002494C 


-ww >* 1*1 1/** T~~V i * 

Keratm, high sulfur B2 protem 


1 A A £L C Gf\~ 17 lin O/CO 

14.46 5.50e-lz 21V-262 


1029 


IPB002494C 


-WW > * 1*1 If** f V /— k ■ * 

Keratm, high sulfur B2 protein 


i/i /i /r c , "7 1 no 1/17 1 on 

14.46 5.71e-Uy 14/-iyu 


1029 


IPB002494C 


Keratm, high sulfur B2 protem 


i/i /i £i c "i c\ ^ no 7A 1 1 o 

14.46 5.79e-U9 /U-113 


1029 


IPB002494C 


Keratm, high sulfur B2 protein 


i/i a n £1 nOn 1 1 on 1 /17 

14.46 6.(jye-i3 yy-i4z 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 //C /" 1AOO 111 

14.46 6.33e-10 yU-133 


1029 


IPB002494C 


•ww . * i*i i r* T*fc**i j_ • 

Keratm, high sulfur B2 protein 


1/1 ^ /17o 1AQ7 7C 

14.46 6.4ze-lU jz- / j 


1029 


IPB002494C 


■w W j * 1 • 1 1 {+ T~\, /*\ J_ * 

Keratin, high sulfur B2 protein 


1/1 A£L H 1 /1 Q AO 1/17 1 Q< 

14.46 /. 14e-Uy 14z-1c5j 


1029 


IPB002494C 


-WW m • 1*1 "1 J*"* T— V 1 * 

Keratm, high sulfur B2 protein 


i/i A£i n 1 ^ n i on 170 
14.46 /. 16e-13 13U-1/3 


1029 


IPB002494C 


-ww . • 1*1 1 P JL " 

Keratm, high sullur B2 protem 


1/1 /1^ 7 1Co IO 1 <7 7O0 

14.46 /.3oe- iy 13 I-ZVV 


1029 


IPB002494C 


'ww j ■ 1*1 1 /** ■f'k f\ ] • 

Keratin, high sulfur B2 protein 


1 A A £. 7 7A 7n 177 

14.46 7.3ye-zU /y-liz 


1029 


IPB002494C 


■_- • 1*1 1 /> W—\ 1 * 

Keratm, high sulfur B2 protein 


1 A A £L O 1 c „ no 1 CO 

14.46 o.l5e-l)y 16-3y 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


i/i /i zr o i c „ noon no 

14.46 o.l5e-Uy oU-lz3 


1029 


IPB002494C 


-_- _ • 1*1 1/** ir^Ti r * 

Keratin, high sulfur B2 protein 


1 /t /i a o 7 n« 10 11A 1 CO 

14.46 o.29e-iy 1 10-15 j 


1029 


IPB002494C 


— - — - . « 1*1 1/^ A . * 

Keratm, high sulfur B2 protein 


1 A A/Z O 1 1 1 nO KO 

14.46 5.36e-ll l(jy-l3z 


1029 


IPB002494C 


_ _ _ - • 1*1 1 r+ ~x~-m /~\ ■ ■ 

Keratm, high sulfur B2 protein 


1/1 /I /^ O /I 1 „ IO lOf 1/TO 

14.46 o.41e-lz lzD-16o 


1029 


IPB002494C 


-WW m • 1*1 "I f% , • 

Keratm, high sulfur B2 protein 


i/i /i /t o ,1 c ic on 1 io 

14.46 8.45e-15 89-132 


1029 


IPB002494C 


— - _ , « 1*1 t 1^ /*4 i * 

Keratm, high sulfur B2 protem 


1 /l /l /T o ra„ aa i on 707 

14.46 o.57e-Uy 189-232 


1029 


IPB002494C 


Keratm, high sulfur B2 protem 


1 /I /li£ O O^a K 11 -C/1 

14.46 o.o6e-lj 11-34 


1029 


IPB002494C 


-m- >■ . ♦ 1*1 1 JT*. I . 

Keratin, high sulfur B2 protem 


i/i /i/c om« 1 1 1^7 in? 
14.46 o.y2e-lo 132- iy3 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.13e-17 21-64 


1029 


IPB002494C 


4 1*1 1 1 % /% J * 

Keratin, high sulfur B2 protein 


14.46 9.16e-21 74-117 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.27e-ll 6-49 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.71e-13 36-79 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A£ Q 7^<» AO /I 1 C/t 

14.46 y. /3e-Uy 41-o4 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.80e-14 105-148 


1029 


IPB002919A 


Trypsin Inhibitor-like cysteine rich domain 


15.56 5.71e-09 24-36 


1031 


IPB002130A 


Cyclophilin-type peptidyl-prolyl cis-trans 
isomerase 


13.87 2.17e-15 18-39 


1031 


IPB002130B 


Cyclophilin-type peptidyl-prolyl cis-trans 
isomerase 


21.20 5.85e-39 48-86 


1031 


IPB002130C 


Cyclophilin-type peptidyl-prolyl cis-trans 


16.92 1.00e-40 98-135 
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isomerase ! 




1031 


PR00153A 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature I 


14.60 2.71e-10 24-39 


1031 


PR00153B 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature II 


12.94 6.29e-17 53-65 


1031 


PR00153C 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature III 


10.79 2.64e-20 96-1 1 1 


1031 


PR00153D 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature IV 


10.73 L47e-15 111-123 


1031 


PR00153E 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature V 


8.39 1.43e-l3 1 z4- 1 


1032 


PR00010C 


Type II EGF-hke signature III 




1035 


IPB000822 


Zinc finger, C2H2 type 


14.0/ 1 .oye- lo z /H-zyy 


1035 


IPB000822 


Zinc finger, C2H2 type 


l4.o / z,zye-i / luz-iz/ 


1035 


IPB000822 


Zinc finger, C2H2 type 


14.0/ z.zye-i/ juz-jz / 


1035 


IPB000822 


Zinc finger, C2H2 type 


1 /i £7 o in a i q 1 on 01 ^ 
14. o/ J.//e-io iyu-ziD 


1035 


IPB000822 


Zinc finger, C2H2 type 


*\ A tCl 1 01 Q 1^/11/1 A^O 

14. o/ 3.oie-ir> 4i4-4^y 


1035 


IPB000822 


Zinc finger, C2H2 type 


14. 0/ 4. /je-ly Z1S-Z4J 


1035 


IPB000822 


Zinc fmger, C2H2 type 


14.6/ 4. /je-iy 44Z-4D / 


1035 


IPB000822 


Zinc finger, C2H2 type 


1 /I /C7 ^ f If 1 /CO 1 C7 

14.6/ D.3Ue-lD loZ-lo/ 


1035 


IPB000822 


Zinc finger, C2H2 type 


14.67 6.Z-)e-iy 3oo-4l l 


1035 


IPB000822 


Zinc finger, C2H2 type 


14.6/ O.Zje-ZU jj o-joj 


1035 


IPB000822 


Zinc finger, C2H2 type 


14.6/ 6.2)Ue-lj Z40-Z/1 


1035 


IPB000822 


Zmc finger, C2H2 type 


14.6/ 6. /ye-1 / 


1035 


IPB000822 


Zmc finger, C2H2 type 


14.67 7.UUe-ly 4/U-4yo 


1035 


IPB000822 


Zmc finger, C2H2 type 


14.6/ /.zie-lo 4yo-j/j 


1035 


IPB000822 


Zinc finger, C2H2 type 


14.6 / o.U /e- 1 / DZO-DD 1 


1035 


IPB000822 


Zmc finger, C2H2 type 


1 /I Q <f\a 1 Q 1 "XA 1 KQ 

14.6/ o.jUe-iy iJ4-iDy 


1035 


IPB001275 


DM DNA binding domain 


iy.i / i.34e-uy ZUO-Z4J 


1035 


IPB001275 


DM DNA binding domain 


ly. l / z. /ue-iu i /o-zi / 


1035 


IPB001275 


DM DNA binding domain 


ir\ 1*7 *1 f Co 1 A 0*7/1 /I 1 O 

19.1/ 3.DJe-lU D / 4-413 


1035 


IPB001275 


- mf _ _ _ f t 1 * 1* 1 * 

DM DNA binding domain 


in 1*7 /I nOo AO 1AfZ 75^ 

19. 1 / 4.Uoe-Uy 340-3OJ 


1035 


IPB001275 


DM DNA binding domain 


m it c n /" _ 1 A /ICO /1Q*7 

iy. 1 / D.ooe- iw 4Do-4y / 


1035 


IPB001275 


DM DNA binding domain 


1A 1*/ O */Qo 1A llO *i*\*7 

iy.1/ o. /oe- 1U 3 1 o-3D / 


1035 


mm mm wvk y/*^ ^mt gmy jmmt. Mmmt MmJ 

IPB001275 


DM DNA binding domain 


in 1 *7 Q 1 Ko 1 A OQfl ^OQ 

iy. l / y. iDe-iu zyu-jzy 


1035 


IPB001275 


DM DNA binding domain 


in 17 Q o Oa 1 A /ISA KO *\ 

iy.i / y.csoe- iu 4oo-jzj 


1035 


IPB001909 


KRAB box 


1*7 "2*7 *i 0< 0 *)Q <*; AH 
1 /.3 / 3.ooe-Z5 D-4U 


1035 


PR00048A 


C2H2-type zinc finger signature I 


A A/1 1 1 A 1 OAA 

y.y4 i.4ie-iu io/-zuu 


1035 


PR00048A 


C2H2-type zmc finger signature I 


O A/1 *> *)Q Q 1 1 1QQ Q 1 *> 

y.y4 z.zye-13 zyy-3iz 


1035 


PR00048A 


C2H2-type zinc finger signature I 


A A/l *) /lOo 11 101 1 i44 

y.y4Z.4ze-ii 131-144 


1035 


PR00048A 


C2H2-type zmc finger signature I 


AA/1*>iC/1« 1AOQO OQA 

y.y4 Z.64e-lU 3o3-3yo 


1035 


PR00048A 


C2H2-type zmc fmger signature I 


n r\ a 1 /ir„ 1A/111 /1*)/1 

y.y4 3.4r>e-iu 41 i-4/4 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 4.1oe-12 49d-DUo 


1035 


PR00048A 


C2H2-type zinc finger signature I 


rvr\/i /I 11 » 1*> COO CT/C 

9.94 4.21e-13 523-D36 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 5.09e-lU 5 j 1-D64 


1035 


PR00048A 


C2H2-type zinc finger signature I 


a A/i /~ i a „ 1 1 n o*7 O/IA 

9.94 6.14e-13 32/-34U 


1035 


PR0Q048A 


C/Zrlz-type zmc linger signature l 


0 Q4 6 1 4e-l 3 99-1 12 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.54e-14 215-228 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6. 5 4e- 14 467-480 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.82e-12 243-256 


1035 


PR00048A 


C2H2-type zinc fmger signature I 


9.94 7.92e-14 355-368 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 8.58e-ll 159-172 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 9.05e-l 1439-452 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 9.36e-13 271-284 
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1035 


PR00048B 


C2H2-type zinc finger signature II 


c *ri i nn<a aq aqi aqo 


1035 


PR00048B 


C2H2-type zmc finger signature II 


c r o r aa« n i k 11/1 


1035 


PR00048B 


C2H2-type zinc finger signature II 


c. Co c rn 0 in ina 110 
D.OZ D.DUe-lU ZU3-Z1Z 


1035 


PR00048B 


C2H2-type zinc finger signature II 


C CI C CAo 1 A OSS'? OQfi 

r>. DZ D.Due-iu zo/-zyo 


1035 


PR0004SB 


^-u -w- * /V * * TT 

C2H2-type zmc finger signature II 


C CI C crto 1 1 /ICC A&A 
D.Z>Z XDUe-H 4DJ-404 


1035 


PR00048B 


C2H2-type zmc finger signature II 


c co i CAa no it\ qqa 


1035 


PR00048B 


C2H2-type zinc finger signature II 


c co o aa** ao. i ah i 
j.jZ o.UUe-Uy 14/-130 


1035 


PR00048B 


C2H2-type zinc finger signature II 


C CI O CAo AO 1QO /1AQ 


1038 


IPB001902A 


Sulfate transporter 


10.29 9.57e-10 310-319 


1038 


IPB001902B 


Sulfate transporter 


21.91 3.43e-31 J4U-oyi) 


1038 


IPB001902C 


Sulfate transporter 


23.62 6.44e-25 429-480 


1038 


IPB002645B 


STAS domain 


13.24 1.43e-ll 179-193 


1038 


IPB0O2645B 


STAS domain 


1 ^ ^ A 1 ylO 11 /TOO 

13.24 1.43e-ll 622-636 


1039 


IPB000626 


Ubiquitin domain 


21.25 7.65e-09 56-110 


1041 


IPB000729A 


PMP-22/EMP/MP20 family 


19.70 9.05e-09 47-68 


1042 


IPB003530A 


Long hematopoietin receptor, soluble alpha 
chains 


23,64 3.01e-09 362-416 


1042 


PR01474F 


Vascular cell adhesion molecule- 1 (VCAM- 
1) signature VI 


14.81 9.48e-09 389-402 


1043 


IPB001526A 


Ly-6/u-PAR domain 


13.24 3.65e-13 73-88 


1043 


IPB001526C 


Ly-6/u-PAR domain 


13.04 6.82e-13 142-157 


1043 


IPB003605A 


GS motif preceding kinase domain in TGF 
beta receptor 


13.79 3.93e-09 149-160 


1047 


IPB000353B 


Class II histocompatibility antigen, beta 
chain, beta-1 domain 


19.16 5.99e-10 203-252 


1047 


IPB001039A 


Major histocompatibility complex protein, 
Class I 


17.17 1.00e-40 15-68 


1047 


IPB001039B 


Major histocompatibility complex protein, 
Class I 


27.55 1.00e-40 96-147 


1047 


IPB001039C 


Major histocompatibility complex protein, 
Class I 


19.82 1.00e-40 177-230 


1047 


IPB001039D 


Major histocompatibility complex protein, 
Class I 


16.49 4.43e-38 255-309 


1047 


IPB003006A 


Immunoglobulin and major 
histocompatibility complex domain 


17.51 3.84e-15 224-246 


1047 


IPB003006B 


— ill* l * 

Immunoglobulin and major 
histocompatibility complex domain 


20,23 o.Zje-Zj Zol-zys 


1047 


IPB003363E 


Glycoprotein GG/GX 




1051 


IPB000729D 


PMP-22/EMP/MP20 family 


i o o< *7 in 0 1 a 1 h*x onn 
lo.yo /.jue-iu l /j-zuu 


1052 


IPB001526C 


Ly-6/u-PAR domain 




1054 


PR00258B 


Speract receptor signature II 


'V A/l A CAa 1 ^ 'TIS HAQ 


1054 


PR00258C 


Speract receptor signature III 


O AC O Tin 1 A TCJ 

y.UD Z.JZe-LU /JJ-/OJ 


1054 


PR00258D 


Speract receptor signature IV 


i /i oo ^ /Ci Q AO *7QQ 

14. zy o.uje-uy /oj-/yy 


1054 


PR00258E 


Speract receptor signature V 


14. UO cS.4je-14 oUo-OZU 


1055 


IPB001429A 


ATP P2X receptor 


1 T 1 *7 1 Afto 1Q 10 
17.1/ i.UUe-oo IZ-DU 


1055 


IPB001429B 


ATP P2X receptor 


C jIA O 71a 11 /^1 1 1 

5.49 Z. / le-iz oz- / 1 


1055 


TTinAAi /ton/""' 
lPBU014Zyi_, 


Air r za receptor 


1Q Q6 5 68e-31 88-118 


1055 


IPB001429D 


ATP P2X receptor 


20.63 2.38e-19 143-164 


1055 


IPB001429E 


ATP P2X receptor 


15.32 1.43e-30 178-205 


1055 


IPB001429F 


ATP P2X receptor 


20.01 5.88e-30 248-277 


1055 


IPB001429G 


ATP P2X receptor 


9.93 2.64e-14 291-300 


1055 


IPB001429H 


ATP P2X receptor 


12.67 7.00e-34 309-335 


1055 


IPB001675B 


Glycosyltransferase family 29 


15.84 9.26e-09 339-361 


1055 


PRO 13 07 A 


P2X purinoceptor family signature I 


10.92 2.80e-12 82-90 
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1055 


PR01307B 


P2X purmoceptor family signature 11 


10.30 1.00e-15 Id /-loo 


1055 


PR01307C 


P2X purmoceptor family signature III 


10.17 3.77e-15 243-2DD 


1055 


PR01307D 


P2X purmoceptor family signature IV 


10.26 1.56e-15 290-30U 


1055 


PR01307E 


P2X purmoceptor family signature V 


15.20 2.29e-19 310-324 


1055 


PR01311A 


P2X4 purinoceptor signature I 


7.73 8.32e-10 1-8 


1055 


PR01311C 


P2X4 purinoceptor signature III 


9.00 1.90e-12 30-38 


1055 


PR01311D 


P2X4 purinoceptor signature IV 


12.87 l,12e-10 49-55 


1055 


PR01311E 


P2X4 purinoceptor signature V 


6.43 1.00e-09 139-146 


1055 


PR01311F 


P2X4 purinoceptor signature VI 


9.15 1.15e-10 208-216 


1055 


PR01311G 


P2X4 purinoceptor signature VII 


11.74 8.00e-14 280-291) 


1055 


PR01311H 


^ • * j T 7" Y Y "¥" 

P2X4 purmoceptor signature VIII 


12.12 1.00e-12 303-312 


1055 


PR01311I 


P2X4 purinoceptor signature IX 


5.25 1.00e-ll 359-366 


1059 


PR00854A 


Prostaglandin D receptor signature I 


15.24 9.08e-18 6-20 


1059 


PR00854B 


Prostaglandin D receptor signature II 


7.30 8.15e-21 41-58 


1059 


PR00854C 


Prostaglandin D receptor signature III 


12.92 1.64e-12 93-104 


1059 


PR00854D 


Prostaglandin D receptor signature IV 


9.41 2.50e-18 185-200 


1059 


PR00854E 


Prostaglandin D receptor signature V 


10.50 4.65e~26 236-259 


1059 


PR00854F 


Prostaglandin D receptor signature VI 


12.83 9.68e-ll 321-332 


1059 


PR00854G 


Prostaglandin D receptor signature VII 


10.66 L78e-18 341-357 


1059 


PR00854H 


Prostaglandin D receptor signature VIII 


14.71 6.20e-17 369-389 


1059 


PR00856E 


Prostacyclin (prostanoid IP) receptor 
signature V 


9.82 1.72e-09 178-194 


1060 


IPB001898A 


Sodium: sulfate symporter family 


13.06 5.30e-21 49-72 


1060 


IPB001898B 


Sodiumisulfate symporter family 


13.18 8.41e-22 88-114 


1060 


IPB001898C 


Sodium: sulfate symporter family 


9.97 1.00e-23 132-151 


1060 


IPB0O1898D 


Sodium: sulfate symporter family 


12.13 1.00e-21 208-232 


1060 


IPB001898E 


Sodium:sulfate symporter family 


13.97 6.76e-21 248-272 


1060 


IPB001898F 


Sodium:sulfate symporter family 


18.66 6.04e-20 309-335 


1060 


IPB001898G 


Sodium: sulfate symporter family 


16.36 8.36e-24 405-434 


1060 


IPB001898H 


Sodium: sulfate symporter family 


21.26 1.00e-40 486-534 


1063 


IPB001039A 


Major histocompatibility complex protein, 
Class I 


17.17 8.89e-ll 19-72 


1064 


IPB001171A 


Ergosterol biosynthesis ERG4/ERG24 
enzymes 


11.99 1.55e-10 11-24 


1064 


IPB001171B 


Ergosterol biosynthesis ERG4/ERG24 
enzymes 


14.81 6.88e-15 166-178 


1064 


IPB001171C 


Ergosterol biosynthesis ERG4/ERG24 
enzymes 


26.44 4.75e-33 181-217 


1064 


IPB001171D 


Ergosterol biosynthesis ERG4/ERG24 
enzymes 


20.69 1.00e-40 241-282 


1064 


IPB001171G 


Ergosterol biosynthesis ERG4/ERG24 
enzymes 


25.35 1.00e-40 298-350 


1065 


IPB003388E 


Reticulon 


14.24 6.29e-09 930-972 


1066 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 2.04e-09 264-301 


1067 


IPB000276A 


Rhodopsin-like GPCR superfamily 


11.56 5.20e-12 105-116 


1067 


PR00237C 


Rhodopsin-hke GPCR superfamily 
signature III 


14.77 6. /3e-ll yi-113 


1067 


PR00237E 


Rhodopsin-like GPCR superfamily 
signature V 


13.03 4.77e-10 175-198 


1067 


PR00245A 


Olfactory receptor signature I 


10.98 3.65e-09 79-90 


1067 


PR00245B 


Olfactory receptor signature II 


13.73 4.60e-09 116-128 


1067 


PR00245D 


Olfactory receptor signature IV 


9.34 1.53e-13 212-221 


1067 


PR00245E 


Olfactory receptor signature V 


8.96 3.30e-10 259-270 
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1067 


PR00534A 


Melanocortin receptor family signature I 


12.71 8. 43 e- 09 38-50 


1069 


IPB000095E 


PAK-box /P21-Rho-binding 


17.62 3.52e-l 1410-455 


1069 


IPB000961C 


Protein kinase C-terminal domain 


15.48 9.39e-14 409-443 


1069 


IPB001245A 


Tyrosine kinase catalytic domain 


22.45 6.54e-15 402-442 


1069 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 6.18e-09 467-505 


1069 


IPB003527C 


MAP kinase 


14.70 7. 37e- 10 394-442 


1069 


IPB003605A 


GS motif preceding kinase domain in TGF 
beta receptor 


13.79 2.00e-13 92-103 


1069 


IPB003605C 


GS motif preceding kinase domain in TGF 
beta receptor 


14.92 1.00e-40 302-349 


1069 


IPB003605D 


GS motif preceding kinase domain in TGF 
beta receptor 


12.41 1.00e-40 398-439 


1069 


IPB003605E 


GS motif preceding kinase domain in TGF 
beta receptor 


21.14 1.64e-40 441-480 


1069 


IPB003605F 


GS motif preceding kinase domain in TGF 
beta receptor 


23.21 1.00e-40 504-557 


1069 


PR00109E 


Tyrosine kinase catalytic domain signature 
V 


12.99 5.58e-09 549-571 


1070 


IPB000729C 


PMP-22/EMP/MP20 family 


37.83 1.71e-09 80-132 


1070 


IPB000729D 


PMP-22/EMP/MP20 family 


18.96 9.33e-10 156-183 


1071 


IPB000353B 


Class II histocompatibility antigen, beta 
chain, beta-1 domain 


19.16 7.52e-14 201-250 


1071 


IPB001039A 


Major histocompatibility complex protein, 
Class I 


17.17 2.80e-26 14-67 


1071 


IPB001039B 


Major histocompatibility complex protein, 
Class I 


27.55 5.50e-23 90-141 


1071 


IPB001039C 


Major histocompatibility complex protein, 
Class I 


19.82 7.75e-16 176-229 


1071 


IPB001039C 


Major histocompatibility complex protein, 
Class I 


19.82 7.94e-23 175-228 


1071 


IPB001039D 


Major histocompatibility complex protein, 
Class I 


16.49 7.43e-29 254-308 


1071 


IPB003006A 


Immunoglobulin and major 
histocompatibility complex domain 


17.51 6. 82e- 17 222-244 


1071 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 6.40e-19 260-297 


1072 


IPB002146 


ATP synthase B/F CF(0) 


21.39 6.94e-09 188-226 


1072 


PR00308B 


Type I antifreeze protein signature II 


3.38 8.99e-09 3-14 


1072 


PR00308C 


Type I antifreeze protein signature III 


2.79 6.92e-09 3-12 


1072 


PR0O3O8C 


Type I antifreeze protein signature III 


2.79 9.59e-10 4-13 


1072 


PR0O3O8C 


Type I antifreeze protein signature III 


2.79 9.59e-10 5-14 


1072 


PR00698E 


C.elegans Srg family integral membrane 
protein signature V 


14.65 2.76e-09 111-136 


1072 


PR01511D 


Kvl.4 voltage-gated K+ channel signature 
IV 


3.91 2.29e-09 4-14 | 


1072 


PR01511D 


Kvl.4 voltage-gated K4- channel signature 
IV 


3.91 2.88e-09 5-15 


1073 


IPB002617C 


Vesicular monoamine transporter 


16.23 5.03e-10 198-250 


1073 


PR01035E 


Tetracycline resistance protein signature V 


10.87 1.00e-08 208-230 


1073 


PR01035F 


Tetracycline resistance protein signature VI 


12.88 l.OOe-08 499-519 


1074 


IPB002617C 


Vesicular monoamine transporter 


16.23 5.03e-10 286-338 


1074 


PR01035E 


Tetracycline resistance protein signature V 


10.87 1.00e-08 296-318 


1074 


PR01035F 


Tetracycline resistance protein signature VI 


12.88 1.00e-08 587-607 


1075 


IPB000906A 


ZU5 domain 


22.49 3.72e-09 58-100 
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1075 


IPB000906G 


ZU5 domain 


25.85 7.95e-l0 164-212 


1075 


PR00806C 


Vinculin signature III 


10.57 8.56e-09 350-367 


1076 


PR00237C 


Rhodopsin-like GPCR superfamily 
signature III 


14.77 3.25e-09 188-210 


1076 


PR00237E 


Rhodopsin-like GPCR superfamily 
signature V 


13.03 4.86e-09 276-299 


1076 


PR00237F 


Rhodopsin-like GPCR superfamily 
signature VI 


14.34 1.25e-10 322-346 


1077 


IPB003662A 


General substrate transporters 


18.97 2.89e-09 139-171 


1079 


IPB000101A 


Gamma-glutamyltranspeptidase 


29.14 3.45e-14 352-391 


1079 


PR01210A 


Gamma-glutamyltranspeptidase signature I 


17.15 1.50e-13 130-155 


1080 


IPB000684L 


Eukaryotic RNA polymerase II 
heptapeptide repeat 


3.49 8.35e-09 1357-1399 


1080 


IPB001237C 


43 Kd postsynaptic protein 


5.44 4.63e-ll 28-80 


1080 


IPB001359H 


Synapsin 


22.58 8.69e-09 1371-1421 


1081 


IPB002688A 


Sterol/diacylglycerol O-acyltransferase 


19.24 8.65e-20 259-281 


1081 


IPB002688B 


Sterol/diacylglycerol O-acyltransferase 


22.16 2.50e-25 308-347 


1081 


IPB002688C 


Sterol/diacylglycerol O-acyltransferase 


22.98 3.13e-20 321-368 


1081 


IPB002688D 


Sterol/diacylglycerol O-acyltransferase 


24.36 3.57e-33 380-415 


1081 


IPB002688E 


Sterol/diacylglycerol O-acyltransferase 


13.67 2,50e-18 433-453 


1082 


PRO 1547 A 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature I 


10.65 6.40e-24 53-74 


1082 


PR01547B 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature II 


11.11 2.38e-19 97-116 


1082 


PR01547C 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature III 


9.50 5.94e-23 118-139 


1082 


PR01547D 


Saccharomyces cerevisiae !75.8kDa 
hypothetical protein signature IV 


12.09 5.50e-29 144-169 


1082 


PR01547E 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature V 


10.42 1.95e-26 171-195 


1082 


PR01547F 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature VI 


11.64 5.95e-25 258-278 


1082 


PR01547G 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature VII 


7.91 1.00e-36 298-325 


1082 


PR01547H 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature VIII 


9.70 1.00e-30 333-357 


1082 


PR01547I 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature IX 


15.37 7.94e-28 408-438 


1082 


PRO 1547 J 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature X 


14.06 3.15e-23 439-460 


1082 


PR01547K 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature XI 


13.61 7.87e-23 462-481 


1082 


PR01547L 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature XII 


10.35 6.29e-19 486-502 


1082 


PR01547M 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature XIII 


1 a n/f o A A ^ 11 1 no 11 

10.06 3.44e-12 117o-119o | 


1083 


IPB000095E 


PAK-box /P21-Rno- binding 


I / . OZ» J. 1J C~ JU / » 1 


1083 


IPB000961C 


Protein kinase C-terminal domain 


15.48 4.66e-14 366-400 


1083 


IPB001245A 


Tyrosine kinase catalytic domain 


22.45 5.50e-14 359-399 


1083 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 6.18e-09 424-462 


1083 


IPB001526C 


Ly-6/u-PAR domain 


13.04 1.00e-08 105-120 


1083 


IPB003527C 


MAP kinase 


14.70 8.81e-10 351-399 


1083 


IPB003605A 


GS motif preceding kinase domain in TGF 
beta receptor 


13.79 1.32e-ll 112-123 
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1083 


IPB003605B 


GS motif preceding kinase domain in TGF 
beta receptor 


10.56 5,00e-28 229-255 


1083 


IPB003605C 


GS motif preceding kinase domain in TGF 
beta receptor 


14.92 1.00e-40 259-306 


1083 


IPB003605D 


GS motif preceding kinase domain m TGF 
beta receptor 


12.41 1.00e-40 355-396 


1083 


IPB003605E 


GS motif preceding kinase domain in TGF 
beta receptor 


21.14 8.88e-39 398-437 


1083 


IPB003605F 


GS motif preceding kinase domain m TGF 
beta receptor 


23.21 6.81e-40 461-514 


1083 


PR00109E 


Tyrosine kinase catalytic domain signature 
V 


12.99 2.15e-09 535-55 / 


1083 


PR00653D 


Activin type II receptor signature IV 


12.01 6.70e-ll 385-406 


1084 


IPB000095E 


PAK-box /P21-Rho-bindmg 


17.62 3.15e-12 429-4/4 


1084 


IPB000961C 


Protein kinase C-terminal domain 


15.48 4.66e- 14 428-462 


1084 


IPB001245A 


Tyrosine kinase catalytic domain 


22.45 5. 50e-14 421-461 


1084 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 6.18e-09 486-524 


1084 


IPB001526C 


Ly-6/u-PAR domain 


13.04 1.00e-08 105-120 


1084 


IPB003527C 


MAP kinase 


14.70 8.81e-10 413-461 


1084 


IPB003605A 


GS motif preceding kinase domain in TGF 
beta receptor 


13.79 1.32e-ll 112-123 


1084 


IPB003605B 


GS motif preceding kinase domain in TGF 
beta receptor 


10.56 5.00e-28 291-317 


1084 


IPB003605C 


GS motif preceding kinase domain in TGF 
beta receptor 


14.92 1.00e-40 321-368 


1084 


IPB003605D 


GS motif preceding kinase domain in TGF 
beta receptor 


12.41 1.00e-40 417-458 


1084 


IPB003605E 


GS motif preceding kinase domain in TGF 
beta receptor 


21.14 8.88e-39 460-499 


1084 


IPB003605F 


GS motif preceding kinase domain in TGF 
beta receptor 


23.21 6.81e-40 523-576 


1084 


PR00109E 


Tyrosine kinase catalytic domain signature 
V 


12.99 2.15e-09 597-619 


1084 


PR00653D 


Activin type II receptor signature IV 


12.01 6.70e-ll 44/-4O0 


1085 


PR00541C 


Muscarinic M4 receptor signature III 


8.06 7.73e-09 450-3U6 


1086 


PR00541C 


Muscarinic M4 receptor signature III 


8.06 7.73e-09 755-775 


1087 


PR01430A 


Protease activated receptor 2 precursor 
signature I 


8.72 8.20e-09 123-134 


1088 


IPB000322A 


Glycoside hydrolase family 31 


19.86 8.13e-23 151-176 


1088 


IPB000322B 


Glycoside hydrolase family 31 


14.24 9.64e-10 186-201 


1088 


IPB000322C 


Glycoside hydrolase family 3 1 


22.80 3.57e-20 225-250 


1088 


IPB000322D 


Glycoside hydrolase family 31 


17.98 6.57e-27 326-352 


1088 


IPB000322E 


Glycoside hydrolase family 31 


25.60 9.10e-33 366-407 


1090 


IPB000135A 


High mobility group proteins HMG1 and 
HMG2 


11.69 7.66e-14 707-761 


1090 


IPB000135B 


High mobility group proteins HMG1 and 
HMG2 


13.24 5.68e-21 680-724 


1090 


PR00886A 


High mobility group (HMG1/HMG2) 
protein signature I 


10.57 4.46e-10 710-732 


1090 


PR00886C 


High mobility group (HMG1/HMG2) 
protein signature III 


11.20 6.28e-13 696-714 


1090 


PR00887A 


Structure-specific recognition protein 
signature I 


12.71 1.82e-21 343-359 


1090 


PR00887B 


Structure-specific recognition protein 


10.24 1.35e-20 365-381 
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signature II 




1090 


PR00887C 


Structure-specific recognition protein 
signature III 


13.32 1.47e-21 388-404 


1090 


PR00887D 


Structure-specific recognition protein 
signature IV 


16.42 2.73e-15 453-466 


1090 


PR00887E 


Structure-specific recognition protein 
signature V 


10.75 3.40e-20 480-498 


1 AAA 

1090 


PR00887F 


Structure-specific recognition protein 
signature VI 


14.12 6.46e-21 498-515 


1 AOA 


TJT) AAO 07r 

rKOUoo /Cj 


Structure-specific recognition protem 
signature vii 


13.65 7.00e-18 521-537 


1 ooo 


PPOOfi°.7W 


otructure-speciiic recognition protein 
signature vin 


12.27 o.53e-2U 537-555 


1001 


TPR00091 o" A 

XT -E3UUVJZ. 1 J>J^A. 


oerpms 


1J.UI 4. //e-i3 


1 001 


TPR00001 


oerpms 


y.o / y.ooe-uy 44y-4o i 


1091 


IPB000215C 


Serpins 


13,90 7. lle-1 1476-490 


iuy i 


TDD A A AO 1 CT\ 


Serpins 


15.35 3. 8Se-16 578-604 




TDDAAAO 1 CT? 


Serpins 


15.36 6.21C-16 659-683 


i ooo 
LVyZ 


IPB000126A 


Serine proteases, V8 family 


1 1.75 2.88e-10 357-372 


1 AQO 

iuyz 


IPB000126B 


Serine proteases, V8 family 


12.50 1.51e-ll 471-487 




PR00834D 


HtrA/DegQ protease family signature IV 


11,75 6.23e-09 466-483 


iuyz 


PR00839B 


V8 serine protease family signature II 


11.20 8. 12e-10 357-374 


iuy / 


IPB000454 


Eubacterial and plasma membrane ATP 
synthase subunit C 


27.64 2.50e-28 79-132 


1 0Q7 


PR00124A 


ATP synthase C subunit signature I 


O A /l /I^T— lO A/1 

o.oy 4.46e-13 75-94 


1 OQ7 


PR00124B 1 ATP synthase C subunit signature II 


1/1 /loo i /i „ 11 n^ - 111 

14.4/ 8. 14e-ll 96-111 


1 0Q7 


PR00124C 


ATP synthase C subunit signature III 


10 oo /r^ro^. n i n 100 

12.82 6.63e-17 113-138 


1 OQS 


IPB000573 


Aconitase C-terminal domain 


OC OA /I O ZT „ O J A/IO AAzT 

25.70 4.8oe-34 947-996 


1 OQS 


IPB001030A 


Aconitase (aconitate hydratase) 


OO CO 1 1 ^7 _ 1 0 /ICO f AA 

37.52 1.17e-13 452-500 


1 HQS 


IPB001030A 


Aconitase (aconitate hydratase) 


OO CO C A/T „ Ol OAO /I /I A 

3/. 52 5.0oe-23 392-440 


1 0Q£ 


IPB001030B 


Aconitase (aconitate hydratase) 


1 O OO /I Tin 1 O /ZOO 


1 005? 


IPB001030C 


Aconitase (aconitate hydratase) 


1 Q l/TQ/CC^ IOOIA OO A 


10QS 

1VJ.70 


PR00415A 


Aconitase family signature I 


1*2 OC O nn a 1 A tt^ 

lo.ZD /.UUe-lU 3zj-JJd 


1098 


PR00415C 


Aconitase family signature III 


14.58 2.75e-13 376-389 


1 OQ8. 


PR00415D 


Aconitase family signature IV 


1 /i co 0 m« 1 c onn Af\C 

14.53 2.93e-15 390-405 


1098 


PR00415E 


Aconitase family signature V 


9.37 1.53e-12 452-465 


1 A AO 


PR00415F 


Aconitase family signature VI 


11.36 6.14e-12 466-479 


1098 


PR00415G 


Aconitase family signature VII 


14.02 2.35e-14 548-562 


1 A AO 

1098 


PR00415H 


Aconitase family signature VIII 


13.87 4.79e-12 613-624 


1 AAO 

1098 


PR00415I 


Aconitase family signature IX 


14.82 8.43e-15 675-688 


1099 


PR01088H 


Na+/H+ exchanger isoform 6 (NHE6) 
signature VIII 


16.89 1.00e-08 92-109 


1100 


PR01088H 


Na+/H+ exchanger isoform 6 (NHE6) 
signature VIII 


16.89 1.00e-08 188-205 


1 1 A 1 

1101 


PR01088H 


Na+/H+ exchanger isoform 6 (NHE6) 
signature VIII 


16.89 1.00e-08 142-159 


110? 


PR00259D 


Transmembrane four family signature IV 


1/1 OH O /lie. AO /CC OI 

i4.yu z.43e-uy oD-yi 


1104 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 8.56e-10 124-161 


1104 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 9.65e-09 227-264 


1106 


IPB001983A 


Translation ally controlled tumor protein 


14.49 1.43e-25 1-27 


1106 


IPB001983B 


Translationally controlled tumor protein 


15.99 4.43e-20 45-70 


1106 


IPB001983C 


Translationally controlled tumor protein 


16.91 1.41e-24 83-109 
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1106 


IPB001983D 


Translationally controlled tumor protein 


22.98 4.09e-28 123-163 


1107 


IPB000117D 


Kappa casein 


10.18 8.7 le-09 513-547 


1108 


IPB001737A 


Ribosomal RNA adenine dimethylase 


21 A\ 1.86e-09 110-155 


1109 


IPB002048 


EF-hand family 


7.91 1.00e-09 42-54 


1110 


IPB000272 


ATP 1 G 1/PLM/M ATS family 


14.36 8.64e-23 99-134 


1110 


IPB003403A 


Herpesvirus immediate early protein 


21.25 6.59e-09 39-61 


1112 


PR00261B 


Low density lipoprotein (LDL) receptor 
signature II 


15.12 6.29e-09 127-148 


1112 


PR00261D 


Low density lipoprotein (LDL) receptor 
signature IV 


16.87 7.09e-09 127-148 


1112 


PR00261F 


Low density lipoprotein (LDL) receptor 
signature VI 


15.46 3.91e~09 127-148 


1112 


PR00764B 


Complement C9 signature II 


12.47 1.3oe-ll 122-142 


1115 


IPB000276A 


Rhodopsin-hke GPCR superfamily 


11.56 4.32e-10 216-227 


1115 


PR00245A 


Olfactory receptor signature I 


10.98 o.70e-10 190-201 


1115 


PR00245B 


Olfactory receptor signature II 


13.73 2.03e-09 


1115 


PR00245C 


Olfactory receptor signature III 


14.65 3.54e-l(J z/4-zyi) 


1116 


IPB000276A 


Rhodopsm-like GPCR superfamily 


11.56 o.3oe-09 5-lo 


1116 


IPB000276D 


Rhodopsin-like GPCR superfamily 


9.40 6.19e-09 355-371 


1116 


PR00245A 


Olfactory receptor signature I 


10.98 3.16e-ll 165-176 


1116 


PR00245B 


Olfactory receptor signature II 


13.73 1.77e-09 16-28 


1116 


PR00245B 


Olfactory receptor signature II 


13.73 9.07e-10 202-214 


1116 


PR00245C 


Olfactory receptor signature III 


14.65 2.96e-15 249-265 


1116 


PR00245D 


Olfactory receptor signature IV 


9.34 8.58e-12 309-318 


1116 


PR00245E 


Olfactory receptor signature V 


8.96 2.20e-15 356-367 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 1.92e- 10 300-315 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 1.92e-10 565-580 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 2.83e-ll 1213-1228 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 3.90e-09 143-158 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 4.77e-09 448-463 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 7.83e-ll 1336-1351 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 8.02e-10 1093-1108 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 8.92e-12 964-979 


1119 


IPB001928 


Endothelin and related toxins 


14.31 3.00e-09 949-972 


1119 


IPB002861B 


Reeler domain 


10.50 5.95e-l 1 970-998 


1119 


IPB002861C 


Reeler domain 


23.17 1.00e-12 957-1011 


1119 


IPB002861C 


Reeler domain 


23.17 3.45e-09 136-190 


1120 


IPB000079 


Nucleosomal binding domain found in 
HMG14 and HMG17 


8.09 1.64e-37 18-48 


1120 


PR00925A 


Nonhistone chromosomal protein HMG17 
family signature I 


5.67 2.29e-19 18-32 


1120 


PR00925B 


Nonhistone chromosomal protein HMG17 
family signature II 


3.75 3. 18e- 16 34-46 


1120 


PR00925C 


Nonhistone chromosomal protein HMG17 
family signature III 


5.23 7.45e-09 47-57 


1120 


PR00925D 


Nonhistone chromosomal protein HMG17 


7.38 2.00e-14 66-76 
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family signature IV 




1121 


PR00219C 


i-i j 1 • • , TTT 

Synaptobrevm signature III 


o.yi i.joe-uy oy-iuo 


1124 


IPB001304A 


C-type lectin domain 


•\n no o ooa 10 orvQ oti 
1 /.yo o.ooe-iy / 


1125 


IPB002896F 


Herpesvirus glycoprotein D 




1125 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 5.15e-09 323-360 


1126 


PR01007I 


Flagellar hook-length control protein 
signature IX 


o oa n £n fl rvn /Kr? /ton 


1131 


IPB000008D 


C2 domain 


14.83 3.05e-ll 1008-1026 


1131 


IPB000738A 


WHEP-TRS domain 


/r^7 o m„ AA 1AZTO 1 AQO 


1131 


PR00360A 


C2 domain signature I 


15. lo o.3oe-uy ooz-oy4 


1132 


IPB001135C 


NADH-ubiqumone oxidoreductase 49Kd 
chain 


25. 4o y. /ye-uy /z-lzz 


1133 


IPB000276A 


Rhodopsin-hke GPCR superfamily 


I 1 O OAo 10 101 1 QO 

I I ,jo Z.ZVG- Iz lol-iyz 


1133 


IPB000276D 


Rhodopsin-hke GPCR superfamily 


y.4U o.uye-iu 044-oou 


1133 


PR00237G 


Rhodopsin-hke GPCR superfamily 

' i t 7-TT 

signature VII 


iy.2J /.3 /e- 1U 5o4-JOU 


1133 


PR00245A 


Olfactory receptor signature I 


1 a no $? <:n*=» lfi 


1133 


PR00245B 


Olfactory receptor signature II 


u./j o.zue-10 lyz-zu^t 


1133 


PR00245C 


Olfactory receptor signature III 


I^-.OD lUc-lj Zjy-Zjj 


1133 


PR00245D 


Olfactory receptor signature IV 


A 04 7 19 9QQ 707 


1133 


PR00245E 


Olfactory receptor signature V 


S A 79>-19 7d^-7^f^ 


H34 


IPB000425A 


M1P tamiiy 


Q 79 7 77p_1 1 09-1 07 

y.J / /.JJO-ll 7Z~1U/ 


H34 


IPB000425B 


MIP iamily 


99 Qzt 1 7Sp 71 197-17^ 


H34 


IPB000425C 


MIP family 


1 O ^7 7 9R/=» 1 A 9zin_9^zt 


H34 


IPB000425D 


MIP family 


o in q £Jle> 1 707-79^ 


H34 


PR00783A 


Major intrinsic protein family signature I 


1 1 529 ^ ^n<=» 1 A RS-1 07 
11. OZ ^.DUc-l^r Oo-lU/ 


H34 


PR00783B 


Major intrinsic protein tamiiy signature ll 


7/1 1 £7f=»-1Q 197-1S1 


H34 


PR00783C 


Major intrinsic protein iamily signature ill 


17 79 9 Hf\f* 101 AA- 1 S27 

Ij.jZ z. /oe-iu lOH-ioj 


H34 


PR00783E 


Major intrinsic protein family signature V 


1^9^ R lip OQ 198^1^0 


H34 


PR00783E 


Major intrinsic protein iamily signature v 


1^9^ S 17^-11 99^-94.5? 
ID.Zj O. 1 JC"1 1 zzu-zto 


H34 


PR00783F 


Major intrinsic protein family signature VI 


12.30 1.53e-14 308-328 


H36 


IPB002897A 


Monocarboxylate transporter 


K 1 9 A QAo 97 152 zl7 

lo. iz 4.yue-zj 10-4/ 


H36 


IPB002897B 


Monocarboxylate transporter 


24.20 4.30e-32 72-126 


1136 


IPB002897C 


Monocarboxylate transporter 


13. Oj 4.4oe-iJ 143-10S 


H36 


IPB002897C 


Monocarboxylate transporter 


13.63 7.38e-12 177-202 


1136 


IPB002897D 


Monocarboxylate transporter 


■*)A A A r Of „ on OA/C O /l o 

30.44 5.85e-27 296-345 ; 


H36 


IPB002897E 


Monocarboxylate transporter 


32.06 9.59e-24 407-458 


H37 


IPB000454 


Eubacterial and plasma membrane ATP 
synthase subunit C 


27.64 9.85e-ll 94-147 


H37 


PR00122A 


Vacuolar ATP synthase lokDa subunit 
signature I 


10.78 6.00e~18 30-54 


H37 


PR00122B 


Vacuolar ATP synthase 16kDa subunit 
signature II 


10.11 4.32e-26 56-80 


H37 


PR00122C 


^— — mi r^r^ ¥ ^ J 1 1 ^^1 ^F^^W 1 * 1 

Vacuolar ATP synthase lokDa subunit 
signature III 


10.28 9.40e-32 104-130 


1137 


PRUUl zzD 


Vacuolar Air synxnase i oicua suounit 
signature IV 


1 0 07 7 00p-9R 1 ^1 -1 S4 

1VJ.U0 /.WC"ZO IJl 


1137 


PR00124C 


ATP synthase C subunit signature III 


12.82 7.58e-ll 128-153 


1138 


IPB002220A 


Dihydrodipicolinate synthetase 


16.79 9.29e-09 41-53 


1138 


IPB002220B 


Dihydrodipicolinate synthetase 


14.56 6.84e-14 68-89 


1138 


IPB002220C 


Dihydrodipicolinate synthetase 


25.64 1.23e-24 108-155 


1138 


IPB002220D 


Dihydrodipicolinate synthetase 


10.31 6.56e-10 164-175 


1138 


IPB002220F 


Dihydrodipicolinate synthetase 


12.81 6. 14e- 10 234-247 
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1138 


PR00146D 


Dihydrodipicohnate synthase signature IV 


1 A QQ Q '7Qa i 111 ^"3 1 CH 


1139 


IPB001734A 


Sodiumrsolute symporter family 


14.69 5.76e~16 51-67 


1139 


IPB001734B 


Sodiumrsolute symporter family 


10 O/C £L £11* T7 1 £L£L 1f\£ 

18,26 o.oie-2/ loo-zUo 


1139 


IPB001734C 


Sodiumisolute symporter family 


lo.21 7.75e-23 yz4-yjj 


1140 


IPB000074C 


Apolipoprotein A1/A4/E 


oi i ro„ An TOf AT} 

22.23 1.5oe-U9 jod-4zZ 


1140 


IPB002607B 


Hydratase/decarboxylase 


29. DU 3.o4e-iU Z4z-zyu 


1140 


PR00122B 


Vacuolar ATP synthase 16kDa subunit 
signature II 


10.11 7.56e-09 493-517 


1142 


IPB000897A 


GTP-binding signal recognition particle 
f SRP54} domain 


9.15 9.87e-09 611-630 


1143 


IPB000219A 


Dbl domain (dbl/cdc24 rhoGEF family) 


10.71 3.00e-09 156-165 


1143 


IPB000276C 


Rhodopsin-like GPCR superfamily 


8.03 1.41e-10 278-289 


1143 


IPB000276D 


Rhodopsin-like GPCR superfamily 


9.40 2.17e-10 324-340 


1143 


PR00237A 


Rhodopsin-like GPCR superfamily 
signature I 


9.81 8.92e-ll 63-87 


1143 


PR00237B 


Rhodopsin-like GPCR superfamily 
signature II 


12.45 4.08e-09 96-117 


1143 


PR00237C 


Rhodopsin-like GPCR superfamily 
signature III 


14.77 5.50e-12 140-162 


1143 


PR00237E 


Rhodopsin-like GPCR superfamily 
signature V 


13.03 8.46e-09 226-249 


1143 


PR00237F 


Rhodopsin-like GPCR superfamily 
signature VI 


14.34 9.33e-13 268-292 


1143 


PR00237G 


Rhodopsin-like GPCR superfamily 
signature VII 


19.23 6.04e-15 314-340 


1143 


PR01157A 


P2 purinoceptor signature I 


9.05 3.52e-ll 110-122 


1144 


PR00049D 


Wilm's tumour protein signature IV 


0.00 2.07e-09 7-21 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 1.1 0e- 11 266-290 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 1.92e-ll 265-289 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2. 13 2. 23 e- 11 270-294 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 3.00e-09 262-286 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 3.25e-10 271-295 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 4.55e-09 272-296 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 5.09e-09 260-284 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 7.07e-10 267-291 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 7.09e-09 259-283 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMCr2 


2.13 7.40e-12 268-292 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 8.63e-10 263-287 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 8.98e-l 1 269-293 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 9.24e-12 264-288 


1145 


IPB000637B 


HMG-I and HMG-Y DNA-binding domain 


14.21 8.64e-09 272-290 



WO 03/054152 PCT/US02/39555 



Table 3B 
387 



SEQ 

ID 

NO: 


Database 
entry ID 


Description 


Results* 






(A+T-hook) 




1145 


IPB001202B 


WW / rsp5 / WWP domain 




1145 


IPB001422C 


Neuromodulin (GAP-43) 


16.82 8.65e-09 258-293 


1145 


IPB001580F 


Calreticuhn family 


2.93 8.65e-09 277-286 


1145 


PR00403A 


WW domain signature I 


19.41 5.03e-10 373-3oo 


1145 


PR00403B 


WW domain signature II 


13.50 5.33e-15 387-401 


1148 


PR00360B 


C2 domain signature II 


11.64 1.29e-ll 279-292 


1150 


IPB000685G 


Ribulose bisphosphate carboxylase, large 
chain 


10.38 5.23e-09 45-99 


1150 


PRO 1237 A 


Tumour necrosis factor c (lympnotoxm- 
beta) signature I 


<C O A O T7« AA 10 /CI 


1152 


IPB000276A 


Rhodopsin-hke GPCR superramuy 


1 1 T ^lo 1A OAQ 

11. Do /.oje-iu zjo-z^y 


1152 


PR00245A 


Olfactory receptor signature I 


in no i in a in OT) OO'Z 


1152 


PR00245B 


AIT j . • „ . . TT 

Olfactory receptor signature 11 


lo. D.Due-ii z^+y-zoi 


1152 


PR00245C 


Olfactory receptor signature 111 


l4.oD o.o/e-iD zyo-jiz 


1152 


PR00245E 


j. x „• i . XT- 
Olfactory receptor signature V 


o.yo z.U4e-i j ^-Uj-^-i'i- 


1152 


PR00534A 


Melanocortin receptor family signature I 


1 O '7'7 *7 <0» AO 1 *71 1 CI 


1153 


IPB001680 


G-protem beta WD-40 repeats 


1A /II 1 AAo 11 A^/^ zl^T 

1U.43 l.UUe-11 400-40/ 


1153 


PR00320B 


G protein beta WD-40 repeat signature II 


in *7 <^e> 11 AKA 
IZ.oZ /.DDe-11 434-405 


1153 


PR00320C 


G protein beta WD-40 repeat signature lil 


IZ.oZ 4.4oe-ll 4D4-4-00 


1153 


PR00962A 


Lethal(2) giant larvae protein signature l 


iz.yi j.j / e-zz i /-oj 


1153 


PR00962B 


Lethal(2) giant larvae protein signature 11 


io ai q co p oq ^in 


1153 


PR00962C 


Lethal(2) giant larvae protein signature III 


o.U3 D.4Ue-Zo JOZ-jftZ 


1153 


PR00962D 


Lethal(2) giant larvae protein signature IV 


Q 01 A 1 9Q A<1 A7A. 

y.yi 4.ioe-zo 4D 1-4/4 


1153 


PR00962E 


Lethal(2) giant larvae protein signature V 


0.0 1 j.o/e-zi Dji-04y 


1153 


PR00962F 


Lethal(2) giant larvae protein signature VI 


13.40 8.50e-24 568-587 


1153 


PR00962G 


Lethal(2) giant larvae protein signature VII 


1 a c 1 C C A * OA /CAA /CQ Q 

14.51 6.54e-z9 ovJy-o33 


1153 


PR00962H 


Lethal(2) giant larvae protein signature VIII 


13.53 3.5ie-23 o3y-0Do 


1153 


PR00962I 


Lethal(2) giant larvae protein signature IX 


12.48 5.13e-23 708-727 


1155 


IPB002027A 


Amino acid permease 


18.88 8.82e-ll 185-213 


1155 


IPB002027C 


Amino acid permease 


19.67 8.31e-21 385-423 


1155, 


IPB002027D 


Amino acid permease 


22.00 6.79e-21 466-505 


1157 


PR01436C 


NADH-ubiquinone oxidoreductase chain 2 
signature III 


9.53 7.75e-09 82-100 


1158 


IPB001428B 


dUTPase 


18.14 1.00e-14 107-125 


1158 


IPB001969A 


Eukaryotic and viral aspartic protease active 
site 


16.37 5.71e-09 195-211 


1158 


PR00783C 


Major intrinsic protein family signature III 


13.32 5.26e-17 31-5U 


1160 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


'-t nr f-i /I O „ 1A1^7^A 1T£TA 

7.0D y.4ze-lU l/z>U-I/oU 


1160 


IPB000034A 


Laminin B 


OO Ol 1 1Q /1Q7 ^01 
ZZ.Z1 1. cSDc-iy 4o /-DZZ 


H60 


IPB000034A 


t * j„ n 

Lammin B 


T) Ol O QQp 1A IQ^^ 1 ^Q1 

zz.zi z.yye-iu UDO-ioyi 


H60 


IPB000034A 


Lammin B 


11 1 QAo 1*7 C7^ 0 1 A 

Zz.zi o.o4e-w o/j-yiu 


H60 


IPB000034A 


Lammin B 


zz.zi o.ije-iu 1/00-1 /oo 


H60 


IPB000034A 


Laminin B 


Ol Q AQ 1 O^l T)QQ 

zz.zi o\ooe-uy izoo-izycs 


H60 


IPB000034A 


Lammin B 


oi oi o //io aq cfio 
zz.zi y.44e-uy djo-doo 


1 1 /Zf\ 

1 loU 


T "DTD f\ A fi A 1 A "D 


i^aminin ±5 


91 ?? 1 2^e-14 654-692 


1160 


IPB000034B 


Laminin B 


21.22 3.25e-31 1426-1464 


1160 


IPB000034B 


Laminin B 


21.22 3.33e-12 1287-1325 


1160 


IPB000034B 


Laminin B 


21.22 4.77e-10 1757-1795 


1160 


IPB000034B 


Laminin B 


21.22 4.99e-18 601-639 


1160 


1PB000034B 


Laminin B 


21.22 5.45e-09 511-549 


1160 


IPB000034C 


Laminin B 


12.97 3.09e-ll 508-526 


1160 


IPB000034C 


Laminin B 


12.97 3.45e-09 319-337 
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1 1 OU 


tt>tiaaaa'3 a 
1.tJdUUUU.j4L> 


JLamimn t> 


12.97 4.00e-l 1 443-461 


1 1 £n 

1 IOU 


AAAAQ A f" 1 

lr\fc>UUUUo4L. 


i^aminin t> 


12.97 4.05e-12 1377-1395 


1 IOU 


lir I5UUUU.34L/ 


Laminm t> 


12.9/ o.D2e-13 554-572 


1 1 £f> 
1 1 ou 


tptiaaaaq /ir 1 

lr\t>UUUUJ4L/ 


j_»amin.m ts 


1 O 0*7 iC C/C- OA 1 '-7C /I 1 TM 

12.9/ o.5oe-20 1754-1772 




TPRnnnn^zir^ 
I.t.dUUUU.341^ 


i^amimn xi 


12.9/ o. /3e-u9 1284-1302 




lI^.tSUUUU.341^ 


Laminin o 


1 O m O A1 „ AA f AC /-1/T 

12.97 o.01e-U9 59o-olo 


1 1 AH 
1 IOU 


lJrJt>UUUU.o4L/ 


J^amimn rJ 


1 O AT A AA„ 1 1 /"f 1 /-/"n 

12.97 9.09e-ll 651-069 


1 1 An 

1 IOU 


lJr£>UUUUj4Lx 


Lramimn r> 


12.97 9.31e-lo lo53-lo71 


1 1 An 
1 loU 


tdt>aaa<£ 1 
IJrJtJUUUjol 


xiOr-like domain 


4.89 5.50e-09 1754-1762 


1 1 Aid 

I lou 


TDDAH 1 AA1 T7\ 

II'oUUIUUIJJ 


DNA polymerase III, beta chain 


1 yl O £T O O f\ t~\C\ TITO ^OA^ - 

14.35 8.29e-09 2372-2395 


1 1 /^n 
1 IOU 


IroUU I lOyJv 


Integrin beta, C-terminus 


27.45 8.93e-ll 1691-1733 


1 1 An 
1 IOU 


Tt>D AA1 T71 


Mammalian defensin 


19.97 5.91e-09 1728-1756 


1 1 An 
1 lou 


IrrJUUl / /4C 


Delta serrate ligand 


18.25 8.16e-09 545-587 


1 1 An 

1 IOU 


AA1 QQ^ A 

IrUUulooOA 


Laminin N-terminal (Domain VI) 


8.51 8.27e-09 124-131 


1 1 ^a 
1 IOU 


lrr>UUIooo±> 


Laminin N-terminal (Domain VI) 


15.20 5.11e-21 286-308 


1 1 An 

I IOU 


IroUUIooOl^ 


Lammin JN -terminal (Domain VI) 


A CA 1 AA— AA nOyl 1 "> O O 

24.54 1.00e-09 1284-1323 


1 1 An 
1 1 ou 


11^ ±>UU 1 o oOL/ 


Laminin in -terminal (Domain Vlj 


1 -1 ^ y« 1 AA— 11 1 TA1 1 n ilo 

24.54 1.00e-ll 1703-1742 


1 1 An 

1 IOU 


IroUUlooOLv 


Laminin N -terminal (Domain Vlj 


O/l f/1 1 AO. 1A CAD C A 1 "! 

24.54 1.23e-10 508-547 


1 IOU 


IrJoUUloooC 


Laminin N-terminal (Domain VI) 


O H ^ /I 1 ^ o i/i 1 nr ,^ 1 piao 

24.54 l.53e-14 1754-1793 


1 1 ^CA 

l lou 


TDT3AA1 PO^r 1 

IrBUUloooC 


Laminin N-terminal (Domain VI) 


24.54 2.97e-33 319-358 


1 1 ^A 
1 IOU 


TDDAA 1 OO/p 

lrBUOloooC 


Laminin N-terminal (Domain VI) 


24.54 3.28e-l3 391-430 


1 1 £T A 

1 IOU 


IrrJUUIoooC 


Laminin N-terminal (Domain VI) 


i-\ A C A A 1A "\ f\ C C A C AO 

24.54 4.l9e-l0 554-593 


1 1 An 
I IOU 


IrrSUUloouL, 


Laminin N-terminal (Domain VI) 


24.54 5.67e-10 1423-1462 


1 1 An 
I IOU 


TOT5AA1 QO<P 

LroUUl oooL 


Laminin N-terminal (Domain VI) 


A IT A iC O O 1 1 .CAD /"O ^7 

24.54 6.33e-ll 598-637 


i 1 f^n 
1 IOU 


IrVoUU loooL, 


Laminin N-terminal (Domain VI) 


O/l C.I T /I A. 1A 1 nn 1 j i /• 

24.54 7.49e-10 1377-1416 


1 1 An 
1 IOU 


TDT3AA1 QQ<CP 

IroUUlooOLy 


Laminin N-terminal (Domain VI) 


A A CA O 1 1 11 C 1 /"AA 

24.54 8.11e-ll 651-690 


1 1 An 

I IOU 


T13T3AA1 Q°£r* 
1I*X5UU 1 OOOJJ 


i^aminm iN-terminal (Domain VI) 


lO C1 1 11 , AA vlOA A C "1 

12.51 1. 7 le-09 439-457 


1 1 An 

1 IOU 


Lrisuu looOU 


Laminin r>-terminai (Domain Vl) 


12.51 3.65e-17 io99-l717 


1 1 An 

1 IOU 


IroUUlooOU 


.Laminin JN-terminai (Domain Vlj 


10 C1 O AC„ 1A 1 7CA 1 T/TO 

12.51 o.95e-10 1750-1768 


1 1 An 

I IOU 


litSUU 1 ooOU 


Laminm in -terminal (Domain VI) 


1 A C1 A Af „ OA O O n /| Af 

12.51 9.u5e-22 387-405 


1 1 An 

1 IOU 


Lir r>UUlooOJi 


Laminm JN -terminal (Domain VI) 


1AAA 1 11 - lO 1AOA 1AAO 

10.90 1.21e-13 1282-1298 


i i <^n 
1 IOU 


IroUUloool} 


Laminin N-terminal (Domain VI) 


1AAA 1 ,n. AAOI'? OOO 

10.90 1.47e-09 317-333 


1 1 An 

I IOU 


IrriUUl ooorL 


Laminin N-terminal (Domain VI) 


1AAA A O '1 _ 1^T 10TC I^AI 

10.90 4.82e-16 1375-1391 


1 1 An 

1 IOU 


TT>T3AA1 QQ/Ct? 

II\t>UUIooo.c 


Laminm JN -terminal (Domain VI) 


1A AA C C\^7 ^ AA £^A 

10.90 5.97e-09 649-665 


1 1 ^n 
1 IOU 


lrr>UUl<S(5oJi 


Lammin IN -terminal (Domain VI) 


1 A AA /C — 7 /I — AA A A ~\ A CI 

10.90 o.74e-09 441-457 


1 1 ^n 
1 IOU 


T"D1DAA1 0 Q CJJ 


Laminin N-terminal (Domain VI) 


1A AA —7 /-f O _ 11 1 T d 1 

10.90 7.43e-13 1752-1768 


1 1 ^n 

I IOU 


IrrlUUloooli 


Lammin JN -terminal (Domain VI) 


1AAAn^T1~ 11 C C\tC y"1A 

10.90 7.61e-ll 596-612 


1 1 *^A 

1 IOU 


lroUulooori 


Lammin JN -terminal (Domain VI) 


1 /-» r\A O TC— 1 1 CA/' £"T1 

10.90 8.35e-ll 506-522 


1 1 ^n 
1 IOU 


IrrJUUIooOJi 


Laminin N-terminal (Domain VI) 


1 A AA A C A _ OA CCA C S~ Ct 

10.90 9.50e-20 552-568 


1 1 An 
I IOU 


lrr>UlL5oo4C 


Factor I membrane attack complex 


11 AA O OO— AA 1/tAI 1 /</o-i 

13.00 3.22e-09 1403-1421 


1 1 An 

L IOU 


nn AAA 1 1 A 


1 )pe 111 LUx'-liKe signature 1 


1/1 AC 1 /TO„ 1/1 1 /I 1 /■ -|/1i, 1 

14.05 1.68e-14 1416-1434 


1 1 fzn 
1 IOU 


O LJ AAA1 1 A 

rKUUOl 1A 


1 ype 111 bur-like signature I 


1 /! AC 1 ZTA 1 1 /I /i /•- /-/-i 

14.05 1.69e-ll 644-662 


1 1 £A 

I loll 


T1D AAA 1 1 A 

FROUUl 1A 


1 ype 111 litjl'-liKe signature I 


1 /I AC 1 OO 1 C 1 Ann *1 <^j-v^" 

14.05 3.88e-15 1277-1295 


1 160 


TinAAAl 1 A 

PR00011A 


nn ttt T7 /~i T7 i 'i • i t 

Type III EGF-like signature I 


14.05 3.93e-16 1370-1388 


1 i <ca 

1 160 


PR00011A 


Type III EGF-like signature I 


14.05 3.95e-13 436-454 


1 1 An 


pDAAf|1 1 A 


' i '\rnd TTT Th (~i T?_ 1 1 1/ - ** ciminttiro T 

i ype in E/V>jr-iiK.e signature i 


14. ud D.uue-uy jy i-ouy 


1160 


PR00011A 


Type III EGF-like signature I 


14.05 5.05e-12 1318-1336 


1160 


PR00011A 


Type III EGF-like signature I 


14.05 5.43e-ll 695-713 


1160 


PR00011A 


Type III EGF-like signature I 


14.05 6.82e-10 501-519 


1160 


PR00011A 


Type III EGF-like signature I 


14.05 8.83e-16 547-565 


1160 


PR00011B 


Type III EGF-like signature II 


13.08 1.00e-09 591-609 


1160 ' 


PR00011B 


Type III EGF-like signature II 


13.08 1.66e-10 695-713 


1160 


PR00011B 


Type III EGF-like signature II 


13.08 4.18e-ll 644-662 
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1 X \}\J 


PR0001 1 R 


TTmo TTT "DOT? Mire* rmntm-n TT 

lype in jDiar-iiKe signature 11 


13. Oo 4.35e-15 1370-1388 


1 1 &C\ 

x ±UV/ 


pponm i ti 

x^xxAJUU 1 1 15 


lype in xiLrr-iiKe signature il 


13.08 5.57e-12 436-454 


1 1 60 
1 1 uu 


PT? 00.0,1 1 "R 
Jr JtvUUU 1 1 15 


lype in HLjr-iiKe signature 11 


13,08 5.71e-17 547-565 


1 160 

X lOu 


PR 0001 1 R 
J: JxUUUl 115 


lype in nor-UKe signature 11 


13.08 6.4oe-12 1416-1434 


1 1 60 

1 JLUU 


PR OOOI 1 R 
XrSWJUKJ 1 1X5 


iype in Jtsor-iiKe signature ll 


13.08 7.52e-13 1277-1295 


1 160 


PR 0001 1 R 

rlvUUU 1 1x5 


lype in liijrr-UKe signature 11 


1 <) AO O -1 O 11 10 10 -1 0 y^" 

13.08 8.48e-ll 1318-1336 


1 160 


pr 0001 1 r 1 


iype in liUrr-UKe signature 111 


25.79 2.62e-09 051-679 


1 160 

X L\J\J 


PR0OO1 1 P 


iype ill Jtiur-iiKe signature ill 


25,79 o.53e-10 1423-1451 


1 160 


proooi 1 r> 

.TxvUUU 1 1 U 


T, m a TTT RPR ll'lria, cn'rmn+iirQ T\/ 

iype m iiijrr-iiKe signature iv 


IO 1 *^ 1 1 0« 1/1 lOOA 1000 

12.12 1.18e-14 1370-1388 


1 160 

X X KJ\J 


proooi i n 


i ype in nvjr-iiKe signature i v 


12. 12 3.63e-10 695-713 


1 160 

X X \J\J 


PROOO1 1 D 


i ype in c/Or-iiKe signature i v 


10 n /) 1 /i cat fin 
12.12 4.9oe-14 501-519 


1160 


PROOOI ID 


Type III EGF-like signature IV 


12.12 6.01e-09 591-609 


1 160 


proooi 1 


iype in nor-iiKe signature iv 


IO 11 H ~ 1A £AA Z~£~<*i 

12.12 7.65e-10 644-662 


1 1 60 


proooi i n 


iype ill nCrr-iiKe signature IV 


12. 12 8.25e-17 547-565 


1160 


PROOOI ID 


Type III EGF-like signature IV 


12.12 8.33e-13 436-454 


1 1 £fi 

1 10U 


Tin nnn 1 1 tv 
r^KUUUl 1JJ 


1 ype 111 JaGF-hke signature JV 


12.12 8.92e-14 1416-1434 


1160 


PROOOI ID 


Type III EGF-like signature IV 


12.12 9.17e-13 1277-1295 


1 1 /CA 

1 160 


i~>T) A A A 1 m 

PROOOI ID 


HP- — — TTT T™ 1 TT 1*1 * j tt 7" 

Type III EGF-like signature IV 


12.12 9.23e-09 1747-1765 


1 loU 


PR00877D 


Plant PEC family metallothionein signature 

IV 


6.45 1.89e-09 1308-1315 


1 1 £1 


TPnooooo./i/*" 1 


L/aminm Jb> 


12.97 6.42e-10 205-223 


11^1 
1101 


IroUUUlDZ 


Aspartic acid and asparagine hydroxylation 
site 


8.86 1.00e-10 158-173 


1161 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 9.29e-09 196-211 


1161 


IPB000359A 


Cystine-knot domain 


23.24 4.86e-10 129-153 


1 1 1 

1 iol 


IrBUUUSOl 


|-J /—1T7 1'f 1 _ _ ■ 

EGF-like domain 


4.89 9.25e~10 46-54 


1 1 /C 1 

1 iol 


T"r»T~> a A 1 \ 

lrBOOI ml A. 


Dismtegnn 


23.93 9.65e-09 95-135 


i iri 

1 loi 


IrrJUUl / /4C 


Delta serrate ligand 


18.25 1.71e-31 37-79 


1101 


TT>"r> A A 1 T7 A /""i 
1P15UU1774C 


Delta serrate ligand 


18.25 9.17e-12 68-110 


1101 


TDD 1 > 

IrBUUloOZr 


Membrane attack complex 
components/pertorin/complement C9 


29.39 2.65e-09 170-217 


1 1 O 1 


TPROH1 060T7 

IJroUU 1 oo zr 


Membrane attack complex 
coiriponenTs/periorii 1 / complement 


OA OA O O r\r\ "t O 

29.39 8.53e-09 132-179 


1 1 61 


TPR001 


Lamimn in -terminal ^uomain vi^ 


1A OA A C~l , » in /I /I /CA 

10.90 y.o/e-10 44-60 


1161 


PR 0O01 OA 


iype ii izAjT-iiKe signature i 


A1 1 T7« AA 1 /I O 1 C A 

12.91 l,27e-0y 143-154 


1161 i 
1 1 0 1 


PROOOI OP 


iype ii HLjr-iiKe signature ill 


AO O C /i „ AA 1 O 1 rn 

6.98 2.54e-09 163-173 


1161 

1 1 O x 


RROO01 1 A 


i ype in iivjr-iiKe signature 1 


1 A AC /T OO™ A A OA CT 

14.05 o.8oe-09 39-57 


1161 


PR 0001 1 R 
x ivUUU 1 115 


13436 111 nor-iiKe signature 11 


IO AO T OO- 1A "3 A C~7 

13. Oo /.ooe-lU 39-57 


1161 


PROOOI ID 


Type III EGF-like signature IV 


12.12 4.57e-12 39-57 


1161 
1 1 0 1 




Proline rich extensin signature VII 


4.02 7.79e-09 221-246 


1162 


IPB000471A 


Interferon alpha, beta and delta family 


27.36 2.86e-34 45-98 


1 1 cn 

1 loZ 


PR00266A 


Interferon alpha and beta subumt signature I 


13.41 9.59e-14 67-79 


1 1 CI 
1103 


1PB000975D 


Interleukin-1 


24.45 5.55e-09 59-98 j 


1 1 a 

1 lo3 


1PJB000975E 


Interleukin-1 


28.12 9.80e-09 103-142 


1 1 /CO. 

1103 


t>t> r\r\**t/? a a 

PR00264A 


Interleukin-l precursor family signature I 


18.63 1.00e-08 62-82 


1163 


PR 01 360P 


iniencuKin-i icLcptor dntdgonist precursor 
IL-1RA signature III 


IO. 11 yl O/Ic 1 1 CC OO 


1163 


PR01360F 


Interleukin-1 receptor antagonist precursor 
IL-1RA signature VI 


14.44 3.11e-12 124-142 


1164 


IPB000072 


Platelet-derived growth factor (PDGF) 


13.51 4.00e-26 75-97 


1164 


IPB003498F 


Probable DNA packing protein, C-terminus 


23.31 7.55e-09 154-193 


1165 


IPB000975E 


Interleukin-1 


28.12 3.57e-12 95-134 


1165 


PR00264A 


Interleukin-1 precursor family signature I 


18.63 9.85e-09 55-75 
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IxcSUlLa 


1165 


rRU02o4C 


lnte^cUKlll- , 1 precursor iaimiy signature ah 


1Q ^7 4 90e-16 108-123 


1165 


PR01357F 


Interleukin-1 alpha/beta precursor family 
signature vi 


17.87 7.15e-10 108-123 


1165 


PR01360E 


Interleukin- 1 receptor antagonist precursor 
1L-1KA signature v 


9.69 9.33e-13 95-115 


1165 


PR0l360r 


InterleuKin- 1 receptor antagonisi precursor ( 
iL- 1 ka signature v i 


14 4.4, Q S6p-1 R 1 1 6-1 ^4 


1166 


lPBUU027oG 


Kiioaopsm-iiKe KjrK^K. supenanmy 


8 fn 6 50e-l 1 3^-44 

O.v/J U ( JwW^l X — » J "l I 


1166 


PR00237r 


KH0Q0psin-nKe ur^K. superiairiiLy 
signature vi 


14 S4 5 05e-16 23-47 


lloo 


rKUUZo /Cjt 


ivnoQopsin-iiK.e utjt ^-jcv bupciitiiiiiiy 

ci omdhiTP "V/Tl 
olglldtUIw Y XX 


19 23 8 S0e-13 70-96 


1 1 66 

L 100 


PI? fin407P 


Tr» tprl f^i i l"*i n — R rfofntrir Qioiifitiirp \^ 
111LC1 lCUlVlJI ~0 ICCC|Jt\Jl olglldLUiW V 


7 33 3 84e-17 6-21 


1 1 66 
1 100 


PPHH/197P 
jrivUU^tZ /r 


Tnti=r1f»i'itriTi-S vf*np>r\tr\r <3icrnn+nrp "VT 
XlltCllCUJvlIl O lCL<CpiL;l dl^LlctLUl Ci YX 


12 53 2 63e-12 48-64 


1 1 66 
1 lOO 


PP 0.049 70 


Tntpflpnl^in-S rpppntnr <iipnature VTI 


10 47 8.53e-17 98-114 


1 1 66 
1 1DO 


pp nosv^D 


TntprlpuVin J?T5 rpppntnr «;ienatiire TV 


15.06 1.85e-18 114-129 


1 ID / 


PP 00414A 


Palmitnvl nrntpin fhinpstera^e sienatureT 


10.61 3.44e-31 26-48 


1 1 67 
1 J. O / 


PRfi041 4R 


polrnifnvl nrntpin tliinp<;f"pra«;ft .siPnature TI 

I ill 1 1 1 i L\_f_y 1 JJlULL/lll UllUvulvlaov ulgllULUiu j.x 


11 87 6.79e-25 66-85 


1 1 67 
110/ 


PPfifi41 4r* 


Pn1m-fto\/1 nrotpin tliinpQtprncjS SlPTiature TT^ 


13.39 1.90e-23 106-122 


1 1 67 
i 10/ 


PPHHA1 ATI 


polrmtrwl nrotpin tfiinp^sfpra^e sipnature TV 

± dlllll LU_y I yJl.\Jl\HlL UllUvuiwi claw oxguuiui v a t 


17.22 2.13e-27 151-170 


1 1 67 


ppnn/11 4P 


pQlmitovl nmtpin tTiinpsstpra^P ^ifm^tlire V 

XdllllllAJYl |/lvlVlll llll\Jt'Otwl aOv OlgilfcH-lAJL W V 


17 10 1.43e-27 185-204 


1167 


PR00414G 


Palmitoyl protein thioesterase signature VII 


12.98 2.36e-24 258-276 


1 lOO 


TPisnnno i ^ a 


oerpins 


13 01 4 18e-20 73-96 


I 1 6Q 

II DO 


Lr dUUUZ 1 JD 


oerpins 


9 87 9 68e-09 181-193 


1 1 /CO 

1 loo 


too Anno 1 ^p* 


oerpins 


13 90 7 lle-11 208-222 


1 1 

1 loo 


11^L>UUUZ 1 jIJ 


oerpins 


15 35 3 88e-16 310-336 


1 1 f^Q 

1 lOo 


LrD\JK)\J£ 1 Jli 


O ami n c 

oerpiiib 


15 36 1 56e-18 391-415 


1 1 on 
1 1 Ju 


TPT^nm 771 

UtdUUIZ/ 1 


iVldnillldllcUl UCldlalll 


19 97 3 42e-21 103-131 


I 1 TO 

I I /Z 


TPnnnnoco 


i yrus>uic opcc-iiic jjivjiciii l?iivojjii<h.«oc/ &ii^± 

dual specificity protein phosphatase family 


10 77 5 85e-13 439-449 


1 1 TO 
11/2 


ppnn7nn a 


JrrOLClIl LyiOolllC pilUopilctlaoC olglla.LU.lW X 


6 05 1 00e-10 295-302 


1 1 TO 
1 1 /2 


pp nnonnR 


Di<Afa!n +m*/^ci'M*a TxTirtCTxTici'f'ciopi cionotnr^ TT 

jrrotein lyruaiiic piiuopiictLctoc big,iiaiuic h 


17 06 5 70e-?l 313-333 


1 1 TO 

11/2 


t>"d nmnAP 
JrKUU /Uuv> 


JrrOtem LyrOSlIie pilOapXlalaac BlgllclLUlC 11X 


13 89 9 05e-16 399-416 


1 1 70 

11/2 


t)d riA7nnn 


Drrvf «! n tin-ncinp nlincnnu+acp citmainrf* T\/ 
JrrOLClIl LyiOolllC piUJopildLctoC ol gild 1- HI C XV 


12 83 7 84c- 19 436-454 


1 1 71 

11/2 


DDAA7finT7 

rKUU/UUxi 


xrotein tyrosine pnuopiiaiaoc oigiidLuic v 


16 60 1 93e-10 467-482 


1 1 OO 

11/2 


jrKUU /UUr 


r^roieiii tyi usiiie pinjopiiaidoc oi^udLLiic vi 


10 33 1 00e-ll 483-493 


1 1 TO 

1172 


nn A1 071TJ 

rKUio / Iri 


C3 nlm j-vt*i«=»11 O /V/3»*e1T11Cl tVK^/^l licit* t\rt*AGinP 

ijamioneiid/ x ersiiiici iiiv_>u.uidi lyruoinc 

■nhrtCiVhnl'QCip cionfltiirp V 
piUJopiididot; oigiidLuiv v 


11 71 2 11 e-09 437-448 


1187 


IPB000008C 


C2 domain 


23.37 2.44e-09 61-100 


1 1 QO 
1 lo / 


lr'tJUUZOH-ZA. 


T tremr»ViAcnVir»1inOCP f'Q+UIX/t'lf flfMTIQITI 
l^ySOpilOSpilOlipcloC L/dLdiyLlO tlUllldlll 


18 37 1 69e-13 355-380 


1 1 Q7 

1 lo / 


l.roUUZO'+ZES 


XjyoUpilUopiH-'lipdot/ L«.icn_y liu uwiiidiii 


11 84 4 38e-15 383-407 


1 1 CO 

1 lo / 


Lrr>UUZo4ZiS 


T x/cirtt-\Vn-\cnVin1iT^Q cp P£i1"n1"\/Kr« /1r\tnnin 
JUySOpnObpiUJlipdoc udUdiytiL/ viuiiidiu 


18 19 6 91 e-10 509-534 


1 1 QO 

1 lo / 


TPonno/C/i 0/^ 
l^rJUUZc^Zlj 


T i «-i/-xT-»l-i/^c'-r\lirx1i-»-*c> op oo+cil x.H"ir» rl/"\m fain 

L/ysopnospnoiipd.bc L/dLdiyiiu uouidiii 


34 1 1 6 72e-10 678-726 


1 lo / 


FKU(J3oUr> 


L/2 aomain signature 11 


1 1 64 8 67e-12 88-101 

X X * V-l" O - \J f \m* J- ^-J VJ X v/ X 


I 1 QQ 

I I 00 


pp nnc/^nni 


VertCDldLC lllCLdllvlllUJllt/lJl olglldLLllC/ XX 


6 90 7 18e-09 52-65 


1189 


IPB000566B 


Lipocalin and cytosolic fatty-acid binding 

Tirotpin 


8.91 9.53e-09 122-132 


1189 


IPB002223 


Pancreatic trypsin inhibitor (Kunitz) family 


17.66 L32e-23 285-319 


1189 


IPB002223 


Pancreatic trypsin inhibitor (Kunitz) family 


17.66 9. 12e- 15 247-281 


1189 


PR00179A 


Lipocalin signature I 


13.97 2.13e-10 38-50 


1189 


PR00179B 


Lipocalin signature II 


7.67 4.91e-12 122-134 


1189 


PR00179C 


Lipocalin signature III 


17.26 4.60e-09 150-165 


1189 


PR00759A 


Basic protease (Kunitz-type) inhibitor 
family signature I 


14.67 5.88e-09 228-242 
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Results* 


1189 


PR00759B 


Basic protease (Kunitz-type) inhibitor 
family signature II 


12.35 3.81e-10 294-304 


1189 


PR00759B 


Basic protease (Kunitz-type) inhibitor 
family signature II 


12.35 8.71e-12 256-266 


1189 


PR00759C 


Basic protease (Kunitz-type) inhibitor 
family signature III 


12.43 6.00e-14 304-319 


1189 


PR01215A 


Alpha- 1 -microglobulin signature I 


14.08 3.25e 7 18 32-45 


1189 


PR01215B 


Alpha-l-microglobulin signature II 


14.28 7.00e-15 47-58 


1189 


PR01215C 


Alpha- 1 -microglobulin signature III 


8.22 3.63e-19 91-106 


1189 


PR01215D 


Alpha- 1 -microglobulin signature IV 


12.88 1.53e-27 113-132 


1189 


PR01215E 


Alpha-l-microglobulin signature V 


12.83 1.66e-24 145-166 


1189 


PR01215F 


Alpha-1 -microglobulin signature VI 


10.31 9.59e-25 173-192 


1189 


PR01273A 


Invertebrate colouration protein signature I 


14.08 3.53e-09 33-49 


1200 


IPB001594 


DHHC-type Zn-fmger 


23.81 7.47e-31 99-143 


1201 


IPB001909 


KRAB box 


17.37 9.61e-26 35-69 


1202 


IPB001909 


KRAB box 


17.37 9.61e-26 35-69 


1203 


IPB000961A 


Protein kinase C-terminal domain 


16.82 5.97e-09 74-108 


1204 


IPB000001D 


Kringle 


11.31 3.92e-15 81-97 


1204 


IPB000126A 


Serine proteases, V8 family 


11.75 9.51e-12 81-96 


1204 


IPB000177K 


Apple domain 


13.19 3.55e-ll 83-115 


1204 


IPB001254A 


Serine proteases, trypsin family 


9.98 6.06e-18 81-97 


1204 


PR00722A 


Chymotrypsin serine protease family (SI) 
signature I 


12.06 3,37e-17 82-97 


1208 


PR00179B 


Lipocalin signature II 


7.67 7.30e-09 121-133 


1208 


PR01174F 


Retinol binding protein signature VI 


11.76 9.34e-09 120-136 


1208 


PRO 1254 A 


Prostaglandin D synthase signature I 


12.32 9.79e-09 31-54 


1208 


PRO 1 275 B 


Neutrophil gelatinase lipocalin signature II 


9.02 3.16e-ll 39-49 


1209 


IPB001007C 


von Willebrand factor, type C repeat 


20.89 5.97e-10 34-55 


1212 


PR00019A 


Leucine-rich repeat signature I 


11.72 2.50e-12 177-190 


1212 


PR00019A 


Leucine-rich repeat signature I 


11.72 7.82e-09 390-403 


1212 


PR00019B 


Leucine-rich repeat signature II 


11.42 1.64e-13 174-187 


1212 


PR00019B 


Leucine-rich repeat signature II 


11.42 9. 18e- 11 387-400 


1213 


IPB000834B 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (Ml 4) family 


13.51 2.50e-17 103-117 


1213 


IPB000834C 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (M14) family 


17.20 2.80e-15 172-188 


1213 


IPB000834D 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (M14) family 


18.95 4.72e-12 199-225 


1213 


IPB000834E 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (Ml 4) family 


9.80 2.15e-09 228-242 


1213 


PR00765B 


Carboxypeptidase A metalloprotease (M14) 
family signature II 


14.48 1.39e-15 99-113 


1213 


PR00765C 


Carboxypeptidase A metalloprotease (M14) 
family signature III 


10.88 1.82e-10 179-187 


1213 


PR00765D 


Carboxypeptidase A metalloprotease (Ml 4) 
family signature IV 


14.06 9.45e-12 233-246 


1214 


PR01536C 


Interleukm-1 receptor type I and type II 
family signature III 


19.92 7.ooe-oy o/-yu 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 1.30e-ll 56-93 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 2.22e-15 44-81 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 2.29e-19 38-75 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 2.74e-09 20-57 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 2.82e-15 68-105 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 3. 05e-10 92-129 
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JK-cSultS 


1221 




riDnnar collagen ^-terminal uuiiiam 


1 1 46 1 9?e- 1 1 26-63 


1221 


TT)T> AAflO 0 *C A 

IFBUUUoojA 


rioniiar collagen o-terminai aomain 




1221 


iJrBUUUbojA 


_riDriiiar conagen i^-iermmai aomain 


1 1 46 1 I6e-19 39-6Q 


1221 


lirMjUUUoo jA 


riDrmar conagen ^-Terminal aomam 


1 1 46 ^ 57e- 14 47-84 


i n i 
1221 


IroUUUoojA 


riDriUar coiiageii i^-ieiiiiiricii aoiiiciiri 


1 1 46 4 I6e-1 S 71-108 


1221 


TDDnnnoo< a 
IjKdUUUoBjA 


rioniiar conagen i^-termmai aomain 


11 46 4 1 le- 1 6 80- 1 1 7 


1221 


TTYD AAAO O C A 


JriDriiiar conagen o-ierminai aomain 


11 46 4 98^-15 41-78 


1221 


IjtoUUUoojA 


rioriiiai collagen v^-tenninai aomain 


1 1 46 4 69p- 1 4 65- 1 02 


1 01 1 

1221 


TTTDAAAQ 0^ A 


riDrinar conagen l> -terminal aomain 


1 1 46 4 65p-1 5 SO-87 


1221 


IrBUUUoo jA 


JriDriiiar conagen ^-terminal aomain 


1 1 46 5 S6p-1 ^ 69-QQ 


1 OO 1 

1221 


TDDAfiAOOC A 

iJrVBUUUoajA 


JriDriiiar conagen v-*~ierininai uonidin 


1 1 46 5 56e-14 8^-120 


1221 


T"D"DA AAOQ ^ A 

IJtVdUUvoojA 


riDrniar conagen v^-iermiiidi uoiudin 


11 46 S Q1e-14 99-66 


1 OO 1 

1221 


TtyQAAAQ A 


riDriLiar conagen ^-xenmnai aoniaiii 


1 1 46 7 07e- 1 0 95- 1 32 


1221 


TT1"DAAAQ S? <T A 


JriDriiiar conagen c--tenmnai aomam 


11 46 7 46p- 16 86-1 93 


1221 


lr\t>UUlJoo DA 


FiDrniar conagen v^~Tenninai aomain 


1 1 46 7 59e- 1 6 77- 114 


1221 


T"D"QAAAQQ^ A 
lJrJt>UUUoOjA 


piDriiiar conagen v^-ieriiiiiiai uuiiiam 


1 1 46 8 ?Qe-1 2 53-90 


1 OO 1 

1221 


IrVtSUUUoo j A 


JriDriiiar conagen i^-termmai aomain 


1 1 46 8 36e-14 59-96 


1 11 1 

1221 


TDDAftrtCQ^ A 


JriDriiiar conagen i^-xermmai aomam 


1 1 46 9 00e-19 35-72 


1111 
1221 


ir rSUUUooDA 


JriDriiiar conagen Ly-tenniriai uomaui 


11 46 9 42e-14 74-111 


1111 
1221 


TrjD AAAOO CD 

IPBOUUooDJB 


JriDriiiar conagen ^-terminal aomam 


10 15 1 OOp-90 79-1 95 


1221 


TTjn AAAOOC 1~> 

IjPBuUUoojJB 


JriDriiiar collagen i^-termmai aomam 


10 15 1 99p-16 33-86 


1111 
1221 


TDDAAA O 0 CD 

IrBUUUooDB 


r l Drill ar collagen v^-termmai aomam 


19 15 1 47e- 18 5 1 - 1 04 

17' IJ l.*T/C lO Jl 1V7 


1111 
1221 


IjKdUUUoojJd 


riDnnar conagen i^-termmai aomam 


19 15 1 51p-?1 69-192 


1 ii 1 

1221 


IFBUUUoojB 


JriDriiiar collagen i^.-tenriinai aomam 


10 15 1 74p-90 36-89 


1111 
1221 


lJrBUUUoojJtJ 


JriDriiiar collagen i^-tennmai aomam 


10 15 1 Q4p- 1 0 90- 1 43 
17. IJ i.^^tc— i\j y\j" i*t«7 


1111 
1221 


IrBUUUooDB 


JriDriiiar conagen o-xermmai aomain 


19 15 1 Q8p-09 15-68 

17. IJ 1.7uC"U7 IJ 


1111 
1221 


Tnt>AAAOO CD 

lrBOUUooDB 


JriDriiiar collagen c-terminai aomain 


10 15 9 91p-14 97-80 


1111 

1221 




JriDriiiar conagen ^-xermmai aomam 


19 15 3 03p-1 7 78-1 31 

17. IJ J.UjC" LI / O'lJi 


1111 
1221 


TDDAAAO OCD 


JriDriiiar conagen ^-terminal aomain 


19 15 3 99p-1 5 94-77 


1 n i 
1221 


IDDAAAOO CD 


riDnnar conagen v^-terminai aomam 


10 15 3 5?4p-13 84-137 

17. 1 J J.OnC-lJ OH-lJ/ 


1111 

1221 


TDDAAAO © CD 

IFBUUUoojB 


JriDriiiar collagen L^-termmai aomam 


19 15 3 Q5p_94 49-95 

17. U J.7JC-Z'T T/.-7J 


1221 


TTITJAAAO O CD 

IrBuUUooDB 


JriDriiiar conagen L/-termmai aomain 


19 15 4 16p-18 30 83 


1221 




riDrniar conagen \^-xermmai aomam 


19 15 4 39e-93 75-198 


1111 

1221 


TDDAAAOC^Q 


riDrniar conagen ^-Terminal aomam 


19 15 4 70p-90 60-1 1 3 

17. 1 J *T. / UC \J\J 1 U 


1221 


TTIDAAA O O CD 


JriDriiiar collagen c-termmai aomam 


19 15 5 07p-90 45-98 

17. IJ J.U / C tJ 70 


1111 
1221 


TDDAAAQ QCD 

lJroUUUoojJt> 


JriDruiar collagen \^~ieriiiinai aornain 


19 15 6 40e-09 3-56 

17' IJ U.tUv \J Zs J J \J 


1111 
1221 




riDrniar conagen o-iermmai aomam 


19 15 6 61e-1 8 63-1 16 

17. 1 J \J.U1C" i O UJ x xw 


1 ii 1 

1221 


TT1DAAAO C CD 

IFBOUUooDB 


riDrniar conagen ^-terminal aomam 


19 15 7 69p-99 57-1 1 0 

17« 1 J / • U^C _ Z«Zr J / l L\J 


1111 
1221 




JriDriiiar conagen v^-xerminai aomam 


19 15 7 63e-09 9-62 

±7. X— ' / .UJC"U7 VjjL, 


1111 
1221 


1 Jr JtS U U U o o ji> 


Piuniiar conagen v^-LCiiiimai uuiiiaiii 


1 9 15 7 71 e-09 12-65 


1111 
1221 


T"D"D A A A Q Q *s T3 


JriDriiiar conagen l^-ici inmai aoniaiii 


19 15 7 8 1 e-2 1 39-92 

17i U / .OJlv x. J7 


1111 
1221 


TDDAAAOQ CD 


JriDriiiar conagen v_.-ierminai aomam 


19 15 7 84p- 1 9 66- 1 1 9 

17. I J / ■ Otv 1 7 \J\J 117 


1111 

1221 


TTIDAAA C Q CD 

IrJDUUUoojJD 


jriorniar conagen v^-terminai aomam 


19 15 7 95e-1 3 1 8-71 

17. IJ / .7JO lj lO /x 


1 H 1 

1221 


T11DAAA O O CD 

lrBOUUooDB 


ri Dnnar conagen ^-terminal aomam 


19 15 8 89p-1 0 93-146 

17. U 0.07C 1\J 7J ItU 


1 T> 1 

1221 


inn AAAOO C"D 

IPBUUUoqjJd 


JriDriiiar collagen u-terminai aomam 


19 15 9 07p-99 54- 1 07 

17. lJ 7.v /C~Zr^, J*r~ 1VJ / 


1221 


IPB000885B 


PiTirillar f?Allnp'f»n C!-tRmnina1 Hompiin 

X i Ui ll lcll vUllvluVll V .' Lvi Jill 11C4>1 Viwl 1 1C111 1 


19.15 9.16e-12 81-134 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


19.15 9.41e-13 21-74 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


19.15 9.88e-20 48-101 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


19.15 9.91e-12 87-140 


1221 


IPB001073A 


Complement Clq protein 


22.14 1.00e-ll 40-74 


1221 


IPB001073A 


Complement Clq protein 


22.14 1.42e-09 55-89 


1221 


IPB001073A 


Complement Clq protein 


22.14 1.67e-ll 79-113 


1221 IPB001073A 


Complement Clq protein 


22. 14 2.97e-09 43-77 



WO 03/054152 PCT/US02/39555 



Table 3B 
393 



JLJJ 


uataoasc 
encry ilj 


uescripiion 




1 791 
LZZ 1 


TPR00107^ A 




22 14 3.39e-09 91-125 


JLZZ 1 


TPROm 07^ A 
ijt.duu lu / oj\ 


Pnmnlptn pnt 1 n r^rnf/^i'n 
V„ UJ upicinciii i Lj JJlVJLVlll 


22 14 3 75e-10 31-65 


1791 


TPRAA1 A7^ A 


f* 1 ^\TY^V\1f*TYl#^'tti' 1 (1 T\rc\t(*\T\ 


22 14 4 09e-09 85-119 


1 991 


tpraai A7^ a 


^**rti*iTn1 f*m Anf f ( 1 /i t^r'/~\tf ! 'i ti 
i^tjiiipidiiciii uitj pujiciu 


22 14 4 45e-13 34-68 


1 791 
1ZZ 1 


TPPtrim n7^ a 


V_/UlIipiCUlCIll JJlUlvHI 


22 14 4 60e-12 94-128 


1 791 
IZZi 


TPnflm A7^ a 


V_vUIIipiCHlCill V-^JLq pIULC/Ill 


22 14 5 50e-ll 88-122 


1 791 
1ZZ I 


TPI*fiA1 A7^ A 


/~ 1 r\tnn1f*rYi/ a trl' 1 /~i fMWf^lTl 


22 14 5 67e-ll 76-110 


1 791 
1ZZ L 


TPRAA1 A7^ A 


^umpicinciii piuiciii 


22 14 7 10e-10 52-86 


1 791 




V^UIIipiCIIldll 1 pujicin 


22 14 7 33e-ll 82-116 


1 791 
1ZZ 1 


tpraai A7^ a 


v>iJlIipiCIIlCIXl V> 1 CJ pi UlClll 


^2 14 7 51e-13 37-71 


1221 


IPB001073A 


Complement C 1 q protein i 


22.14 7.83e-ll 97-131 


1 791 
1ZZ1 


TPT3HA1 A73 A 


l«/OmpiClllcriL U1L[ piULCili 


9? 14 8 17e-10 58-92 


1221 


IPB001073A 


Complement Clq protein 


22.14 8.59e-09 49-83 


1221 


IJrBUUlU /3A 


Complement ciq protein 


77 1J.Q£1f>..1^ 100-1^4 
ZZ. i*+ zf.O IC- 1 J) IVJVJ-lJ'r 


1221 


IPB001073B 


Complement Clq protein 


20.88 7.75e-28 150-184 


1221 


IrBUUlu /3C 


Complement CI q protein 


1 -3 A7 O « p 1 71/1_7"7Q 


1221 


IPB001073D 


Complement Clq protein 


7.60 7.68e-09 251-260 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


zo. iz i .uye- is> sz-ij4 


1221 


IPB001442A 


C-termmal tandem repeated domain m type 
4 procollagen 


17 1 ^0<a no 01 14^ 
Z0.1Z i.DyQ-yJy yl-iHj 


1221 


TT»nAA1 A A 1 A 

IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


7^ 17 1 1 < ^/t *! 1 rfC 

zo. iz l.^-ze-io o^-i id 


1221 


mriAAi /i /i 1 A 

IPB001442A 


C-termmal tandem repeated domain m type 
4 procollagen 


zo. iz z.^-ie- 1 / j l-oj 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


zo. 1 Z z.4 /e- 1 D /0-IZo 


1221 


TT»1~» /I f\ 1 A A 1 A 

IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


Zo. IZ z. /oe-1 / 0 /-l ly 


1221 


1PB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


Zo. IZ Z. oye-ZU 4y- 1 U 1 


1 <-»i 1 
1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


zo.iz j.y/e-uy iu-oz 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


O/C 11/1 ni^» 1 ^ 7Q 1 71 

Zo. iz 4.UJe-io /y-i3i 


1221 


Tnn/1A1 >l A 1 A 

IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


0^: 1 1 A 7Q 0 OA Af^ 

zo. iz 4. /ye-zu 40-yo 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


1/C 11/1 OA« 1 O 77 

Zo. 12 4,oUe-l / ZD- / / 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


1^: 11/1 OAn 17 10 on 

26. Iz 4.oUe-l f zo-oU 


1221 


IPB001442A 


C-termmal tandem repeated domain m type 
4 procollagen 


i£T 1 1 oia 11 /tr\ 01 
2o. Iz 4.oZe-Zl 4u-yz 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


1/C 11/1 O/Cq 11 10 

zo. iz 4.yoe-iz u-oj 


1221 


TT^Tl A A 1 A A A 

IPB001442A 


C-termmal tandem repeated domain m type 
4 procollagen 


11C Qf a AQ OO 1 A(\ 

zo. iz 3. 0 je-uy 00- 14U 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 6.56e-18 34-86 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 6.77e-14 22-74 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 7.39e-23 37-89 


1221 


IPB001442A 


C-terminal tandem repeated domain in type i 26. 12 7,55e-14 16-68 
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SEQ 

ID 

NO: 


Database 
entry ID 


Description 


Results* 






4 procollagen 




1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 7.99e-20 70-122 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 8.07e-22 43-95 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 8.30e-23 61-113 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 8.58e-20 58-110 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 9.09e-15 19-71 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 9.18e-19 55-107 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 9.37e-12 85-137 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 9.56e-18 73-125 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 9.59e-24 52-104 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 1.00e-ll 104-124 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 1.49e-09 47-67 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 1.85e-09 107-127 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 2.82e-09 41-61 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 2.82e-09 86-106 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 3.31e-09 71-91 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 3.55e-09 56-76 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 3.68e-09 50-70 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 3.92e-09 68-88 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 5.14e-09 89-109 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 5.24e-10 101-121 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 6.01e-10 38-58 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 6.72e-09 59-79 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 7.81e-09 35-55 


1221 


IPB001442B 


C-tenninal tandem repeated domain in type 
4 procollagen 


12.38 9.15e-09 44-64 


1221 


PR00007A 


Complement C1Q domain signature I 


20.64 7.47e-17 143-169 


1221 


PR00007B 


Complement C1Q domain signature II 


15.63 5.26e-14 170-189 


1221 


PR00007C 


Complement C1Q domain signature III 


16.13 2.07e-14 214-235 


1221 


PR00007D 


Complement C1Q domain signature IV 


9.66 3.91e-10 249-259 


1223 


IPB000053H 


Thymidine/pyrimidine-nucleoside 


12.28 9.76e-09 28-37 
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entry ID 


Description 


Results* 






■tit 
phosphor yl as e 




1223 


IPB000095E 


PAK-box /P21-Rho-binding 


i / . oz 4. i ye- Liz jj-DZyj 


1223 


IPB000494C 


T-| • 1 ■• lit**. J _ fC 'Li' D \ 

Epidermal growth-factor receptor (bCjJbK), 
L domain 


24. 4 U o.oUe-31 Zol-oU/ 


1223 


X iTO ir~0\ /O. /—> /" v /4 ✓O. j€ » ~ 

IPB000494D 


Epidermal growth-factor receptor (nurKj, 
L domain 


1Q 4/1 *7 <n<-» 9Q 199„17 1 

iy.4^f /.Dvje-zy dllto 1 1 | 


1223 


IPB000494E 


Epidermal growtn-tactor receptor kj, 
Lf aomain 


01 9 /1/=;^„9z1 170.41 zl 
ZI.Oj Z.'+Oe-Z.'H- 0 /!7-HlH 


i ooo 

1223 


IPBuOOooir 


iri^N/rnopnnin/riioteKin rno-omGing icpcdt 


1 6 50 ^ 1 5p-1 1 968-322 


1 ion 

1223 


lPB0UU95yB 


r UJLvJ Dox aupiicatea region 


1 S 6S 1 QQp-1 0 2S 1-291 


1223 


TTinnAAn^ 1 /o 


irrotem Kinase ^-teiruinai aomam 


k 48 9Qp-1 ^ 274-3 OS 


1 o n 

1223 


TDDAA1 O /I C A 


l yrosme Kinase catalytic domain 


29 45 7 1 9e-26 267-307 


1223 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 5.24e-23 325-363 


1223 


IPB001359H 


Synapsm 


99 58 ^ 51 i=»-1 0 80^-85^ 


1223 


IPB001359H 


Synapsin 


22.58 6.85e-09 777-827 


1223 


IPB001772C 


BCinase associated domain 1 


oa /r/T *7 QOp 1 o 9^9 9Q9 


1223 


IPB003306E 


WIF domain 


25.51 5.35e-12 379-424 


1223 


IPB003527C 


MAP kinase 


14. /u z.4oe- 14 /jyou / 


1223 


PR00109A 


Tyrosine kinase catalytic domain signature I 


12.56 5. 14e-10 237-250 


1223 


PR00109B 


Tyrosine kinase catalytic domain signature 

II 


t 1 O /Ct n lO OT/l om 

11.07 2.oIe-lo Z/H-ZyZ 


1223 


PR00109C 


Tyrosine kinase catalytic domain signature 

TTT 

III 


11.86 1.87e- 10 324-334 


1223 


PR00109D 


Tyrosine kinase catalytic domain signature 
IV 


17 £T1 7 1 0 0 91 1/11 1A^ 


1223 


PR00109E 


Tyrosine kinase catalytic domain signature 

■\ T 

V 


19 OO ^ fiOo 9fl 1QH zt1 A 


1223 


PR01217B 


Proline rich extensin signature II 


4.82 6.09e-09 927-943 


1224 


PR00453A 


Von Willebrand factor type A domain 
signature I 


1 1 70 1 1 AQ CA fin 


1225 


IPB000734 


Lipase 


in o< 0 11c no 1 /^id. 1 952 


1227 


IPB001304A 


C-type lectin domain 


17.98 3.48e-10 41-65 


1227 


IPB001304B 


C-type lectin domain 


iA 10 c no qq 111 


1227 


PR01504F 


Pancreatitis-associated protein signature VI 


12.72 6.45e-09 153^167 


1237 


IPB002181A 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


IO A A O 0.0,-k 7/CA 

15.44 2.32e-I0 244-20U 


1237 


IPB002181B 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


on 1 si q 1 n 991 9f\7 

2U. 10 y.3 /e-iu 2 / iou / 


1237 


IPB002181C 


Fibrinogen beta ana gamma cnains 
terminal globular domain 


1 5 s7 1 nn/3_ns 9QA- 


1237 


IPI3UU21M1J 


ribrinogen oeta and gamma cnains 
terminal gioouiar uomam 


9Q 1 £ 5 14>_1 0 %\ 9-^59 

Z7.IO J. I'M?"!? JlZ'JJA 


1257 


lrr>UUU4o3 


j_,eucme ncn repeat ^-lermmai aomam 


11 IS 1 5 493-4^7 


1 O 

1257 




i_,eucme-ricn repeat signature 11 


11 47 c ^ap-10 174-1^7 


1257 


PR00019B 


Leucine-rich repeat signature II 


11.42 8.33e-10 316-329 


1263 


IPJ30Q1/O2A 


Disintegrm 


91 Q1 fs. 45^-17 4^4-474 


1 9^i 


J r DUU 1 f\jZ,jr\ 


T^l CI Yt tf> €Tf\ Tl 

JL/lolilLC^L 111 


23 93 6 47e-09 416-456 


1263 


IPB002870A 


Reprolysin family propeptide 


12.22 3.31e-10 63-79 


1263 


IPB002870B 


Reprolysin family propeptide 


24.73 6.56e-14 109-147 


1263 


IPB002870E 


Reprolysin family propeptide 


11.90 2.50e-ll 341-353 


1263 


PR00289A 


Disintegrin signature I 


14.29 2.65e-13 450-469 


1263 


PR00873D 


Echinoidea (sea urchin) metallothionein 
signature IV 


8.25 7.35e-09 608-626 


1264 


IPB002168A 


Lipolytic enzymes 


11.40 5.07e-ll 139-153 
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Database 
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Kesuits^ 


1 9/^4 
iZOH 


TPT3A091 ^RR 


Lfipoiyiic enzymes 


10 ^ 14p«1Q 1 ^7 904 


1 Of A 
IZ04 


tpraa9 1 f£C 


jLfipoiyLic enzymes 


Q 77 4 nnp.1 9 1 & 970 


MftA 
lZO*t 


TP"RO091 £8r» 


j-iipoiytic enzyrnes 


1Q A 7fj^ 1 5 1£Q 1Q1 


1 9 £4 
IZOH- 


TPT3O094£QT 
l.r.DUUZ'+oyi 


uipepiiayi peptidase iv, iN-xerniinus 


1 0 QQ ^ 9Qp HQ 91 1 990 
lu.^yf \j.z,¥&-\jy zi i-zzy 


1 9^4 
1ZU*+ 


pom 040R 


i yrobyi-iiviN a syiiLneLai>e bigiiaiurc n 


14 7 00f=»-iiQ 94^-9^0 


1 Off 
IZOO 


TPT3AA07*\1 A 


oiusienn 


19 90 £ 4^,-= ISO K 
iz.zu o.H-je-io z-io 


1 Off 

IZOO 


TPT3A0A7-t1"D 


oiusienn 


1 1 49 1 fifi<=» 4fl 10 SI 


1 Off 
IZOO 


tptj aaa7^it*\ 


oLusterin 


1*^ 14 1 nn^ 4n i^i 9ri4 
iD.j^t i.uue-'+u ioj-zu*+ 


1 Off 
IZOO 


tpt3aaa7*;i'R' 


L^iustenn 


11 94 1 nn<=» 4n 9in 97Q 


1 Off 
IzOO 


TP"DAAA7C1T? 


oiusterm 


1 7 qo i nn» ac\ 1 1 n 1^1 


1 Off 
IZOO 


l.r.t5UUU /Zovj 


v^iusterm 


17 71 Q T4 179 1QA 


1ZO / 


TPT3 AAA9 1 *; A 


oerpins 


1 1 fk 1 Q 1 4*=» 1fi 1 A7 1 1 A • 

lo.ui y. i^e-io iu/-!ju 


1 Ofl 
1Z0 / 


TPT3AAA9 1 s/" 1 


Ql»fM v« fl 

oerpins 


1 1 QH 7 85p 1*^711 99^ 

ij.yu /.ooe-ij zi i-zzj 


1ZO / 


TPT3AAA91 
lr JrSUUUZ 1 DU 


oerpms 


1 *\ 1*a 1 74<a 17 114 1/1H 


1 9£7 
1ZO / 


tphaoa9 1 ^t? 


oerpms 


1 *^ 1<^ & &9.r- 1 /=; 4A1 49^ 
Ij.jO O.Oc5e-10 *+\J L-HtjLD 


1 9frS 
IZOO 


tpraoa9 ha 

lr JtJUUUZ 1 J A 


oerpins 


11 01 Q 14p» IS 19^-145? 


1 OfQ 
IZOo 


tp"raaa91 <tz 

IJtJdUUUZIDiJ 


oerpins 


Q S7 1 89^ 1 1 91 8 910 

y.o / i.oze-ii zio-zju 


IZOo 


tptjaaa9 1 

l.r JdUUUZ 1 do 


oerpins 


11 QO 7 1 *\ 947 9^1 
1J.7U / .ooc-Ij Z^f /-ZD I 


1 9^8 
IZOo 


TPRAfi^') 1 *TT"i 
lr tSUUUZ 1 jIJ 


oerpms 


1 ^ 1<% 1 74p- 1 7 1<IO 17^ 
1J . j«5 j . t ne- 1 / jjU-j / 0 


1 ofo. 
izOo 


lrr>UUUzl Jii 


oerpins 


1^7 1 Qa IO 417 4/C 1 

id.oo /. iye-i 0 h-j /-*to 1 


iz Id 


tptjaai in7 a 
IJrJoUUl dU /A 


Kiioaanese signatures 


1 1 Q52 C 1 O 1 O 90 


1 9 71 
1Z / j 


tpraa 1 1 a7p 


iNjioaanese signaxures 


1 & 00 1 RQi=»-.1 4 4a~£1 
IO.U_7 .5 ,o>c~ I*+ ^o-Ol 


1 771 
1Z / J 


TPRnni iA7f~ i 


Jvnoaanese signatures 


1 S 1 1 1 9Sp„1 S 59„ 1 01 
lJ.l 1 I.ZOC-IJ OZ-lUl 


1 771 


TPTJAAI 1A7FI 


ivnoaanese signatures 


1 ^ C7 4 00«= If, 110 1 

id. 0/ *+.uue-zo iiu-ijj 


1 771 
1Z / j 


tpt^aoi in7in 


ivnoaa-nese signatures 


1110^ 1^ 1 0 1 R0 1 Q1 


1 771 
1Z / J 


TPRAA1 1A7f"i 
lr DUU I D\J IKJ 


Jvnoaanese signatures 


11 71 Q 11p> 1/^9^1 9^51 
11. / j y .j ic- 1 0 zj 1 -zoo 


1 77/1 


tpt^aaooai 


X^t"i"i*i firl q 

jvringie 


11114 14 1 47 1 £1 


1 774 
IZ /*+ 


tptiaoaoai 

lr\L5UUUUU I VJ 


jsjingie 


90 90 4 4£f» 1 1 9*^0 9S7 


1 774 
IZ / H 


tproooooi t-j 

lI\t>UUUUU 1X1 


jsjringie 


1 9 94 9 RO<=> 1 0 900 100 
iz.z*r z.owc-iu ^yy-j\jy 


1 774 
IZ /*> 


tptiaaai 7£ a 

I.T.DUUU1ZOA 


oerme proteases, v o iamny 


1 1 7S Q 9Sp 1A 1 47 1 fO 

11. /J 7.ZJC-1U 1*+/-1UZ 


1 77/1 
1Z/4 


tpqaaai 771^ 


Apple oomain 


11 1Q9 tQp 11 1 4Q 181 

lo. iy z.i oe-i i i*+y-ioi 


1274 


IPB000177L 


Apple domain 


11.33 9.92e-10 178-216 


1 07/1 

lz/4 


IrrJLKJUl / /JN 


Apple domain 


1 A 17** IQo 9/1 9QQ 191 

iiM / j. iye-z*+ Zoy-oZj 


1274 


IPB000177O 


Apple domain 


14.39 9.00e-16 324-352 


1274 


TTITIAAI Of /I A 

1PB001254A 


Serine proteases, trypsin iamily 


y.ya J.3/e-lo 14/-loi 


1274 


IPB001254B 


Serine proteases, trypsin family 


15.01 5.50e-16 297-320 


1 TT /I 

1274 


TT"»T~> f\ f\t ^ A 

IPB001254C 


Serine proteases, trj^psin family 


16.54 4.8oe-20 327-346 


1274 


IPB002093D 


BRCA2 repeat 


8.99 9.77e-09 8-46 


1274 


PR00722A 


Chymotrypsm serine protease family (SI) 

• J T 

signature I 


12.06 3.8Se-15 148-163 


1 T7 /I 

1274 


PR00722B 


Chymotrypsm serine protease family (SI) 
signature 11 


12.69 3.74e-09 203-217 


1274 


PR00722C 


Chymotrypsm serine protease iamily (SI) 
signature ill 


10.74 1.69e-14 296-308 


iz/y 


PKUU /U41 


Calpain cysteine protease (C2) family 

cionatnrp 
olgllaLUlC 1A 


y.4z4.I4e-14 14J-I/J 


1284 


IPB001706 


Ribosomal protein L35 


21.01 1.00e-18 119-157 


1285 


IPB001706 


Ribosomal protein L35 


21.01 1.00e-18 119-157 


1287 


IPB000822 


Zinc finger, C2H2 type 


14.67 L43e-ll 399-424 


1287 


IPB000822 


Zinc finger, C2H2 type 


14.67 2.50e-20 259-284 


1287 


IPB000822 


Zinc finger, C2H2 type 


14.67 3.25e-19 231-256 


1287 


IPB000822 


Zinc finger, C2H2 type 


14.67 5.50e-19 343-368 


1287 


IPB000822 


Zinc finger, C2H2 type 


14.67 6.54e- 18 203-228 
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LmJ 


Database 
entry ijj 


JUescription 


xvesuiis 


1 987 
IZo / 


tphooos99 

lr JjUUv/OZZ 


z^inc linger, LZriz xype 


1 4 67 8 S(V- 1 ^ ^ 1 AO 


1 987 
IZo / 


tprooo899 

lr JC3UUUOZZ 


Z.111L linger, i^znz type 


1 & ftl R 69p_ 1 8 9 87-^ 1 9 

It.U / O.OZC- 1 O ZO /"J 1 Z 


1 987 
IZo I 


TPP1O0O899 
lr jjUUUqZZ 


Ziinc linger, c-zriz xype 


lH-.o/ y.iue-zz l /z)-zou 


1 987 
IZO / 


lr.DUUUoZZ 


Ziinc linger, v^zriz type 


1 A ^7 Q lAp 99 ^71 ^Q^ 
i^+.o / j*. iue~zz j / i~o"o 


1 987 


tppiooi 97s 

1.T13UU IZ / J 


uivi jl^ina DinQing uoniain 




1 987 


TPR001 97S 


uivi JL/iN/v omaing aoniain 


10 17^ 57f»-1 1 97^-1 Izl 

Ly , l / D.J / C" 11 Z/J"j1t' 


1 987 

IZo / 


tprooi 97s 

IrDuUlZ / 0 


-LJivi JLiiNA uinomg aom^iiu 


1 0 1 7 A %Ap>-\ 1 1 01-9^0 


1 987 


tprooi 97s 


J-/1V1 ui\zY Dinaing aomain 


10 17 0 60f*-19 947-986 

li/.l/ 7.0UC - IZ Z*r / -ZOO 


1 987 

IZO / 


1JT IZ / J 


J-/1V1 J-71>Z\ Dinumg UOIIlalH 


10 17 0 R7p- 1 9 1 -^70 


1 987 
IZO / 


tprooi ono 

IrUUU 127U27 


jSJVtVJj UOX 


17 %1 ^ 99 16 ^0 
i / . j / o.ooe-zz lo-jw 


1 987 


pr 0004R a 


C ^9T-T9 _tT.JT><^ 71ftn -fl T> fral* Ct <ST\Otl Iff* T 

v^iinz»-Lype ZiHiv ungei oigiid.iuie 1 


Q OA 9 SQe-1 ? ^06-400 


1 987 


PR 00048 A 


/~ , 9 T— T9 I71T1P "flT1fTP>T" CI cm tltl 1TP T 

^ziiii"i,ypc zjiio liiigei oigiiatuic 1 


0 04 9 8Re-00 ^4.0-^^^ 


1 987 


pr 0004s a 


^znz-iype zinc linger biguaiure 1 


0 Ozl ^ 77f=»_1 A 768-781 


1 987 




i^zxiz-iype zinc imgcr signature i 


O 04 ^ K P 14 984 907 
27.27'+ J.Ijc-14 Zo^f-Z27 / 


1 987 
IZo / 


prooo48 a 


i^znz-type zinc linger signature 1 


0 04 6 63^ 00 719 79^ 
^.27^+ O.Ojc-Uy jIZ-jZj 


1 987 


prooo48 A 


\^zriz~iype zinc linger signature 1 


0 04 6 77^-1 H 9^6 960 
y.y^r o. / J2>e-iu zjo-zoi? 


1 987 
IZo / 


prooo48 a 


v^zriz-type zinc linger signature 1 


Q Q4 Q 7 6/=» 1 7 9HH 917 
y.ytf y.jOG-LJ ZUU-Z1J 


1 987 


proooact^i 


v^-zriz-iype zinc linger signaiure 11 


^ ^9 1 Hflo 19 188 1 Q7 

j.jz i.uue-iz 100-127/ 


1 987 

LZO / 




v^zriz-Lype zinc linger signature 11 


^^9 7 9^f=- 1 1 784 7Q7 

J JZ J.ZjC-11 J27J 


1 987 
1Z0 / 


pp oooast} 


^zriz-type zinc linger signature 11 


04 nn<=» no 7^£ 7/^^ 


1 987 
IZo / 


PROooAQn 


11 z -type zinc linger signature 11 


J.jZ 4.0OC-IU Z^H— Z3J 


1 9 87 
IZo / 


PR00048T* 


v^znz-type zinc linger signature 11 


^ ^9 ^ ^0p no 7nn 7fio 

J.JZ j.JUc-Uy jOU _ jU27 


1 987 
IZo / 




v^zoz-type zinc linger signature 11 


^07 nrt#=-19 979-981 
J.«2>Z /.UUC-IZ Z/Z-Zol 


1 909 

izyz 


Tpnoooo7 Ar* 


i_/aminin 0 


19 07 7 lip HQ ^4 79 
IZ.27/ / *D ie-U27 DH- /Z 


1 909 

1Z2?Z 


tprooos^i 


iivjj7-iiKe aomain 


4 80 8 CWf* AO ^4 69 
'f.oi? o.\//e-U27 jH-OZ 


1 9Q9 
1Z27Z 


tprooi '7'7zir> 
IrrJUUl / mD 


jLJeiia serraie ngana 


1 0 97 01 no 9H 
Yy.AD D.y ie-U27 zu-oo 


1 9Q4 


TPRHfll 007T3 
lr D\j\J L\J\J fr> 


von wineorana iactor 5 type repeat 


in ri7 9 sop no 09 im 

lu.UJ Z.027e-027 2?Z- I U 1 


1 90S 


T/PR000899 


z^inc ringer, i^zriz type 


14 67 1 00a 1 8 477 4^8 


1 90S 


Tpnoooao9 

Lr r> \J KJ \J OZZ 


z»inc ringer, v^znz type 


14 fTl 1 flAp 99 AH^ 47A 
L^.Z) / 1 .UUe-ZZ *H/j _i rju 


1 90S 


TPPJ0O0899 
Lr X5UUUOZZ 


Zjint- linger, \_/Zriz type 


14 67 1 OfV-99 791-74^ 
L*T*\J f 1.7UC"ZZ jZl-jnO 


1 90S 


TPR000899 
lrr>UUL/oZZ 


Ziine linger, lzhz type 


14 67 4 AfWin 96^ 900 

IH'.O / H-.UUC-lu ZOJ-Z27U 


1 90S 

1Z27»2» 


TPR000899 


Zjinc ringer, i^znz type 


14 67 4 86^-10 480 -^14 


1 90S 


TPR000899 


z_iiuu lixigci, Lznz type 


14 67 6 9^f*-10 777-409 


190S 
IZ7J 


TPR000S99 
Lr J3UUU OZZ 


Zjine linger, Lznz type 


1 4 67 7 7Sf»-70 46 1 -486 


1 90S 


tprooos99 


ZjIiil linger, v^zriz type 


14 67 8 S0p_ 10 74Q-774 


1 90S 


TPT3001 999 
IrDuU 1 zzz 


i jtiilj zmo no uon aomain 


94 67 7 66p-00 477-460 


1 90S 


TPROA1 97S 
irrJUvlZ / J 


i^ivi L'inza. omaing aomain 


10 17 7 8Sp-00 70Q-.748 


1 90S 

izbo 


TPPHA1 97S 
IrrJUUlZ / 0 


uivi uiNA Dinaing aomain 


10 17 4 77a 1 0 777 77£ 
127.1/ L t. ! /e- 1 U DO / -D /0 


1 90S 


TPT3001 97S 
IrDUU IZ /.? 


uivi uiNA Dinuing aomain 


1 0 1 7 S 99p 00 49 1 4<^0 

127.1/ J).ZZe-U27 HZl-fOU 


1 90S 


tppiooi 97s 


T ^7v/T T^\I A Ti"t n i^l i m fT j'I^ty^^it^ 

jL/ivi L-/i>i/\. uinQing aomain 


10 17 8 49<=»-10 707-479 
127. 1 / O.T-ZC-IU DyD "tJZ 


1 90S 

1Z273 


tprooi 97s 

IrJjUUIZ /3 


LiLVL L/iN/v omaing aomam 


1 Q 1 7 Q 97p 1 O 44Q 4RS 

12/.I/ 27.z /e- 1 u H-H-y-foo 


1 90S 


TPRHA1 000 


K'R A R V»r»v 
JSJVrVD DOX 


17 77 0 S0p 79 10 44 
1 / .D I y.J\Jc-DA lU-H-f 


1 90S 


pp nn 048 a 


^zriz-type zinc linger signature i 


O 04 1 00p 10 718 771 
27.27'H- L ,\J\JG- L\J D Lo-DD L 


1 90S 


pr 00048 a 


v^zjnz-iype zinc linger signature i 


Q Q4 1 OOp 1 7 409 41 S 
27. 27*+ 1.UVC-1J) HUZ-T-iJ 


1 90S 

izyo 


prooo48 a 


v^zxiz-type zinc ringer signature i 


Q 04 1 64p 1 7 74^ 7SQ 
27.27^ l.O^e-l^ DHO-DDy 


1295 


PR00048A 


C2H2-tvoe zinc fmeer signature I 


9 94 5 76e-12 374-387 


1295 


PR00048A 


C2H2-type zinc finger signature I 


9.94 5.85e-14 458-471 


1295 


PR00048A 


C2H2-type zinc finger signature I 1 


9.94 7.92e- 14 486-499 


1295 


PR00048A 


C2H2-type zinc finger signature I 


9.94 8.62e- 14 430-443 


1295 


PR00048B 


C2H2-type zinc finger signature II 


5.52 1.50e~09 418-427 


1295 


PR00048B 


C2H2-type zinc finger signature II 


5.52 3.50e-09 474-483 


1295 


PR00048B 


C2H2-type zinc finger signature II 


5.52 6.79e-10 334-343 


1295 


PR00048B 


C2H2-type zinc finger signature II 1 


5.52 8.50e-09 390-399 
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SEQ 

ID 

ISO: 


TV A. 1 

Database 
entry ID 


Description 


xvcSUUS 


lZyo 




zinc linger, v^zxiz type 


14 67 1 6Qc-18 304-329 


1ZVO 


IroUUUoZZ 


zinc linger, v^zriz type 


14 67 3 25e-14 444-469 


lzyo 


IVIdUUUoZZ 


zinc linger, i^znz lype 


1 4 67 3 77e-l 8 220-245 


lzyo 


lJrJbJUUUoZZ 


Ztinc linger j v^zriz type 


14 67 6 25e-19 248-273 


lzVo 


lrJb>UUubZZ 


Zjinc linger, v^zriz lype 


14 67 6 73e-21 416-441 


lzyo j 


Tr>t> a r\ f\ q i o 
irrJUUUoZZ 


Ziinc linger, lzhz type 


14 67 7 00p-20 276-301 


1 o a^t 

1296 


TTIDAAAOI O 

lrBUOObZZ 


Ziinc imger, v^ztiz type 


14 67 8 ?0p-1 6 388-41 3 


1 o A/T 

1296 


l^BUUUoZZ 


Ziinc iingei, i^zxiz type 


14 67 8 90r-23 102-217 


1296 


lrBUOUoZZ 


Ziinc linger, v^zriz type 


14 67 8 36e-21 332-357 


1296 


TBDrtAAOT) 

IFoUUUoZZ 


z,mc linger, l^zmz type 


14 67 8 50e-19 472-497 


12^6 


T"DT1 AAAC1 0 


Ziinc linger, v^zriz type 


14 67 8 62e-18 500-525 


1296 


TDDAAAOO 


Zvinc linger, ^zriz type 


14 67 9 18e-21 360-385 


izyo 


TDT3AA1 77*1 


uivi ijin a Dinairig uoindin 


19 17 1 97e-10 488-527 


lZyo 


TDT3AA1 07^ 

lr\fc>UU LZ /3 


uivl J-JiNA oinamg aomdin 


19 17 2 37e-09 376-415 


1 oo^ 

izyo 


TDT2AA1 OT^ 

lirr>UUlZ/D 


UlYi UlN^v DlTiaill^, QOlIialll 


19 17 2 95e-10 ^64-303 


izyo 


TDT3AA1 77^ 


U1V1 U1NA oinGing uoniaiii 


19 17 3 96e-09 236-275 


izyo 


T"DT2ftfl1 
IroUU vLi y 


r*\ A A T"Y\T A V\i nrliiifT rl Qin 

JL/1V1 UlNrv DITiaillg QOIIlcllIl 


19 17 5 10e-09 432-471 


izyo 


TTJTaAA t 17^ 


i^ivi uinuiiig uornd.ni 


19 17 5 38e-10 208-247 


1296 


IfrSUU 12/ J 


UIV1 UtN A UlllCllng (lOIHallL 


19 17 5 50e-10 404-443 


1296 


lr\bUUlZ/;> 


dm. jjina uinamg aomain 


19 17 5 76p-11 460-499 


1296 


TT1DAA1 OTC 

IfBUULZ/j 


L/M. ulna omaing aotuain 


1 Q 1 7 5 Q0p-09 1 80-? 1 9 


1296 


TBDAA1 

lr^r»UUlZ/D 


uivl uiNA Dinaing uomaiii 


19 17 6 79e-l 1 348-387 


1296 


TDDAA 1 OT^ 

ItVoUulZ/j 


uivl uiNA oinamg aomain 


1Q 17 8 30p-10 320-359 


1 o a /c 
1296 


TT3DAA1 AAA 


JvKAtJ DOX 


17 37 5 50p-37 16-50 


1296 


T>t> AAA/1 O A 


czwz~type zinc imger signature i 


q Q4 i 38e-09 217-230 


1296 


T>TJ AAA/1 O A 


L/Zxiz-type zinc ringer signature i 


Q 94 9 06e-1? 245-258 


i o n /C 
1296 


Tif) {\f\f\A O A 

FKUUU4oA 


ozriz-type zinc linger signature i 


0 94 9 06f>-1 ? 301-314 


12y6 


T>T) AAA/1 O A 


ozriz-type zinc imger signature i 


Q 94 3 86p-1 0 973-286 


i o A/c 
1296 


nn AAA/1 Q A 


v^zriz-type zinc imger signature i 


0 94 4 71 p-1 2 41 3-426 


1296 


tjt> AAA/1 O A 


ozriz-type zinc imger signature i 


0 Q4 4 86p-1 3 399-342 


1296 


T>n AAA/1 O A 

l J KUU(J4oA 


czriz-type zmc imger signature i 


9 04 5 50^-15 385-398 

y.y t -t JJl/C-lJ J0J"J7O 


1296 


T>T) AAA/1 O A 

l J KUUU4oA 


OZriz-type zmc imger signature i 


Q 04 S0p-1 5 469-489 


izyo 


"DT> AAA/1 © A 


v^Zriz~type zmc imger signature i 


9 94 5 91e-1 0 497-51 0 
y * y*-T y *y i c i v "Ty I y i \j 


1296 


"DT> AAA/1 O A 


v^Zriz-type zmc imger signature i 


0 04 6 1 4p-1 3 1 S9-909 


I2y6 


T)T) AAA/1 Q A 


L^zriz-type zmc imger signature i 


q 04 6 1 4e-1 3 595-538 


1 OOA 

i2yo 


PPAOAAQ A 
JrK.UUU4oA 


ozri^-iype zinc linger oigncuuic i 


9 94 6 82e-12 357-370 


i2y& 


DD AAA/1 CT3 


oziTZ-type zmc imger signature 11 


5 52 1 00e-10 261-270 


i2y6 


t>"D AAA/1 CD 


^Zriz-type zmc imger signature 11 


5 59 1 00e-10 289-298 


1296 


PR00048B 


C2H2-type zinc finger signature II 


5.52 2.29e-10 205-214 


1296 


T-»T> AAA /I O D 


L^Zxiz-type zinc imger signature 11 


5 S9 4 86p-10 513-599 


1296 


T*T» AAA/1 OT> 

PR0Q04813 


cZriz-T3^pe zinc linger signature ii 




1296 


T>T» A A A A 0 T~» 

PR00048B 


CZriz-type zmc imger signature 11 


^ S9 5 5n<=-10 4S^-4Q4 


1296 


T»T* AAA/f OT> 

PR0004SB 


CZrlz-type zmc imger signature ll 


S ^9 5 50p* 1 1 34^-3S4 


1296 


PR0004SB 


CZrlz-t3^pe zmc imger signature ll 


S *J9 70<=» If! 9^^-949 

j.jz o. /^?e-iu zdj-zh-z. 


1296 


PR00048B 


C2H2~type zinc finger signature II 


7 1 1 49Q 4^S 


1 9QQ 


lriyyjUKJ Ly\f 


binding region 


5 86 1.00e-ll 188-198 


1299 


IPB001818B 


Matrixin 


26.48 4.63e-13 87-128 


1299 


IPB001818C 


Matrixin 


24.38 4.54e-09 134-179 


1299 


IPB001818D 


Matrixin 


14.91 3.36e-25 182-213 


1299 


IPB001818E 


Matrixin 


8.86 4.26e- 10 222-235 


1299 


IPB002870E 


Reprolysin family propeptide 


11.90 2.00e-09 190-202 


1299 


PR00138A 


Matrixin signature I 


12.54 9.25e-10 37-50 
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! STCO 
ID 
NO* 


entry ID 


x# e& crip 1 1 on 


Results* 


! 1299 


PR00138B 


iVia.lL lA.il! M^UalUie 11 


14. Hr. j ie-1 1 iUo-lzl 


1299 


J, XWVJ IJOv 


xviairiAin signature ill 


zU.U/ o.ooe-11 132-160 


1299 


PR001 


xvldLl 1A.11 x ol^IldLUrC I V 


1/1 <»'7 1 Q/Zn OI IOO OIO 

i4. D / l.ooe-zl loo-21j 


1299 


PR00 1 38F 


iVldLllAlll signature V 


/.IU y. / /e-1 1 222-235 


1299 


PR00480B 


Astacin family signature II 


14.35 1.14e-09 183-201 


i 1^01 

1 IJul 




lmmunogioDunn ana major 
iiioiuiroixxpaii unity complex Qomain 


20.23 6.09e-l 1 150-187 


1305 


TPB002579C 


i^uiiiaiii ui uuKnown lunciion uurzo 


J.oye-2o /5-1U2 


1305 




ivuiuain oi uuKnown iunction uxjrZD 


11 CC 7 no. 10 1O0 1 O A 

11. 3D /.yze- 12 128-139 


1305 


TPB002S7QF 

XX J_> UI'aJ / 


x-/L>iriciiii ui unKnown iunction uurZD 


lz.14 y.31e-13 149-159 


1307 


IPB002350 


xvazcti-typc serine protease inniDiior lamuy 


3i. /o 4.uye-io Zo-oo 


1 1307 


PR00290R 

X XVL/ V/ _7vy XJ 


is.azd.i- type berine protease innioiior 

c{ on ci i"i irp TT 


lo.oo l.zoe-lu 39-j(J 


i 1308 


IPB001462G 


xxc;ija.uii<iviiai x proLciii v^-tciTninai Qoniaiii 


1*7 "3 QO Q r\o ^ro ^70 

1 1 .ll 3.3oe-uy jZ- i l 


[ 1309 


IPB001271 

XX \J\J\J1.£*I JL 


xvxdu ll lidll all UclCIlblll 


iy.y/ 3.4ze-zi uy-iov 


l 1312 

X %/ x xw 


IPB002041 A 


uir-uiiiumg nucicar protein i\.an iamiiy 


ll.lz y.3oe-lz 63-107 


1 1312 


PR00449A 

X JlwUTT/A 


x xaiioivjixiiiiig piotuui rzi ra£) signature i 


lz.4o z.D/e-lo o3-o4 


1312 

J- — / X Xw 


PR00449B 


i lau&iuiiiiiiig pioiem rzi ras signature 11 


1 /f OA /I O 1 _ AA O £~ 1 AO 

14. oy 4.ole-Uy 50-lUz 


1312 


PR00449C 


x laiiaiuiiiiiiig piuLcui x^i idb signature 111 


ic\ c 04: Q n i ao Toe 
1D./U o.zoe-lz 1U3-IZJ 


1312 


PR00449D 


T^rari c TiPlTm 1H(T nr r^t pin T3 0 1 rao c-i rmo+n»"/a T\ 7" 
x laxiaxoilllxilg jJUJLClxl xXl Lao blglidLLire IV 


1 f\ A7 ^ fA Q io i /T'7 i on 

lu.z/ o.oue-13 10/-I0U 


1 1312 


PR00449E 

X XX-V/v/ f \ ^ X^ 


x xaiiaiuiiiiiiig {JiULCifi r Z.L TaS Signature V 


n o A/i Q 10 oAi 001 

lo.oy z.U4e-i3 zvl-lld 


1314 


IPB000822 


7\v\c "fincrpr f~ , '7T-T'7 t\mp 

•fc-'lllv' Xlllgv/ij V_/Z<JLLC( Ly|JC 


\ A £J1 1 ^Oo 1 Q 1 /CC 1 OA 

14.0/ i.oye-io loj-iyu 


! 1314 


IPB000822 


xjiiiW' iiiigei, uznz Lypc 


1/1 1 *7C^ 1A OAC ooa 

14.0/ 1. /je-ly 3UD-330 


i 1314 

X «-^ F X i 


IPB0008^2 

XX U v V\/U^xv 


£-iixi\s xiiigci 9 ^/znz type 


1 /t /C*7 O Ola 17 077 OAO 

14.0/ z.y3e-l / z / /-30z 


1314 


IPB000822 

XX x^vuv/u^^ 


^-*nit' xiii£d, v^Z/iiz, uypc 


14.0/ o.uoe-io iy3-zio 


! 1314 

i M. ' — ' x r 


IPB000822 


Xjiiiu iiiigci, v^zrxz iype 


1/1 /CT < /IAq 1 c 001 o/t^r 
14. 0 / o.4Ue-lo zzi-z4o 


1314 

J X — * x V 


IPB000822 

XX JJUUUUxy^ 


7inr fin crp»r f^9rT9 tvnp 
z^iiiw* iiiigci^ v^znz iype 


1 A £11 < 1/1 T31 oro 

i4.o/ o.o3e-l4 333-3_>o 


! 1314 

X — ' X | 


IPB000822 

XX X-* \J \J \J KJ xv 


7inr fin oer f~ 1 9 T-T9 tvnp 
xjiiil- xiiigci ? v^zxxz type 


1/1 /CTQOIIq 10O/1A OT/1 

14.0/ y.3le-io z4y-z/4 


1314 


IPB001275 

XX X-» V \J X. I 


x^ivx x-zin^t. uiiiuillg LlUillain 


IO 17 O OOa 10 0O1 Q/CA 

iy.1 / z.cSoe-lz 3zl-3uU 


1314 


IPB001275 

XX JJuw x xw / 


OlvT TiM A ttin/Tincr rlnmaifi 
xvivx xL^lN^rv UlllLllllg U.UlIlcllIl 


1Q 17 Q /JO^ irv 0 A A O A O 

iy. 1 / 3.43e- 1U 2(jy-z4o | 


1314 


IPB001275 

X X X—* V \/ X XV / ^/ 


Dl^f T^T^JA TiinHino Hnmnin 
xyjivx x-'x^ix^. L/xxiunig ivH-Hliaxll 


iy. 1 / 4. ioe-iu zoj-jU4 


1314 \ 

1 X X F 


IPB001275 

XX J— * Uv X XV / *v 


xyivx X-'lN^v UillUiilg U.UiilalIl 


1Q 17 /l Q/T „ 1 1 OOT 07^ 

iy. 1 / 4.ooe-l 1 z3 /-z /o 


1 1314 

x w' x r 


IPB001275 


i^llvT PlTvJA ninHincr /Trvmtiir* 
x^ivx ±-/l\x\. UlllUlIlg (aOllld.111 


IO IT C 7/Cq 1A oni 000 

iy.1 / j.zoe-lU Z¥d-532 


1 1314 


IPB001275 

XX Uv V x f 


rVlvT Dl"NJ A nin/Tin<r flAmciin 
X-/1VX X-/in/\ UlllCllilg UUIlldin 


in 17 < 11 1 Q 1 OOA 

iy. 1 / o.ooe- 1 1 loi-zzO 


1314 


PR00048A 


\-^rx<i-Lypc zinc linger signature i 


QO/ITT'2^ m Q AO 

y.y4 z.z3e-iu 3Uz-3 1 j 


1314 


PR00048A 


^zrxz-iype zinc linger signature i 


O O/l O OAa IO HA O/IO 

y.y4 z.zye-13 33U-343 


1314 


PP 00048 A 


^xriz-Lype zinc imger signatuie l 


Q O/t O OQa IO OIO OOI 

y.y4 z.y3e-i3 z io-z3 1 


1314 

X X F 


PR00048A 


1*9 TT9 — t"\/nf* Tinr> rinrrpT* ci rm Gtiiro T 

v^zxxz-Lype zinc imger signature i 


O O/l /I 1 0 1 npi or\o 

y.y4 4.ooe-i3 iyu-zU3 


1314 


PR 00048 A 

x x vv/ w V/" <J xv 


v>zxxz-Lype zinc linger signature i 


Q Q/t < O /C/-. 1 1 OT/1 OOO 

y.y4 j.zoe-11 Z/4-ZO/ 


1314 

X X 


PR00048A 


o^xxz~Lypc zniu linger signature 1 


Q O/l ^ 1 Aa. 1 *2 1 /CO 1 7C 

y.y4 o.i4e-io ioz-i/3 


1314 

x x r 


PR00048A 


1^9 T-T9~t^7nF' 71M flUtYPr c"i rm n r-/=» T 

v^zrxz-iype zinc imger signature 1 


O Q/t Q A7 0 10 oro 0"7i 

y.y4 o.u/e-13 3do-371 


1314 

x X I 


PR00048A 


v-^^xxz, type ziiiu imger signature i 


Q QA 8 Ofio nQ T/1/C OCA 

y.y4 o.ooe-uy z4o-zj>y 


1314 

x x r 


PR00048B 

X X v\_/ (J UT U XJ 


v^-^ nz, type zinc imger signature 11 


C CO O OOo 1 A 1 TO 1 OT 

D.J A Z.Zye- I U 1 /o- i 0 / 


1314 

x w>* x r 


PR00048B 


i 9 rT9— tvnp 71T1 A finOfPT' cirmq-fiiro TT 

^x-xx^-iypc z,inc linger bignature n 


C CO 1 C7o 1A HO OOT 

D.jz o.D /e-lU 3 lo-3z / 


1315 


TPBOOOOOfi 


vciLcuidie nieiaiiotnionein, iamiiy i 


1*3 A 1 1 <0« 1ATO 100 

13.41 1. Doe- 10 78-123 


1315 


IPB000006 


Vertebrate metallothionein, familv 1 


13 41 1 92e~09 88-133 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 2.06e-10 63-108 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 2.65e-09 28-73 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 2.93e-09 5-50 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 3. 07e~12 72-117 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 3.57e-09 82-127 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 4.86e-09 83-128 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 5.06e-10 56-101 



t 



SE 
ID 

NC 

13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
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Database 
entry ID 



Description 



Results* 



IPB000006 



Vertebrate metallothionein, family 1 



IPB000006 



Vertebrate metallothionein 



IPB000006 



Vertebrate metallothionein 



IPB000006 



Vertebrate metallothionein 



IPB000006 



Vertebrate metallothionein 



IPB000006 



Vertebrate metallothionein 



IPB000006 



Vertebrate metallothionein 



13.41 5.06e-10 77-122 



family 1 



13.41 5.10e-ll 46-91 



family 1 



13.41 5.68e-09 36-81 



family 1 



13.41 5.96e-09 48-93 



family 1 



13.41 6.30c-ll 87-132 



family 1 



13.41 6.33e-09 25-70 



family 1 



13.41 6.97e-09 93-138 



IPB000006 



Vertebrate metallothionein 



family 1 



13.41 7.48e-10 15-60 



1PB000006 



Vertebrate metallothionein 



family 1 



13.41 8.20e-ll 101-146 



IPB000006 



Vertebrate metallothionein 



family 1 



13.41 8.44e-09 73-118 



IPB000006 



Vertebrate metallothionein 



family 1 



13.41 8.81e-09 61-106 



IPB000006 



Vertebrate metallothionein 



family 1 



13.41 8.84e-10 33-78 



IPB000006 



Vertebrate metallothionein 



family 1 



13.41 9.38e-12 34-79 



IPB000006 



Vertebrate metallothionein 



family 1 



13.41 9.S2e-09 91-136 



IPB000006 



Vertebrate metallothionein 



family 1 



13.41 9.90e-10 102-147 



IPB000359B 



Cystine-knot domain 



19.26 3.48e-10 129-147 



IPB000359B 



Cystine-knot domain 



19.26 6.34e-09 90-108 



IPB000437 



Prokaryotic membrane lipoprotein lipid 
attachment site 



13.83 7.23e-10 10-21 



IPB000437 



Prokaryotic membrane lipoprotein lipid 
attachment site 



13.83 9.31e-10 20-31 



IPB000726B 



Glycoside hydrolase family 19 



5.39 5.68e-ll 8-18 



IPB000726B 



Glycoside hydrolase family 19 



5.39 7.84e-ll 18-28 



IPB001007C 



von Willebrand factor, type C repeat 



20.89 4.27e-09 74-95 



IPB001169K 



Integrin beta, C-terminus 



27.45 1.00e-10 10-52 



IPB001169K 



Integrin beta, C-terminus 



27.45 2.21e-ll 81-123 



IPB001169K 



Integrin beta, C-terminus 



27.45 4.04e-09 18-60 



IPB001169K 



Integrin beta, C-terminus 



27.45 4.77e-13 3-45 



IPB001169K 



Integrin beta, C-terminus 



27.45 7.31e-ll 13-55 



IPB001271 



Mammalian defensin 



19.97 1.00e-10 12-40 



IPB001762A 



Disintegrin 



23.93 2.24e-09 75-115 



IPB001762A 



Disintegrin 



23.93 2.41 e-09 95-135 



IPB001762A 



Disintegrin 



23.93 5.41e-09 114-154 



IPB002174A 



IPB002174A 



Furin-like cysteine rich region 
Furin-like cysteine rich region 



30.51 1.79e-ll 5-36 



30.51 1.90e-09 56-87 



IPB002174A 



Furin-like cysteine rich region 



30.51 3.19e-10 12-43 



IPB002174A 



Furin-like cysteine rich region 



30.51 9.16e-12 2-33 



IPB002494A 



Keratin, high sulfur B2 protein 



12.44 1.00e-08 15-48 



IPB002494A 



Keratin, high sulfur B2 protein 



12.44 l.lle-11 122-155 



IPB002494A 



Keratin, high sulfur B2 protein 



12.44 1.67e-12 68-101 



IPB002494A 



IPB002494A 



IPB002494A 



IPB002494A 



IPB002494A 



IPB002494A 



IPB002494A 



IPB002494A 



IPB002494A 



IPB002494A 



IPB002494A 



IPB002494A 



IPB002494A 



Keratin, high sulfur B2 protein 



12.44 1.72e-14 74-107 



Keratin, high sulfur B2 protein 



12.44 1.75e-l 149-82 



Keratin, high sulfur B2 protein 



Keratin, high sulfur B2 protein 



Keratin, high sulfur B2 protein 



Keratin, high sulfur B2 protein 



Keratin, high sulfur B2 protein 



Keratin, high sulfur B2 protein 



Keratin, high sulfur B2 protein 



Keratin, high sulfur B2 protein 



Keratin, high sulfur B2 protein 



Keratin, high sulfur B2 protein 



Keratin, high sulfur B2 protein 



12.44 2.00e-12 11-44 



12.44 2.07e-ll 64-97 



12.44 2.33e-12 54-87 



12.44 2.85e-17 78-111 



12.44 3.42e-09 98-131 



12.44 3.52e-09 79-112 



12.44 5.00e-15 117-150 



12.44 5. 60e-10 21-54 



12.44 5.65e-09 102-135 



12.44 5.93e-ll 5-38 



12.44 6.00e-15 83-116 



WO 03/054152 PCT/US02/39555 
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SEQ 

1JLI 

INX); 


Database 
entry ID 


Description 


XVCo tllliJ 


LD LD 


1"DR AAOzlGzl A 

lrJtDUUZ4y4/Y 


xvCIdllll, lllgll SUilUl SDZ, pUJlOHi 


12 44 6 01e-10 63-96 


ID ID 


T13T1AAO/10/1 A 


iveratin, mgn suiiur jdz proiein 


12 44 6 13e-09 6-39 


loir 
LD LD 


T"DT3AAO/10y1 A 


iveratin, mgri suiiur r$z proLcni 


12 44 6 56e-17 73-106 


101-^ 

LD LD 


TT3T3AAO/4Q4 A 

1 Jr JDU UZ4y 4/V. 


iveratin, nign suiiur x>£> prott;iii 


12 44 6 77e-13 16-49 


1 t i r 
LD LD 


tt>d aao/iq/1 A 
lr jfc>UUZ4y4A 


jtvcraiin, nign. suiiur dz {jiolciii 


12 44 7 24e-10 118-151 


ID Id 


I Jr r> U UZ4y 4/v 


iveratin, mgn. suiiur r>z proLem 


12 44 7 67e-12 113-146 


1 o 1 r 

1 J ID 


i"DT3AAO/1A/1 A 
lr jDUUZ4y4A 


jVcratirij mgn sunui r>x piuLcin 


12 44 8 00e-19 112-145 


ld Id 


tdd a ao/1Q/1 A 
lr JD U UZ4y 4/Y 


iverdtiii) nign suiiur jdz- piutciii 


12 44 8 16e-10 59-92 


LD ID 


lr JDU UZ4y 4/Y 


ivcid.Lin 5 nign sunui piuLcni 


12 44 8 56e-14 93-126 


Jo ID 


lrjt>UUZ4y4/Y 


tTtirotin LiinrTt CM ll "flit* T59 T\t*/^'f £>1 Tl 

rverdiin, nign sunui sdz> piDiciii 


12 44 8 57e-19 88-121 


1 0 1 ^ 
IJ ID 


lr JD\J UZ4i? *r/V 


TiT f»-rc»4"i -n lri crh cii1"Piit 130 TVTf^MTI 

rvcrciLiii, mgii sunui \s>±- pujiciii 


12 44 8 65e-09 69-102 


ID ID 


LJrr>UUZ4y i f/V 


LVCI dLlil, lllgll SUilUl Da piULOlll 


12 44 8 67e-10 84-117 

X 1 —' • ■ 1 V • V*° # V/ X V/ V/ 1 J. J_ / 


1 "3 1 
I j ID 


lr jDUUZ4y4/Y 


ivetdiiri, mgn sunui dz piuiciii 


12 44 9 0^e-09 89-122 


LD ID 


TT>13AAO/1Q/l A 
LJrJz>UUZ4y4A 


jvcrdiin, nign suiiur dz, piuiuixi 


12 44 9 71e-09 10-43 


ID ID 


JDUUZ4y4/\ 


js^eidim, mgn suiiur pujicin 


12 44 9 80e-10 103-136 

X • 1 V *S ■ w V/ V/ X V/ X V/ a w' V 


1D1D 


TT> DA no AR 
lJrJDUUZ4y4x3 


XVwIdLlIl, lllgll SUilUl DZ. piULCill 


10 58 1 27e-10 141-155 

X \I ■ w VJ 1 X « / w X V/ X \ .M. JL •w' 


1 Oi r 
ID ID 


TT>DAA0.4Qi4R 


ivcrdim, mgn sunui oz, piuicin 


10 58 3 92e-09 107-121 

1 Vik/U w/ » ^ iW V/ \J *S XV// X Xw X 


ID ID 




T/^ r»rci+i n VticrVi oiiItiit R0 TWrvtMin 


10 58 4 49e-10 131-145 


1 3 1 *\ 
ID ID 


lr JDU UZ*+;7 *+jD 


T^fvatirt riicrVi cnlfiir R0 TiTotpiTi 
jTvvidllil) lllgll auiiiui x>x> ^JisjiXsXii 


10 58 9 10e-12 102-116 


1 3 1 < 
LD LD 


TDDAAOilGAR 
lr LDU U Z4y 4 £> 


iveraiin, nign suiiur oz* piuicni 


10 58 9 10e-12 92-106 

1 Vik« U » X V/V^ X ^ -Xw X V/ V/ 


ioi r 
LD1D 


TDD A AO AQ AC* 
Lr r>V KJ Z^f y *+ \^ 


jtverdLin, mgri sunui dz. piL>icni 


14 46 1 00e-08 19-62 

X ~ V/ X«V/V/W V/V X \J>A-^ 


1 3 1 

LD LD 


TT>~FM}niAQAC* 
Lr D\J\J£ £ ty £ ry-' 


r/"/z»>-Qti ri TiicrLi cnlfnr R0 TTrr^tfin 

IVCldLlll, lllgll oUlJLUl X>X* piUFldll 


14 46 1 52e-15 86-129 


131^ 
LD ID 


TDDAAOAQAf^ 
lriDUUZ4y4V^ 


jverdimj mgn sunui r>z, piuLcin , 


14 46 1 59e-09 14-57 


LDLD 


rDDAAO/Kj/tf" 1 

lrr>UUZ4y4l^ 


jveraTiTi, nign suiiur lda pruicin 


14 46 1 79e-10 8-51 


1D1D 


TT>DA AO A QAC* 
lFJ3uuZ4y4v^ 


JveraXirij mgn suiiur dz, pruxcm 


14 46 2 55e-ll 62-105 


LDLD 


lr jt>UUZ4y4v^ 


am+lri Vi-J rrVi clll-fiit* R0 T"\T*/"\+P»in 

iveratin, nign suiiur oz, piuicni 


14 46 2 66e-13 61-104 

X ^1 « W Zy . V W X v/ X X V/ r 


LD LD 


LJrr>UUZ4y4L/ 


fc»-»-n+-tf» ViioTi mil At if 130 i^i*/^tf=»-f n 

iveraiin, mgn suiiur jdz proLcin 


14 46 3 39e-17 101-144 


LDLD 


!J J h>UUZ4y4C 


Jveraxin, nign suiiur r>^ proiem 


14 46 3 46e-1 6 1 10-153 


LDlD 


lrJt>UUZ4y4L/ 


iveraimj mgn suiiur oz, prutcui 


14 46 3 80e-10 80-123 

l~i~V/ • V? V/ W X V/ V? V/ X i« m-J 


131-*; 

LDLD 


tdda AO/i o Ar^* 
LJrr>UUZ4y4L^ 


IS f\ Irirrli pill A ir* R.0 Tif/rf'P'i Y\ 

jveraiin, mgn suiiur dz, pruieiii 


14 46 4 71e-25 81-124 

X ^ . ™ V_/ T. / X w \J X XA^ 


1 3 1 <C 

LD LD 


tddaao/IQ/I j"" 1 
lr r>UUZ4y4V_, 


IZ'/afo+i-n Ti-irrVi pn 1 fill* R0 T\f/^TP»1T1 

Lveraiin, nign suiiur dz piuiuui 


14 46 4 76e-10 67-110 

IT, "T \J • f v/ V/ X V/ V/ / XXV/ 


LDLD 


tho aah a c\ac^ 
lriDUUZ4y4v.- 


Kcrutiiij nign suiiur dz> pioLciii 


14 46 5 lle-10 111-154 

X • ^TKs *s * X X Vv I v XXX X 


1-31 « 

LDLD 


TPDA mAQAC* 


f^" f^vd't'i r\ LiicrLi cii1"Ait R0 TiTritF'lTl 
JVClaLlllj lllgll OUl±Ul DZ> piVJLClll 


14 46 5 lle-10 52-95 

X • 1 V^ ^/ « X X >^ V X^ 


LDLD 


lJrr>UUZ4y4V^ 


ivcrdim, mgn sunui sdz, piuicni 


14 46 5 84e-18 76-119 ! 

X • T V/ fc/ a w r w x v/ / V/ X X ^ 


LDLD 


rDDAAOAQAf"* 
Lr DUUZ*+y*+V^ 


is»crdLin, mgn sunui dz. piuLcm 


14 46 6 42e-10 51-94 


1 "5 1 < 

LD LD 


LJrJDUUZ4y4^ 


1Z" e^-f-r-j-f -i" n IrJcrln cillfllt* 130 ■rtf/rtpl Tl 
IVCldLlll, lllgll aUllUl r>z, piotciii 


14 46 7 00e-15 105-148 


1 ll r 

LD LD 


LJrr)UUZ4y4Ly 


T^/ar^a-f-iTi T*iifrTi OlllTlTT 130 TM*/~\TfiiTI 

js>eraiin> nign sunui oz. piuiciii 


14 46 7 47e-10 85-128 


LDLD 


TDDA AO /I A/1 f 1 

lFJb)UUZ4y4L^ 


lr^£i-»«ri+l -fl Vl 1 frll dlltfllt* 130 TVtwf'^l Tl 

Jverdtin, nign suiiur sdz, protein 


14 46 7 62e-15 96-139 


LDLD 


lJrh>UVJZ4y4\^ 


l^"tit»o+ir* \\k crV\ ci 1 1 Al it R.0 T\rrvtp 1 
Kcl aim, lllgll btlllUl XjZ, plVJlClii 


14 46 7 64e-t6 66-109 


LDLD 


Lr JDUUZ4y4V_> 


T{T t^mtif\ riiori ci lift it R0 TiTPitf^in 
JVClcttlllj lllgll SUilUl DZ< pivjLClii 


14 46 7 98e-09 95-138 


LDLD 


TDDAAO/1 A/If"' 
lrJDUUZ4y4^ 


ivcrdiin 9 mgn sunui dz. piuiciii 


14 46 8 07e-24 91-134 


LDLD 


TDD A AO A QA 

lJrr>UUZ4y4l^ 


If orofirt Tri rrTi ClllfilT R0 TlTr\f^lTl 

ivcrdiiii, ^ngii bunui Dz, piuiciii 


14 46 8 13e-12 106-149 

X ™ •^rvy vJ • x +j v> x x \j \j x r ^ 


LDLD 


TT3DAAO/10/I 

lJrr3UUZ4y4L^ 


if Afatiti TtJ frl*i o 1 1 1 Al 1 1* 130 T*tT/*\TolTl 

jverdim, mgn suiiur dz, proicin 


14 46 8 44e-21 71-114 

x i »"v/ v?»~"v xw x /x x x r 


1 3 1 «i 
LDLD 


lrr>UUZ4y4L> 


T/" ^fo-fiTl Vii crVi ClllfilT 130 TlT/^+f^i Tl 
IVCrdllllj lllgll oUlllll J_>Z, piVJlClIl 


14 46 8 45e-ll 72-115 

X « v/ u« Tt/V xx f xv x r _ 


I'll r 

LDLD 


TDDAAO/1 A/1 

LrJDUUZ4y4^ 


TZ^rotin Vii crVi Clll"filT R0 Yxfrxtt^Tn 

rverdiiUj nign sunui sdz, piuiein 


14 46 8 70e-14 90-133 




TPB002494C 


Keratin, hish sulfur B2 tjrotein 


14.46 9.00e-l 146-89 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.30e-14 100-143 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.65e-10 75-118 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.82e-l 177-120 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.83e-09 39-82 


1315 


IPB003571B 


Snake toxin 


18.08 1.20e-09 22-45 


1315 


IPB003854B 


Gibberellin regulated protein 


19.26 9.5 le-10 45-83 


1315 


PR00858B 


Crustacean metallothionein signature II 


5.93 3.39e-09 49-67 



WO 03/054152 PCT/US02/39555 
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III 


Database 
entry ID 


Description 


Results* 


i n *i 

1 J JL J 


rI\UUojoD 


(~^ri lcto ^PQn tn ofol 1 f*vr rx i /~\fi t"i o i rm <a'TT irj^ T ll 

ubiaucd.li riieiaiioLnionein oigiidiuiw 11 


S 01 7 99f=>-0Q 1 S-ll 


111^ 


pp A^c•74^ , 

rrvUUo /4V_/ 


rungi-i v mctauotnionem signature ill 


A 17 1 7^*a HQ fS) 


111^ 


pp aoco^P; 


"M^a*V> ol'/^ri/a "fvxz^+o 1 1 /"\4*lx 4 /"Vfx ntti ri-i rrti 0"fn Tl 

i^eiriaioae nieiaiioinionein signaLure 11 


7 *\ mp. no 1 ni 

/.DO J.VJie~W!7 07-lUZ 


1116 

JL J IO 


lr JQUUUUUO 


verccDrate iriciaUOTnioiieiri, iarriiiy l 


11 A\ 1 10^-10^7 1 ni 


1 1 1 6 


lr r»uuuuuo 


v cite or axe me taiiotiiionein, lainiiy l 


ij.h-i ize-uy j/-oz 


1 1 1 /? 

1 j 10 


tppiaaaaa/; 

IrrSUUUUUO 


veneDraie meraiiotnionem, iamny i 


1 1 41 < >d1<=> AO ^7 Q7 

ij.^fi j.'Hie-uy Dz-y/ 


1 1 1 £ 
IjIO 


tpraaaaa/; 

IrrSUUUUUO 


veneDraxe metaiiOLiiioneiii, iamny i 


11 41 ^Q<= HQ 1Q Cl 


1j 10 


tphaaaaa/^ 

IrriUUUUUO 


veneoraie metaiioinionein, iamny 1 


1 1 41 < /;c/a no 111 
lj.hl j.ooe-uy oo-iij 


1 1 1 £ 
1 j 10 


tpqaaaaa/; 
Irr>UUUUUO 


v ene orate metauotmonein, iamny i 


11/11 ^ 7fto 11/17 01 

ij.4i j./ue-ii 4/-yz 


i j 1 0 


tp tjaaaaa/; 

IrrSUUUUUO 


v eneDraie rneiaiioinioneiii, ianiiiy i 


1141 ^ 7/1 d 1H Q7 117 
Ij.41 j./^fc-lU o/-1jZ 


i n ^ 
u io 


tpraaaaa/; 
lr ouuuuuo 


veireorate nietaiioiriioneiri, iamny i 


1141 ^ S7*=» AO 1 52 £S\ 
Ij.^+1 J.o/e-Ui? 1o-Oj 


IjIO 


IrrSUUUUUO 


verceDrate metanotiiionein, iamny i 


11/11 ^ OAa 11 QO 1 Ql 

Ij.4-1 O.ZUe-11 oo-Ijj 


1 11 £ 


tptsaaaaa/; 

IrrSUUUUUO 


veneDraie meianotnionein, iamny i 


11/11/? /?n« no 1 ^7 /;i 
ij.41 o.oue-uy 1 /-oz 


1 1 1 £ 
1 J IO 


tphaaaaa/; 

lr riUUUUUO 


veneDraie meLaiiotnionem, iamiiy l 


11 41 Q l^i= AO 11 

ij.*+i o,jje-uy Zj-oo 


1116 
1 j 10 


IJrrSUUUUUO 


veneoraie rneiaiiotnionein, iamiiy i 


11/11 C l^o AO IS 111 

Ij.^i o.jje-uy /o-izj 


1 11 £ 
1 J 10 


tpraaaaa/; 

lr rSUUUUUO 


veneoraie rneiaiioiiiionein, iamny i 


1141 Q 4^<=i 1 A 07 117 

ij.^i o.^je-iu yz-ij/ 


1 1 1 £ 
1 J IO 


tphaaaaa/; 
IrrSUUUUUO 


veneoraie metaiiotnionein, iamiiy i 


1141 Q 7Aa 1 1 17 /?7 

Ij.-h-i o./ue-ii zz-o / 


111/; 
1 J 10 


iptiaaaaa/; 
lr rSUUUUUO 


veneoraie rneiaiiotnionein, iamny i 


1 1 41 8 QQi=> AO 11 HSl 

1 j.^f 1 o.yye-uy jj-/o 


1 Q 1 /C 
! 1 J 10 


tppjaaaaa/; 
IrrSUUUUUO 


veneoraie metaiiotnionein, iamny i 


1 1 /I 1 O /I^Co AO /II CQ 

ij.4i y.4De-uy 43-00 


111/; 
1 J 10 


TpnAAAi^oti 
lrr>UUUjjyr> 


^ystme-Knot aomam 


1 O I/? 1 AA«a 1 A OA 1 flQ 

ly.zo i.uue-14 yu-iuo 


111/; 
1 J 10 


tpti aa ai <;qr 


\^ystine-Knot domain 


10 1/; 1 aq /i <; /;i 

ly.zo i.ooe-uy 4j-0j 


1316 


IPB000359B 


Cystine-knot domain 


19.26 2.97e-09 80-98 


in/; 

1 j 10 


TD T> A AA 1 *T CVD 


L.ystme-Knot domain 


ly.zo j. /oe-uy izu-ijo 


1 i i ^ 

IjIO 


tdtjaaai^qti 


Lyystme-Kriot domain 


IO 7/C O^xa IO OA OQ 

ly.zo o.ZDe-iz zu-jo 


111/; 
1 J 10 


Lrt>\j\}\JjDytS 


L/ystine-Knot domain 


IO OZ? Q 11q AQ HI 

ly.zo o.jie-uy 


i j 10 


td*d aaai ^qd 


Cystine-knot domain 


1 o o/c 0 1 £~ nn o< /ii 

ly.zo y. loe-uy zd-4j 


111/; 
IjIO 


tphaaai^qp 
lr rsuuu j j yrs 


i^yscine-Kxiot domain 


1 Q OZ? O 1 AO 111 

ly.zo y. loe-uy yj- 1 1 j 


1 J 10 


TDTJAAAH^Tl 

lrrSUUU /Z0r» 


ijriycosiae nydroiase iamny iy 


^ QQ C /I AO 01 11 

j.jy o.4oe-uy zi-j 1 


1 1 1 /; 
IjIO 


tt>haaaq7ap 
IrrSUUUy /Ur 


ueveiopmentai signaling protein, wnt-i 
iamny 


0*3 A1 ^ Q/l/a AO ^O 1 A7 

Zj.4j o.y4e-uy oy-iu/ 


1 1 1 £ 
IjIO 


tphaai 071 

lrrSUU 1Z / 1 


iviammanan deiensm 


1Q Q7 1 OC^a 1A1^ ^1 

iy.y/ 1. j DC - iu / j"jj 


111/; 
IjIO 


tptjaai 171 

IJrrSUUIZ / 1 


iviammanan oeiensm 


1 O Q7 1 SCq 1 A 04 1 11 

iy.y / i.ooe-iu y4- izz 


1 j 10 


TPRfim 071 
IrrJUUIZ / 1 


iviammanan aeiensin 


1 Q Q7 7 Q4/=» 1 A 7^ 1 Al 

iy.y / z.y4e-iu /d-iuj 


111/; 
1 j IO 


tpraai 071 

IrrSUUlZ/ 1 


lvtammanan oeiensm 


1Q 07 4 Qlo AQ <A 7C ' 

iy.y / 4.yje-uy ju-/o t 


111/; 
IjIO 


tt>tiaai 171 
irrsuuiz/ 1 


iviammanan oeiensm 


1 O Q7 A Q'Xa AO 1 11 1 

iy.y / 4.yje-uy yj-izj 


1116 
1 j 10 


TPHAA1 171 
IrrJUUIZ / 1 


iviammanan oeiensm 


1 O Q7 ^ 1 A 04 ^O 

iy.y / D.z4e-iu Z4-0Z 


1 11 £ 
IjIO 


tpraai 171 
IrrJUUIZ / 1 


ivianiniaiian aeiensin 


1Q 07 ^ AQ 1A 

iy.y / j.zje-uy juoo 


111/^ 
IjIO 


TPRnm 171 
lrrsuu iz / 1 


iviammaiian aeiensin 


1Q Q7 ^ ^R*^ AQ 8^1 111 

iy.y / j.joe-uy oj-iij 


1 j 10 


tt>h aai ooi 
IJrrSUUIZ / 1 


iviammanan oeiensm 


1 Q QO 1 1 Co 1 A 52A 1 AC 

iy.y/ /. i oe-iu ou- iUo 


111/; 
IjIO 


tphaai 171 

IrrJUUIZ / I 


lviammajian aeiensin 


IO 07 7 77<=k AQ Ol 

iy.y / /.zze-uy oj-yj 


IjIO 


TTJTl AA 1 17 1 
IrrJUUIZ / 1 


iviammanan aeiensin 


1 Q Q7 7 l^r» 1 A 4A <Q 

iy.y/ /. jje- iu 4U-O0 


111/; 
IjIO 


IrrJUU 1Z / 1 


iviammanan oeiensin 


1 O 07 C A£q 1 A CO 1 17 

iy.y / o.uoe- iu oy- 11/ 


1 11 /; 
IjIO 


TDD AA 1 n 1 

IrrJUUIZ / 1 


Mammalian oeiensm 


1 O Ol C OAo AO K /?1 

iy.y/ o.zue-uy jj-oj 


111/; 
IjIO 


TDn aa i m 
IrrJUUIZ / 1 


iviammanan oeiensin 


1 A Q7 O Ho 11 K Al 

iy.y/ o.jie-11 0-40 


111/^ 
IjIO 


TT>ta aai t? 1 
IrrJUU 1Z / 1 


iviammaiian oeiensm 


in q*7 0 OfTc AQ C/1 110 

iy.y / o.joe-uy 04-1 iz 


IjIO 


TPRnni 171 
ir r>uo iz / 1 


iviaiiiiiictiictii udciioiii 


1Q 07 R ^Qp.m 10-^7 


1316 


IPB001271 


Mammalian defensin 


19.97 S.85e-09 64-92 


1316 


IPB001271 


Mammalian defensin 


19.97 8.94e-10 49-77 


1316 


IPB001271 


Mammalian defensin 


19.97 9.35e-09 74-102 


1316 


IPB001271 


Mammalian defensin 


19.97 9.67e-09 34-62 


1316 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 1.97e-09 94-127 


1316 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 2.23e-18 53-86 


1316 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 2.85e-13 108-141 
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Table 3B 
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LLf 

1VO* 

L^ixJ t 


Li at ao as e 
emry id 


Description 


Kesults* 


i J IV/ 


TPR0094Q4A 


T<f < a i"H'l"in riioTi cnlfnt" ?59 i^t'/^ilf^ i t~i 
IVCXdllXl, XXIgXX SUXIUX LjjL pXUtCXXX 


19 44 9 S^r 1Q 4J2 Q1 


1116 


TPR0094Q4 A 
1 r Jj) U u ZH- y*t.rV. 


in T>ioT"i oiil"fitt* 159 r\i*At/ain 

xveiaLin, nign suiiur jdz protein 


19 44 9 Q4» OQ 4Q 59 

iz.*+h- jL.yHQ-vy ^fy-oz 


1116 

IjIU 


TP150094Q4A 


xvcidLiii, xixgxx suiiur xjz protein 


19 44 ^^5<= 19 "X "Xf. 


1116 


TPR0094Q4A 


xvdaiiii, iii^ii ouiiur xj^s proieiii 


19 44 T ^S/*-1^ 1^ 4£ 


1116 


TPR0094Q4A 

LIT XJ UUZiT 


IvCIallilj lllgli bU.lJ.UI Ju>Zr piULCIXl 


1 9 44 ^ 44*= - 19 1 52 S 1 


1116 

1J1U 


TPR0094Q4A 


XvCldllll, IllgXX bUllUl 1_)Z< proLcin 


1 9 44 1 £Hp 19 8 41 


1116 


TPR0094Q4A 


T<T f^vai"ii~\ 1*1 1 frli ciil'fi'ii* T-^9 r^fr\+<=»i"H 
XVClctllllj lllglX oUXXlil UjL pi UlClXl 


19 44 1 71^ 1 Q Q9 1 9£ 
iz.^h j. / ie _ Ly yj- izo 


1116 

1 J 1 V 


TPR002494A 

X.U Lj\J\J 4*vy'-rf\. 


xvCla.1111, lilglX &U11UX 13 Z< piULCUl 


1 9 44 4 48<=-0Q ^4 87 


1116 

IJ X KJ 


TPR0094Q4A 
xir ij\j\j 


T<r^t"!ati'M Tiicrn onlfiir T-19 T\t*r» , f"<^iTi 
IVCXctliXl, XXXgll bUXlUl X3Zr piuxcxii 


1 9 44 4 68p 10 118 1 s 1 

1Z.TT T.OOC 1U X 10-lJl 


1116 


TPR0094Q4A 


XX.dcttlll 9 Illgll &U1XU.1 13 z, pi UlClll 


1944471*= 1798 61 

XZ.Ht Hr. / Xc- 1 / Zo-O 1 


1116 


TPR009494A 


T^'< s »T'5i'f'in ViicrVi cn1"fiiT T^9 T\t*rt'F#a'tn 
XVC/lclLXllj XXIgiX B UlltiX J_>Z> pivitcin 


1 9 44 4 SQ^-1 0 94 **7 


1116 


TPR0094Q4A 


xvcxcttxii, iiigii buxiur xjz, proieiii 


1 9 44 4 Q7ps HQ Azl Q7 


1116 


TPR0094Q4A 


jvcidLiii, ixigix istiiiur 13 z. protciii 


1 9 44 4 QQp 1 0 90 £9 


1116 

LJ 1 \J 


TPR0094Q4A 


XvClatlllj XllgXl otlllUl 13 Z. piOlCXll 


1 9 44 ^ 04«= 1 1 4^ 7A 

iz.^^f z) .UH-e- xj e Tj- /o 


1116 


TPR0094Q4A 


ivcxatxxx, nign buiiur oz protein 


1 9 44 ^ 1 O^ 17^8 71 
IZ.^f^f J. lUe- 1 / JO-/I 


1116 

1 J 1 U 


TPR0094Q4A 


xvoiatin, mgn suiiur i3z. protein 


1 9 44 ^ 84*a OQ 09 


1116 

X J 1 \J 


TPR0094Q4A 


xvcxd-txix, ixx gix buxxitr 13 z, protein 


1 9 44 ^ Q9*a 14 88 191 

x z.'4-H- j.yze-i^f oo-izi 


1116 


TPR0094Q4A 


ivcxatxxi, xxxgxx btixiui x3z/ protein 


1 9 44 <^ 1 8^ 11^ 

iz.hh- o.oz>e-ij oj-i io 


1116 


TPR0094Q4A 

1 JT XJ UUi,T7T , A 


xvciatxii, xxigii btixiui x3z, protein 


1 9 44 6 77^ 1 1 ^8 1 Ol 

iz.*f*f o.//e-iz) oo-iui 


1116 


TPR0094Q4A 


xv.erd.x.in, m^n. suiiur xjz protein 


1 9 AA 7 99o 1 9 TX 1 H/^ 
IZ.^f^t /.ZZe-I/ /3-IUD 


1116 


TPR0094Q4A 


xveratm, mgn suiiur x3z protein 


19 44 7 <;8<= OQ 1Q ^9 


1116 


TPR0094Q4A 


xvcxatxiij xixgii ouiiur x3Z/ protein 


19 4zl 7 1 n 1 fi^ 1^^ 
IZ.H^f /.03e-lJ 1Uj-1jO 


1116 


TPR 009404 A 
ix ijuuz t ry*T.r\ 


jverdim, mgn suiiur x3Z protein 


19 44 7 89» 1 Q 9*5 


1116 


TPR0094Q4A 


iveratin, nign suiiur 13 z protein 


1 9 AA 8 fl7/=> 1 < Q Q 1 


1 11 6 

1 J X u 


TPRO094O4 A 

lx xjUVJZ*ry*1-.rV 


iveratin, nign suiiur di, protein 




1116 


TP750094Q4 A 


iveratin, nign suiiur rsz protein 


19 /l/t 8 O/lo 19 Q8 191 


1116 

LJ L\J 


TPR0094Q4A 


iveratin, nign suxiur xtsz protein 


19 Q Clio OQ TO 99 


1116 


TPR0094Q4A 


T^" fjk r* oil n Tii ctTi c 1 i 1 1 i T" T3 3 nrrvfoin 

xvciatxxi, xxxgxx suxxux X3Z pioteiix 


1 9 zlzl Q 1 ^ 1 A 98 111 
IZ.H-'t y. lOe-14 /o-l 1 1 


1116 

U 1 U 


TPR0094Q4A 


iveratin, nign suiiur dz protein 


19 AA Q <^Q/a IO SO 199 


1116 


TPR0094Q4A 


T^^fQ+in Vii frn onlflTr 159 nrrtlfin 

xvcidiiii, ingii ouiitxi J3Z prutein 


19 44 Q 7/^^ 14 T5 


1116 


TPR009404R 


T^pk-fo+in ni <tVi culfnt* R9 fw/'vf^'ifi 
XVCXaLXll, XXX gXX otllXtlX 13 Z piOLOlIX 


10 ^8 1 98^_1 1 99 ^ 
1U.JO X .ZOC- X I ZZ-jD 


1116 


TPR009404R 

XX XJ V \J-Z.tI'*TXJ 


xvcxcitxii, xxxgxi ouxiui dz protein 


IO ^8 9 97f» OQ 87 101 

lu.jo z.z/e-oy o/-xui 


1116 


TPR0094Q4R 

11 XJ V/V-/A>*t J'tXJ 


xvciatxxi, xxxgxi suxiux X3Z piotein 


10^81 6Q<=» 1 0 Q9 1 0£ 
lu.jo j.oye-ii/ yz-iuo 


1316 


TPR0094Q4R 


T^f^fCitiTi Til crh Giilfiit* R9 rvri-k+f^ivi 
XVCi tltXXX, XXXgll otlXXUl 13 Z pXULCXIX 


10 ^8 ^ 99<=» 1 1 77 Q1 


1116 


TPR0094Q4R 


xvcxatxii, iixgn suxiui X3Z piotein 


10 ^8 ^ 78p« 1 1 79 8A 
1UJO J./OC"ll /z-oo 


1116 

LJ 1 U 


TPR0094Q4R 

lx: XJ> \J \)ZS-ry t-XJ 


lverdcin, nign suiiur x_>z proiem 


10 ^8 ^ 07i= OQ *\9 

iu.z>o o.u/e-uy jz-oo 


1116 


TPR009404R 


jvcrdiuij nign suxiur jjz protein 


10 ^8 ^ 90/= 1 1 "37 ^1 

iu.zjo o.zoe-ii ji-jl 


1116 

LJLKJ 


TPR0094Q4R 


jverdtm, nign suxiur X3Z protein 


ia co /: </t#a 1 n 97 A 1 

iu.jo o.O'fe-iu zz-'H-i 


1116 

IJ 1 u 


TPR0094Q4R 

XX Ju>UUZ'Ti' < -rXJ 


xverdtiii, xixgxx suxiux j_>z protein 


10^8^\ ^4f=« 1 0 Q7 111 

iu.jo o.O'fe-iu y/-iii 


1116 

LJ 1 \J 


TP150094Q415 


xverdxin, nign suiiur x_>z protein 


m <;s a q1p to 1 07 191 
iu.jo o.yie-iu luz-izi 


1316 


TPR009404R 


xvvjidtxxx, xxxgxx ouxiux ij z prutein 


10 ^8 6 Q1 *=-1 0 69-7£ 
iu.jo o.yie-it/ oz~ / 0 


1116 

LJL\J 


TPT50094Q4R 
LIT Jj \J KJZ,*iyHD 


iverdiiix, xxxgxi suxiur xjz protein 


If! *\8 7 08<= OQ 119 

iu.jo /.uoe-uy iiz-izo 


1116 

LJ L U 


TPT50094Q415 
lx xj \J \JZ,HyHD 


T^"<st»r5+ltl Tii Ctrl Olllfllf T59 nrA^oiVi 

iveratin, nign suiiur .dz protein 


1H ^8 7 08» OQ fH 81 

iu.jo /.uoe-uy 0/-01 


1116 


TPR0O94Q4R 
Ltr x3 u UZ'+y^+lj 


iveratin, nign suxiur xsz protein 


in tro q 0 c« 1 3 A9 ^ 1 

iu.jo y.zje-iz 4 /-oi 


1116 


TPR0094Q4R 
lx XJ UZHy 


lif'o'^ollTl Vl 1 oil oill"fllT* 159 nrrxfoi ti 

iverdiiii, ixxgn suxiux oz proxem 


in <;c 0 A At* 11 117 i"5i 
iu. jo y.^fH-e-i 1 ii/-1ji 


1316 


TPR002494C 


TCpratiti liip'li <5iilfiir R9 r^ro+f^in 

IVVvl dLlll, lllgil OU11U1 \JL\JL\sLLL 


14 46 1 00p-0S 15-S8 

I't.'tVJ 1<uuC "UO IJ JO 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.18e-ll 55-98 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.22e-17 101-144 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.36e-ll 95-138 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.54e-16 1-44 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.60e-31 41-84 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.94e-26 26-69 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.76e-13 25-68 



WO 03/054152 PCT/US02/39555 
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SEQ 
ID 

NO: 


Database 
entry ID 


Description 


JtCcSUIlS'* 


13 lo 


lrrJUUZ^yH-C 


Jveraiin, mgri suiiur dz protein 


14 46 % 45e-10 20-63 


13 lo 


TDDAA1 /) O/l 


jveratm, nign sunur Jt>z protein 


1 4 46 ^ 47p~22 56-99 


1 O 1 £. 

1316 


lrfc>UUz494C 


Jveratm, nign suiiur rsz protein 




1 O 1 /z 

1316 


TTlT"i (\ AO A C\ A f\ 


jveratm, nign sunur rsz protein 


14 46 4 1Qp-1? 1 16-1 59 


1316 


TT1TD f\ AO ACkA 

lrBUUZ494C 


iveratin, nign suiiur uz protein 


14 46 4 ?6p-18 71-64- 


1316 


1PB002494C 


Iveratin, nign suiiur dz protein 


1 4 46 4 47p-94 8 1 - 1 74 


1316 


TT1T1 AAO AC\ A Z -1 

1PB002494C 


iveratin, nign sunur t$z protein 


14 46 4 60p-17 6-49 


1316 


IPBQ02494C 


rveratm, mgli suiiur r>z piotem 


14 46 4 S0p-9S 76-1 1 9 


1316 


LrBUUz494C 


jveratm, nign sunur dz protein 


14 46 4 X4p-10 65-1 OS 


1316 


lrBU024y4(_. 


Jveratm, nign sunur r>z protein 


1 4 46 4 f) 60-1 0^ 


1316 


TB'DAAOylQyir 1 

LrBUU24y4C 


jveratm, nign sunur Jt>z protein 


1 4 46 4 95p-09 1 00-1 4^ 


13 16 


lrBUU24y4C 


jveratm, nign sunur ±>z protein 


14 46 5 01 e- 13 90-1 33 


13 16 


lrJt>UU24y4U 


jveratm, nign sunur dz protein 


14 46 5 50p-25 31-74 


1 O 1 /C 

13 16 


ll J i3UU24y4C 


T^"^i-*»ioi4" , t »oi *h -l /orTo mi l^t it* IJ / ^ -»o\ y \ 4* /o> 1 T*^ 

jveratm, nign sunur r>z protein 


14 46 5 54p-10 35-7S 


13 lo 


lr±JUUZ4y4L.. 


jveiatm, nign sunur dz protein 


14 46 5 54e-10 80-123 


13 lo 


TT3 T5 n m /l Q /I z^ 1 
lrr>UUZ4y4v^ 


T/^ £}**«r\ +-1 -to V* 1 rrlo r< 1 t 1 *f*i i t* T^O torrvfoi 

jveratm, mgn sunur r>z protem 


14 46 6 42e-10 121-164 


1 "5 1 *C 

131o 


/I A/IP 

lFr>UUZ4y4C 


jveratm, nign sunur r>z protein 


14 46 6 50p-23 46-89 


1316 


lJrJ3UUZ4y4C 


Tc'''' ri*#r» ^-t -»o <OrVi /illl'Alt* |_J O o v O\T*/^4"^1 "IO 

jveratm, nign sunur Jt>z protein 


14 46 6 52p-21 91-134 


1316 


lrBUUZ4y4U 


jveratm, nign sunur joz protein 


14 46 6 64p-26 96-139 


1316 


1PB0UZ494C 


jveratm, nign sunur J5Z protein 


14 46 6 91p-1 1 85-128 


1316 


lJrB0U24y4C 


jveratm, mgn sunur r>z protein 


1 4 46 7 04p-9? 1 1 1 -1 54 


131o 


lr r>U UZ4y 4L/ 


Ti^o*«nf 1 -ro Tol i~riOj rill i "fl 1 1" 15 ^ »OiT"i~*^*^1 

jveratm, nign sunur dz protein 


1 4 46 7 1 2p- 1 0 70- 1 1 3 

it.^U / • LJU\y XV/ /V/ X X »> 


1316 


TDDrtm /I O/t/^ 1 

lrBUUZ494C 


jveratm, nign sunur dz protein 


14 46 7 67e-33 16-59 

1*T,*TV/ 1 .\J 1 V7 »/ J X VJ — ' 7 


1316 


lrBUUZ4y4C 


jveratm, mgn sunur dz protein 


14 46 7 75p-?8 86-129 


1316 


Tt>13 A AO >j A A 

lrBUUZ4y4C- 


jveratm, nign sunur dz protein _j 


14 46 8 34e-10 75-1 18 


1316 


TDDAAT/ffl/IP 

irB0Uz494O 


jveratm, mgn sujiui dz protein 


14 46 8 53p-13 50-93 

1*T.*T\J O.J JO 1 J «/VJ 7J 


1316 


TT1T1AAO /I A /I O 

lrBU0Z4y4C 


jveratm, mgn sunur dz protein 


14 46 8 70p-90 1 1-54 


1316 


ll J B(JUz4y4C 


jveratm, nign sunur dz protein 


14 46 8 74p-09 10-53 


1316 


lrBDU24y4L- 


jveratm, mgn sunur dz protein 


14 46 8 88p-93 36-79 


1316 


TT1TOAAO ACiA/~i 

lrBU0z494C 


Jveratm, mgn sunur dz protein 


14 46 8 88p-98 66-109 


1 O 1 iC 

1316 


lPBUUz4 u 4C 


jveratm, nign sujiui dz protein 


14 46 9 30p-10 30-73 
i*T.H'0 -7 . j o e*" x o JU-/J 


1316 


TTJTlAAl A A A /"** 

!PB0024y4C 


Jveratm, mgn sunur dz protein 


14 46 0 39^-97 51-Q4 


1316 


TDDflAI Ad A Z" 1 

lPBUUZ4y4U 


1/^/%««r«f ttt Toi nrlo^ 101 iI-Pi ii* 13 O t^t^rt+rii to 

jveratm, nign sunur dz protein 


14 46 9 38p-23 71-1 14 

JLt.'TVj 7iJOv iiJ / X XX" 


1316 


lPBUU24y4L 


AM/xi-i -*<■* -i /-*-Vv nil! "Pi t -v" L3 ^1 v\v/'\4*/Ot to 

jveratm, mgn sunur dz protein 


1 4 46 9 57p-1 7 1 06-1 49 


13 16 


lrBUUZ4y4C 


jveratm, nign sunur dz protein 


14 46 9 58p-09 105-148 

1*t.*tvJ 7.JOO U7 1UJ no 


1316 


lrr>UUZ4y40 


jveratm, nign sunur dz protein 


14 46 9 60p-25 61-104 


1316 


lroUUZ4y4<U 


jveratm, mgn sunur dz protein 


14 46 9 91P-10 40-83 


1316 


TADAAO O O OA 

1PBU03oooL> 


■ v*7- * -^1* Jl /-»vvo r*» t to "r\\T *f Qt'tVl 1 tot 1 CI 

j^ Y -ncn domain in -terminus 


94 70 6 1 6p-09 1 00-1 49 


Tin 

1317 


TTJDAAAAA^ 


\ ^rt-»^*yO\To\t»#0 + ^i tVt ^4'0 1 7 /Okf"Toj 1 /^tO £2*1 tOi "P*ifO "»OVOj 1 1 1 / *1 

verteorate meiauotnionein, iamny 1 


13 41 1 00p-08 38-83 

1J.41 X . V.' V/ V/ O ^ O O.^ 


1317 


TDDAAAAA/C 


\ /ioW' /oTo\f fo,^r^ ton a+o 1 1 1 f\10 ft* t n "fo TY^ tlx/ 1 

VcrtCDraXc Tneiduoiuiouein, idmny i 


13 41 1 1 8p-09 165-210 


1317 


TAT3AAAAA/C 

IPBUUUUuo 


verte orate meidiioinionein, iamny 1 


13 41 1 98p-09 85-130 


ion 

1317 


TTiDAAAAA/C 

1PBDOO0U6 


\ T f+.+*4» AtutA^d ton ^4'ol 1 rfo\4"Vi 1 /~\to £ii to oPn von tl tj 1 

verteDrate meiauotnionein, iamny t 


13 41 1 39p-1 0 130-175 

J.J.'+x 1.J7C 1U 1 Jv 1 / J 


1317 


TT>"DA AAA A/C 

IPBOOOUOd 


VerteDrate metauotnionein, iamny 1 


13 41 1 46p-09 60-105 


1317 


TAT*» A AAnA^ 

1PB000U06 


Verteorate metaiiotnionein, larnuy i 


1^41 1 ^5p-09 70-1 1 5 

ij.tl l.JJv \jy /V/~1XJ 


1317 


IPB000006 


Verteurate metaiiotnionein, iamny i 


H 41 9 9Qp_0Q 176-991 


1317 

i J 1 / 


TPR000006 


Vert^Krate metallothionein. familv l 


13.41 2.35e-10 64-109 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 3.11e-09 69-114 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 3.32e-10 104-149 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 3.94e-09 239-284 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 4.12e-09 125-170 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 5.32e-09 201-246 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 5.59e-09 190-235 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 5.68e-09 175-220 
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SEQ 

TTV 

ID 

iNU: 


Database 
entry ID 


Description 


Results^ 


1 T17 

1 3 1 / 


tphaaaaaa 


verteDrate metaiiotnionein, lamiiy i 




1 1 1 H 
131/ 


lrt>UUUUUO 


verteorate metaiiotnionein, iamny i 


ii n « g/ia i a 7in o-s;*: 


ion 
13 1 / 


i"D"daaaaa£ 


verteorate metaiiotnionein, tamiiy i 


11/11 ^ Q'To AO QA lOK 


131/ 


tdtjaaahaa 


verteDrate metaiiotnionein, iamuy i 


11 Al ^ QA^ 1A 1 7A 91-^ 


1 1 1 7 
13 1 / 


tt>t3qaaaa£ 


verteDrate metaiiotnionein, iamny i 


Ij.H-1 J.^'+e-lt/ ZUU-Z4J 


1017 
131/ 


IrrJUUUUUO 


verteorate metaiiotnionein, iamny 1 


i-j 11 <: T3 P aq 1 3£ 711 


131/ 


iptjaaaaaa 


verteorate metaiiotnionein, iamny i 


1-5 Ii OOp AQ 1/CA 7AC 


1017 
i J 1 / 


TDT3AAAAA£ 


verteorate metaiiotnionein, iamny i 


11117 IQo 1 A 77^ 77A 

u.^i /.^ye-iu zzd-z/u 


1 "3 1 1 
13 1 / 


t"dt3aaaaa£ 


verteorate metaiiotnionein, iamuy i 


1111 7 A1p> AQ 1 1 ^\ 1 £A 


13 L / 


TDOAAAf^A/C 

If oUUUUUo 


verteorate metaiiotnionein, iamny i 


11/117 A1<i AQ 1 AA 

1.x -41 /.H-5e-uy j iuu 


131/ 


lrJ3UUUUUO 


verteorate metaiiotnionein, iamny i 


11/117 ^C*i 1A 7/1 /^Q 

lo.^fi /.ooe-iu zh~ ^ 


\ T317 
13 1 / 


IrrJUUUUUO 


verteorate metaiiotnionein, iamny i 


11/11 7 Q7i» 1A OK 7^A 

io.^ti /.y /e- iu zio-zou 


1-317 
131/ 


tdoaaaaa^ 


Verteorate metaiiotnionein, iamuy i 


11/11 5 1^\o HQ 1^ AA 


13 1 / 


IJrrJUUUUUo 


Vertebrate metallothionein, family 1 


11/11 Q iCn AQ /I C QA 


1317 
I 3 1 / 


IJrhSUUUUUo 


Verteorate metaiiotnionein, tamiiy i 


11/11 Q /C7r» AO Q^ 1/1 A 

13.41 o.oze-uy yj- 14U 


1317 

13 1 / 


IrrJUUUUUO 


verteorate metaiiotnionein, iamuy l 


11/11 Q llo 1A 1 7A 1 
13.41 y. 13e-lU 1ZU-10D 


1317 
131/ 


TDDAAAAA^ 


Verteorate metaiiotnionein, iamuy 1 


11/11 Q O7o AQ 7AC 7^A 

13.41 y.z/e~uy zuo-z3u 


13 17 

131/ 


IrriUUUUUO 


verteorate metaiiotnionein, iamuy 1 


11 A 1 Q 77<a fiO 7Afi 1<1 

13.41 y.z/e-uy zuo-zdi 


1317 

13 1 / 


tpt^aaaaaa 


verteorate metaiiotnionein, iamny i 


1 1 11 Q 1*Jp» AO 1 7A 1 71 

13.41 y.4oe-uy lzo-i/i 


1-317 

131/ 


rDTJAAQAAA 


verteorate metaiiotnionein, iamuy i 


11/11 Q<1o 1A 11 7Q 

13.41 y.oie-iu 34- /y 


1017 
131/ 


TDT3AAAAAA 


verteorate metaiiotnionein, iamuy i 


11/11 Q Qlo no 1 < 1 1 QA 

13.41 y.yie-uy iz>i-iyo 


1317 

13 1 / 


TPT3AAQAAA 
lr J3UUUUU0 


verteorate metaiiotnionein, iamny i 


11 11 Q Q1o HQ 7Q 7A 

13.41 y. y ie-uy zy- /4 


1317 

131/ 


ttdtj nnn3 cgd 
I1^UUU33 yJ3 


/ ^ w f ,-i 4- ^ y*\ 1 y^*<^ yx+ yJ y-\ y^i * >fi 

oystme-icnot domain 


1Q 7A 1 OCp AQ 17S 11A 

ly.zo i.zoe-uy izo-i^fo 


1Q17 

131/ 


TDDAAA3<ffD 

lrBUuU3DVo 


Uystme-Rnot domain 


1 Q 7A 7 CAo 111 Al 1 Q 1 

ly.zo z.oue- 1 1 103-101 


1317 

131/ 


irt>K)K)\JDJ yi3 


oystme-Knot domain 


1Q 7A 1 ^1/a HQ 7/11 7A1 

ly.zo 3.33e-uy z4j-zoi 


1317 

131/ 


TDTJ AAA3<OD 

lrr>\)\J\)oD yt> 


Lxystme-Knot domain 


1Q 7A A 1$J#=» no 77 1A 

ly.zo 4.3oe-uy zz-4u 


1317 

131/ 


TDDAAA3<OD 

lrr>UUU3C?yr> 


Uystine-Knot domain 


1Q 7A A AAo HQ 71H 7AA 

ly.zo o.uoe-uy Z4o-zoo 


1 7 1 7 
131/ 


TDDAAA7^QT3 


L^ystme-Knot domain 


in OA A 1/l<=i AQ 7A1 971 

ly.zo o.j*+e-uy zlo-zzi 


1-317 
131/ 


TDT3 A A C\ 3 < OT3 


L^ystine-Knot domain 


1 Q 7/t A A1*» 17 1 OR 71 A 

iy .zo d.ojc- iz lyo-zio 


131/ 


fi A A3< QT3- 

lr^J3 U UU3 o yr> 


^ystme-Knot oomam 


1Q 7A 7 7^^ 17 A7-SA 


1317 

13 1 / 


lroUUU3!!>yr> 


uystme-Knot domain 


1Q 7A £ AQ 77 QO 

ty.zo o.ooe-uy /z-yu 


13 17 

131/ 


T"D"D AAAQ^7t( 

lr'rJUUUyo /li 


Z,mc linger JNr-Ai type 


71 cc i 7i/=» no 111 1^1 
zi.oo 3. /3e-uy iii-iji 


1 7 1 7 
13 1 / 


TDT3 A A AO A7T? 


z,inc linger iNr-yvi type 


71 S$2 A /lAf=» AQ RQ-17Q 

zi.oo o.4oe-uy oy-izy 


1-317 

13 1 / 


TDT5AA1 1 (iOF 

IJrrSUU 1 1 t>yiv 


integnn oeta, c-termmus i 


77 1^ ^\ 1 Co AQ 71 A 7^B 
Z/.4I) 3.3oe-Uy ZIO-Zjo 


1317 

13 1 / 


Tt>r>nAi 77 1 
IrtSUUIZ/ 1 


JVLammaiian aetensm 


1Q Q7 1 AA 0 AC Al SO 

iy.y/ i.uue-uo oi-oy 


13 17 
13 1 / 


TDDAA 1771 


Mammalian aetensm 


1 A Q7 1 /1Q« AQ 117 1 £H 

iy.y/ i.4ye-uy 13Z-10U 


13 17 
13 1 / 


1 no no 1 171 

IrrJUUlz/ 1 


Mammalian aeiensm 


lO Q7 1 OOo 1A /17 7A 

iy.y/ i.ooe-iu 4z- /u 


ion 

1317 


1FB001271 


Mammalian defensin 


1 ri m *"> 1 aa 1 m ooi 

iy.y / z. I3e-uy iy3-zzi 


1317 


TTJT5AA1 771 

lrii00127 1 


Mammalian aetensm 


1 c\ m O 1 1 « AO 1 Al 1 A 1 

iy.y / z.3ie-uy 103-iyi 


13 1 / 


TDDAA 1771 

lrr>UUlz/ 1 


Mammalian aetensm 


io an 7 /i7<a no 717 7'7n 

iy.y/ z.4/e-uy Z4z-z/u 


1317 

13 1 / 


TTDTD AA1 77 1 

IrJoUUlz/ 1 


Mammalian aetensm 


1 q Q7 <7 OAo AQ 1 <1 1 

iy.y / z.oue-uy id/-ioj 


1317 

131/ 


TDDAA 1 071 

LrrSUUlZ/ 1 


"N yv tA**IV* 1 a j— \ am y*3 y\4^* m 

Mammalian aetensm 


1 a Q7 i oOo in CA C/1 

iy.y/ 3.zye-iu j d-o4 


1317 
13 1 / 


TDDAA 1 T71 

lrrsvJUlz/ 1 


Mammalian aeiensm 


1 O Q7 A 7A*= AQ 1 A7 1 OA 

iy.y / 4. /oe-uy ioz-iyu 


1317 

13 1 / 


TDDAA 1 371 

IroUUlz/ 1 


Mammalian aeiensm 


1A 07 C AQa AQ 7 A ■'n/I 

iy.y / D.uye-uy zo-34 


1317 

13 1 / 


TDD AA1 37 1 

LFJdUU12/1 


Mammalian aetensm 


10 07 < fiflo 17 1 A Q 17< 

iy.y/ 3.uye-iz 145-1/0 


1117 

1-3 1 / 


1IM5UU 1Z / 1 


ivianTrritiiid-n. uciciimii 


10 Q7 ^ OIp-HQ A^-QA 


1317 


IPB001271 


Mammalian defensin 


19.97 6.25e-ll 57-85 


1317 


IPB001271 


Mammalian defensin 


19.97 6.29e-10 51-79 


1317 


IPB001271 


Mammalian defensin 


19.97 6.47e-10 52-80 


1317 


IPB001271 


Mammalian defensin 


19.97 6.65e-10 158-186 


1317 


1PB001271 


Mammalian defensin 


19.97 7.05e-09 227-255 


1317 


IPB001271 


Mammalian defensin 


19.97 7.22e-09 167-195 


1317 


IPB001271 


Mammalian defensin 


19.97 7.53e-10 17-45 
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ID 
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Results* 


1017 


1 771 

IrrSUUlZ 1 1 


Mammalian derensin 


19.97 7.71e-10 147-175 


IT 1 7 
131/ 


TDX5 r\r\ 1 77 1 
LrBUUlz/ 1 


Mammalian defensin 


19.97 7.87e~09 202-230 


13 1 / 


IDDHH 1 77 1 

IFoUUlx / 1 


Mammalian defensin 


i A A^7 o o A_ r\r» 1 Ol O 1 1 

19.97 8.20e-09 183-211 


1T17 
131/ 


TDTSAm 77 1 


Mammalian delensm 


1 A A*"7 A 1 O „ A A 1 r\0 ^ O /A 

19.97 9.1oe-09 192-220 


1017 
131/ 


iDnnni 77 1 
IJrBUUiZ/ 1 


Mammalian defensin 


1 A at r\ o c „ 01 i rvo 

19.97 9.35e-09 81-109 


131/ 


TDDHH 1 771 

LrBUUIZ / 1 


Mammalian defensin 


1 A AT A Z'T— Ar\ 1 ^1 1 1 

19.97 9.67e-09 123-151 


1017 
13 1 / 


TDDf\n 1 7/C7 A 

lirkiuUL /OZA 


Disintegrin 


23.93 4.88e-09 91- 13 1 


13 1 / 


T"DT> A A 1 TzT A A 

lr BOO 1762 A 


Disintegrin 


23.93 7.8oe-12 99-139 


1017 

13 1 / 


TT3TJ AA7/1 H/1 A 

LrBU0z4y4A 


Keratin, high sulfur B2 protein 


12.44 1.00e-14 161-194 


1017 
131/ 


LrBUUZ4y4A 


Keratin, high sulfur B2 protein 


12.44 1.41e-18 201-234 


1 °. 1 7 
13 1 / 


LrBUUZ4y4A 


Keratm, nign suliur Bz protein 


12.44 l.ooe-17 166-199 


13 1 / 


TDDAl^7/IQ/l A 

lr\t5UUZ4y4A 


Keratin, nigti suliur Bz protein 


in /I /i 1 ia on o/rr 

12.44 1.72e- 10 232-265 


1 1 1 7 

13 1/ 


0/l A 

UrUUUZ4y4A 


iveratm, nign sullur Bz protein 


n /i /i 1 nn„ i c i o/r nor* 

12.44 1.88e-15 196-229 


131/ 


lrr>UUz4y4A 


Keratin, high sulfur B2 protein 


12.44 2.16e-16 50-83 


1017 
131/ 


lrBUUz4y4A 


Keratin, high sulfur B2 protein 


lO A A o r\c\ z o o 

12.44 2.35e-09 5-38 


1-317 
13 1 / 


TDT3 AA7/1 Q/1 A 

lrr>UUz4y4A 


Keratin, nign suliur Bz protein 


10 /1/t O A A „ 10 OA 110 

12.44 2.44e-12 80-1 13 


1017 
131/ 


lr\t> U UZ4y 4 A 


Keratin, high sulfur B2 protein 


10 A A 7 -1 /I „ 1/t 1 A^T 1 7 A i 

12.44 2.44e-14 106-139 


1017 
131/ 


lrJb>UUZ4y4A 


Keratin, high sulfur B2 protein 


10 /l/t O ^T_ 1ZT 1T1 1 /"yl 

12.44 2.67e-16 131-164 


nn 
131/ 


TDDAA7/1A/1 A 

lrr>UU24y4A 


Keratin, high sulfur B2 protein 


1 O /I /I 1 /^T_ 1 1 /f Z' 1 TA 

12.44 2.67e-16 146-179 


131/ 


AC\A A 

lFB0U24y4A 


Keratin, high sultur B2 protein 


12.44 2.84e-10 251-284 


1 Q1 7 

131/ 


lrB0U24y4A 


Keratin, high sulfur B2 protein 


10 A A O 10_ 1T TO 

12.44 3.12e-17 40-73 


T317 
131 / 


lroUUz4y4A 


Keratin, high sulfur B2 protein 


1 O A A O /II A A AT 1 O 

12.44 3.42e-09 97-130 


1717 
131/ 


TDDAA7/IO/1 A 

lJrr>UUZ4y4A 


Keratin, high sulfur B2 protein 


10 A A O ,i ^ _ -1 r- xo 

12.44 3.42e-13 15-48 


1017 
13 1 / 


lrBUUZ4y4A 


Keratin, high sulfur B2 protein 


10 /4 A 0 1A 1T1 OA/f 

12.44 3.43e-19 171-204 


1-317 
131 / 


T"DT3 AH7/1 Q/1 A 

Lr J3UUZ4y4A 


Keratin, high sulfur B2 protein 


IO /1/t 1 /II— 1AOA£" OOA 

12.44 3.43e-19 206-239 


1717 
131 / 


TT>DflA7/10/1 A 

lrr>UUZ4y4A 


Keratin, high sulfur B2 protein 


1A A A O ^T_ 1 /-"/-> r-o 

12.44 3.57e-16 25-58 


1717 
131/ 


TDT3 AA7/1 Q/1 A 

lrBUUZ4y4A 


Keratin, high sulfur B2 protein 


10 /I/I 1 01 _ AA At T /I 

12.44 3.81e-09 41-74 


1Q17 

13 1 / 


TX»"D AA7/1 Q/1 A 

lr!5UUZ4y4A 


Keratm, nign suliur BZ protein 


10 /1/t *3 O^T— 1A AO/^ A£TA 

12.44 3.86e-19 226-259 


1717 

131/ 


TT)"0 AA7/f CM A 

lrr>UUZ4y4A 


Keratin, high sulfur B2 protein 


1A A A O O O _ -t C IT/'' AAA 

12.44 3.88e-15 176-209 


1 1 1 *7 

131/ 


T"DT3AA7y1Q/1 A 

lrr>UUz4y4A 


Keratin, high sulfur B2 protein 


1 A A A A AA_ AA 01 1 1 A 

12.44 4.00e-09 81-114 


1717 

131/ 


TT>"D AA7/1 CM A 

lrx5UUZ4y4A 


Keratin, high sulfur B2 protein 


IA/1/tytAO^IOAII A/1/1 

12.44 4.23e-13 21 1-244 


1717 

131 / 


TTD T3AA 7 A Q A A 

lrr>UUz4y4A 


Keratin, high sulfur B2 protein 


1 A /I /I /I 1 1/1 AC TO 

12.44 4.36e-14 45-78 


1117 
131 / 


TDDAA7/10/1 A 

lrrJUUz4y4A 


Keratin, high sulfur B2 protein 


1A A A A A A _ 1T101 A1/t 

12.44 4.44e-17 181-214 


1 1 1 *7 
131 / 


TDOAA7/10/1 A 

lrJbUUZ4y4A 


Keratin, high sulfur B2 protein 


1A A A A T 1 _ 1TOA /TO 

12.44 4.71e-17 30-63 


131/ 


T1DT3 AA7/1 Q/1 A 

lrl3UUZ4y4A 


Keratin, nign sullur Bz protein 


1 A /l /t /1 OT— AA 1 1 A ~\ A C 

12.44 4.87e-09 112-145 


1117 
131/ 


TD 13 A AO A OA A 

irr>UUZ4y4A 


Keratin, high sulfur B2 protein 


1 A A A A OA— 17 111 1/1/1 

12.44 4.89e-12 111-144 


1117 
131/ 


1DD AA7/1 Q/1 A 

1 ro UUZ4y 4 A 


Keratin, nign suliur BZ protein 


17 /t /I CAA— 1A 1A1 A A /I 

12.44 5.0Ue-19 191-224 


1117 
131 / 


Tt>t3 AA7/1 Q/1 A 

lrriUUZ4y4A 


Keratin, nign suliur bz protein 


1 A /I /I AT— 1 O A/T 1 AA 

12.44 5.27e-13 96-129 


1117 
131/ 


TTJT3 AA7/1 Q/1 A 

lrr>UUz4y4A 


Keratin, high sulfur B2 protein 


1A /I /t Tf. A A 1 TT A 1 A 

12.44 5.35e-09 177-210 


1117 
131/ 


TT>T3 AA7 AC\A A 

H J BUUz4y4A 


Keratin, high sulfur B2 protein 


1A /t/1 £T /I A— . 1 A O /T CA 

12.44 5.40e-10 26-59 


1717 

13 1 / 


TTJTD A AO A A A A 

lrB002494A 


Keratin, high sulfur B2 protein j 


1 A A A £? ^ A 1^1A1 10/1 

12.44 5.50e-15 101-134 


1717 
131/ 


TTJDAnO A C\A A 

lrB002494A 


Keratin, high sulfur B2 protein 


1A A A £TT1_ 11 1^1 lO/l 

12.44 5.71e-ll 151-184 


17 17 
131 / 


LPB002494A 


Keratin, high sulfur B2 protein 


12.44 5.73e-13 60-93 


17 17 
131 / 


1PB002494A 


Keratin, high sulfur B2 protein 


12.44 6.14e-16 35-68 


17 17 
131/ 


1PB002494A 


Keratm, high sulfur B2 protein 


12.44 6.28e-14 75-108 




Lr X5U UZt^t- A 


Keratm, nign sunur dz prorem 


17 /1/1 HO 7"27 77A 

lz.44 O.ole-Uy Z3/-2/K) 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.66eA6 136-169 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.66eA6 65-98 


1317 


1PB002494A 


Keratin, high sulfur B2 protein 


12.44 6.66e-16 70-103 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.71e-09 36-69 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.73e-18 156-189 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.79e-ll 221-254 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 7.00e-ll 20-53 
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Table 3B 
407 



JlU 
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Results* 


1117 


1 r D U \J£ c Ty i -T£\. 


jvcrdiin, nign suiiur r>z protein 


1 9 AA 7 11o 11 1 O/^ 1 -^O 

lz.^-^f /.iie-ii izo-iDy 


1117 
1 J 1 / 


l r d V UZ4y 4 j\ 


Jveratm, nign suiiur r$z protein 


lz.44 /.1/e-io lzl-154 


1j 1 / 


TPRfifVMQid. A 


Jveraun, nign suiiur dz protein 


j iz.44 /.zze-i / zio-z4y 


1117 

LJ 1 / 


TPR 0094.04 A 
li Lj \J \j/>*z? 


iveratm, nign suiiur dz protein 


lO *7 AA** 10 1 C/C Tin 

1Z.44 /.44e-lZ ISo-Ziy 


1117 

LJ L I 


TPR009d.Q4. A 


jveratin, nign suiiur dz protein 


iz.44 Ooe-uy /o~iuy 


1117 
IJ 1 / 


TPR009AQA A 


iveratm, nign suiiur rsz protein 


IT 7 77« nn 1 07 OK 

iz.44 /.//e-uy IoZ-zid 


1117 
lj i / 


TP 13009/104 A 


xveratin, nign suiiur rjz protein 


iz.44 /.y/e-uy i4Z-i>o 


1117 
lj 1 1 


TPR009AQ4 A 
lr r>vUZ4:74.rY 


iveratin, nign suiiur t>z protein 


1Z.44 o.UUe-lZ z3o-zoy 


1117 


TPT30094Q4 A 


iveratin, nign suiiur r>z protein 


1Z.44 o.U/e-lo jj-oo 


1117 

LJ L / 


TPR009AQ4 A 


iveratin, nign sunur uz protein 


11 /I /I 0 Ala. 1A 0'7fi 

1Z.44 o.4 /e-iu Z4o-z/y 


1117 

LJLJ 


TP"R009AQ4 A 


iveratm, nign suiiur ±5Z protein 


17 /I/I o in or\ 

iz.44 o.o/e-iu o-3y 


1117 
LJ 1 / 


Lr D U UZ4y4A 


iveratin, nign sunur r>z protein 


17 //| o n7« 11 1/11 17/1 

Iz.44 o.yoe-11 141-1/4 


1117 


TPR009AQ4 A 


iveratin, nign sunur riz protein 


17 A A O n a 1 C 77 1 7iC/1 

IZ.44 y. 13e-l3 231-Z04 


1117 
i j i / 


TPR009AQ4 A 


iveratm, nign sunur Jt>z protein 


17 /l/i Q 77^ 17 7/f 1 77/1 

IZ.44 y.Joe-lZ 241-Z/4 


1117 


TPR0094Q4 A 
Lr 15 U UZHi'H'A 


iveratm, nign sunur r>z protein 


1 7 a a n oia no TA7 o /in 

iz.44 y.oie-uy zu/-z4U 


1117 
10 i / 


TPR0094Q4 A 


iveratin, nign sunur dz protein 


17 /|/l n OHo AO Q ^ 1 1 o 

iz.44 y.yue-uy oj-iio 


1117 

LJ A I 


TPR009A04R 


jvcrdiin, nign sunur i>z protein 


in to 1 nn« nn i rn i 74 


1117 

LJL/ 


TPR009J.Q4R 


jverdtm, nign sunur dz protein 


in *zq 1 AKa, 17 1 on on/i 
lU.Do 1.4D6-1Z iyU-zU4 


1117 

LJL/ 


TPR0094QAR 


jvcidtm, nign sunur r>z protein 


in 1 A^t* 19 99< 77Q 

1U.JO 1.4De- 1Z ZZD-Z3y 


1117 

LJL/ 


TPR0094QAR 

LJr SJ \J Jj 


Jverdtm, nign sunur r>z protein 


in co 1 no 00 1H7 


1117 

LJL/ 


TPR0094Q4R 


jvcidLin, nign suiiur dz protein 


in co 7 7^ 0 in 17c ion 

iu.jo z. /De-iu izj-ij>y 


1117 
LJL/ 


TPR0094Q4R 
lr JjUUZ^i'*+Ij 


iveratin, nign sunur Jtsz protein 


in^cc^ lie 11 c/i ico 
lU.Do J.lle-11 D4-Oo 


1117 

LJ L 1 


TPR0094Q4R 


iveratm, nign sunur r>z protein 


inCOO 01 Q 11 17H 17/1 

iU.oo j.ole-1 1 lzu-134 


1117 
LJL/ 


TPR0014Q4R 


iveraiin, nign sunur dz protein 


lO ^ 1 1 /fO /^O 

llOo D.o4e-ll 4y-o3 


1117 
LJ L 1 


TPR0094Q4R 
lr .DUuZ4y4.D 


iveratin, nign sunur ±5Z protein 


in co c £n« no ion iOvi 

iu.jo j.oye-uy ibu-jy4 


1117 
LJL/ 


TPR009AQ4R 


iveratm, nign sunur t>z protein 


in ro r /Trt- nn 7 1 C 7in 

iu.jo 3.oye-uy zi j-zzy 


1117 

LJL/ 


TPR0094Q4R 


jveratin, nign sunur uz protein 


1A CO C O/lo 1 A m ^1 ' 


1117 
LJL/ 


TPR0094Q4R 


iveratm, nign sunur dz protein 


in CO C QOa 1 1 1 CC 1 £ZCi 

iu.do D.yze- 1 1 Oj-ioy 


1117 


TPR0094Q4R 


iveratm, nign sunur dz protein 


1 n co c f\H n nn /co 


1117 S 

LJL/ 


TPR0094Q4R 


isjsrdxin, iiigii suiiur dz proLeiri 


in co ^ cQo no 10c 100 

lu.jo o.Doe-uy loj-iyy 


1117 

LJL/ 


TPR0094Q4R 


jveraiin, mgii sunur dz piorein 


in co coc no i7n 77/1 

lu.Do o.Doe-uy zzu-zj4 


1117 
i J i / 


TPR0094Q4R 
Lr LJ\) \Jz.Hy*TLj 


iveratin, nign sunur dz protein 


1 n co £. okq no co 77 

iu.jo o.yoe-uy jy-/j 


1117 

LJ L 1 


TPR0094Q4R 


iveratin, nign sunur dz protein 


in co 7 no a no icn 1 yi 

iu.jo /.U5e-uy iz>u-io4 


1117 

LJ L / 


TPR0094Q4R 


iveratin, nign sunur dz protein 


in co 7 no Q no 10c ino 

iu.t>o /.Uoe-uy iyj-zuy 


1117 
LJ L 1 


TPRO094OAR 
!.TljUUZ4y4.D 


iveratin, nign sunur dz protein 


in co 7 n7/> nn 70 no 

iu.do /.y/e-uy /y-yj 


1117 
LJL/ 


TPR0094QAR 


iveratin, nign sunur dz protein 


in co 0 C7„ in nn 10/1 
ID. Do o.Dze-lU l/(J-lo4 


1117 

LJ L / 


TPR0094Q/1R 
lr r>KJ UZ4y Jt> 


iveratm, nign sunur dz protein 


in co 0 C7„ in one 71 n 
lU.Do o.DZe-lU zUj-Ziy 


1117 
LJ 1 / 


TPR009AQ4R 
lJr.DUUZ4y4.D 


iveratin, nign sunur dz protein 


inc0 07Cy^ 1 T /J/| 70 

iu.do y.zje-iz o4-/o 


1117 

LJ L 1 


TPR0094Q4R 
1 r r> U U Z4 y 4 13 


iveratin, nign sunur dz protein 


incco77« 1 n nn ~\ a a 
lU.jo y.3ie-JU 13U-144 


1117 
LJ V / 


TPR0094Q4R 


iveratm, nign sunur dz protein 


in co 0 77 Q no o/t mo 

iu.do y.3/e-uy y4-iUo 


1117 


TPR0094Q4R 


iveratm, nign sunur dz proxem 


in co 0 /£o«=. no i/i n ic/i 

iu.jo y.oze-uy z4u-zj4 


1117 

LJ L I 


TPR0094Q4r* 


iveratm, nign sunur dz protein 


1/1 AA. 1 T) Q 71 1C/1 1 Q7 

14.40 i.ize-zi 134- iy/ 


1117 

LJL/ 


TPR0094G4r 1 


Jveratin, nign sunur dz protein 


1/1 /I 1 77„ 10 17 C/T 

14.46 l.z3e-lo 13-35 


1117 ' 

LJ 1 / 


lir.fc»UUZ4y4l^ 


iveratin, nign sunur dz protein 


1 yl A /L 1 1C~ in On 177 

14.46 1.35e-10 oy-13z 


1117 

LJ L I i 


llr.t>UUZ4y40 


iveratm, nign sunur dz protein 


1/1 A 1 O O „ 17 77 £T c 

14.46 1.38e-lz 22-65 


1317 


IPB002494P 


"K"pratin ViicVi sulfur R9 nmt<*in 

XVClClLlll, 111^,11 oUUUl XJx- |J1UIC111 


14 Af\ 1 <-Q<= 00 79-1 1 *J 
i^r.^+o I.«?ye-U-/ /z-113 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.71e-20 224-267 


1317 


IPB002494C 


Keratin, higli sulfur B2 protein 


14.46 1.73e-ll 27-70 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.91e-ll 37-80 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.11e-22 209-252 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.13e-12 84-127 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.22e-10 123-166 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.22e-12 168-211 
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TD 

AJJ 

NO* 


pnfrv TT1 
ciui y lxj 


uescription 


Results* 


1317 


IPB002494C 


TCpTfltin liio'li ciilfiir 150 nrntpin 

XV^/iailll, lllgll OUUUI XJ Z piULClll 


14 Afs 0 0Op_0Q 1 £4-007 

1*t.--tU Z.Zj7C - Z«7 1 Ot-ZZ / 


1317 


IPB002494C 


XVCiallll, lllgli oUUUl 13 Z> piVJLClli 


1 4 Af\ 0 ^ 1 p 1 0 40 R-^ 


1317 


IPB002494C 


TTprntiti Hioli Qiilfiir R9 -nrntpin 

XV-V^XClLlXlj lllgll oLll 1 Lll JJZ, |JiiJLClll 


14 4^0 4-^p 1 QQ 140 

Xt.*TO Z.-tJC- IJ ZrzJ-" 1-*TZ 


1317 


IPB002494C 


TCpratin liio"!) <5iilf*nr R9 nrrkt/^i n 

XWw/ltXLlllj 111X^11 OLXlXLll XJ iL, JJ1VJLC111 


14 46 0 48p~1Q n 76 


1317 


TPB002494C 


TCprntin nicli ciilfiir T-19 nrntpin 
xvt<la,uuij Jlllgll oUlxUx JjZt piULClll 


14 46 0 SOp_14 0^4 077 


1317 


TPB002494C 

XX XJ V/V/Z".?" V/- 


xVCldllllj lllgil alillUl xjZf pXOLclIl 


1 4 46 0 -^-^p 1 ^ QSZ 141 


1317 

x ^ x / 


IPB002494C 


xVV/lctLlil) lllgil oUxxUI JjZ> pi OLC1H 


14 46 0 ^7p 0O 1 HI 


1317 


IPB002494C 


xvcictnii, nigci buiiui jDjC pruicin 


1 4 46 0 6Hp HQ 47 Qfl 


1317 


IPB002494C 

XX XJv Vi-rf"> I Vw/ 


K'f^'TCi'fiTi 111 o*l*i cnlfiit* r\rf\4'e*i n 
xV.ClcH.lllj 111^11 oulxUl XJZf pi L> LCI 11 


14 46 0 64p 11 145 1 86 


1317 

X X / 


IPB002494C 


ivcldUU, lllgll oUllLU xj2> pi O LCI 11 


14 46 5 DHp 05 1 70 000 
l-H.-4-O j.uue-zo 1 /y-zzz 


1317 

x X / 


IPB002494C 


K'f^m+i'M llioTl cnlfnt* nrntpin 
xvv^lclLlllj lllgll oullUi 13z> pi Ult/111 


14 46 5 nOp 111 55 1 


1317 

X — / X / 


IPB002494O 


xvci ciLiiij liigii ouixui xjz. piULCill 


14 46 5 56p HQ 005 066 
LH.t+O 0.JOC-U7 Zzzi-zOO 


1317 

X — v X / 


IPB002494C 

XX XJ v Vi* I 


T^prntin liicrli cul'fiiT" R9 nrntpin 
XVCldLlll, lllgll oUllUI dz pujiciii 


14 4 6 5 552p 05 OfiA 047 


1317 

X ' x / 


IPB002494O 


T^'f^'ra'fiTi rii o*li Giilfiif -t-ikt-^+f*»i-n 
xVClaLlll, ill ££11 oUliUl xj^ piULCill 


1 4 46 5 zt4*= OQ 0 5 Q O Q 1 


1317 


IPB002494C 


TTpmi'in Tricrl-i ciilfiir R9 nT'/rf/^i-n 
IVCldLlllj lllgll oUliUl xjZ> piULClll 


14 46 5 86p HQ 018 061 
l-H-.-H-O J.0DC-U7 zlO-zOl 


ni7 




ivciaLiii, m^n ouiiur e>z, protein 


14 46 5 QHa 50 1Q/I 057 


1317 

x «_/ x / 


IPB0CP494C 


T^"p a 'TCl+in 111 OtVi Cnlfiit* T^ / ? i*%r , /~»+fai-n 

jvcicttiiij iiigii bu.il ui xj ^ pivJLclli 


1 4 46 5 Q7p 1f\ 10 <C 


1317 

X — ' X / 


TPB002494C 1 

XI XJ u ViiT7"V/ 


xVClaLHl, lllgil oUlxUl iJ^6 piUldlx 


1 4 46 4 Hfip 1 A 1 85 05 1 


1317 


TPR007494.P 

XX DVj\/Z*T^*tV_/ 


JVClaLlIl, lllgll bUllUI xjZ prOlClll 


1 4 46 A 1 Bp 111 7Q 00 1 


1317 

1 —/ X / 




Tt^^rQ^in I'll n*Ti cmil*fiiT" t^0 rxt^nfmi*! 

xvciciniij iiign buiiur dz proicin 


1 A /1 6 /I 5/Co 1 1 £7 1 in 


1317 

x — / x / 


TPB002494C 


xvciainij iii^n ouiiur dl proiciii 


1 A Aft A 56p OI 1 QO O/IO 

14.40 4.ziOe-z I iyy-z4Z 


1317 


IPB002494C 

XX XJWJXf^.^T Vv 


xvCI uLlll, 111^11 bu.llU.1 DZi piULClll 


14 46 4 4Qp 1C 14A 1 C7 
14.40 4.4ye- 1 0 144-15/ 


1317 


IPB002494C 


TCprntin liiori cnlfiir nmfpiri 

XVCl ClLlIl, lllgll o 14.11 111 Ljx* piULClll 


14 46 4 *\0p OS A5 86 


1317 


IPB002494C 

XX XJ V Vi<T ~ ^ 


TCprntin ViicVi cnlfiir R0 tirAfpin 

XVCldllllj lllgll bUlXUl XjZ> pi (J I will 


14 46 4 ^p 07 00Q 070 


1317 


IPB002494C 


TCpratin Viitrlx ciilfiir R9 -nrnfpin 
lYciamij uigii ouiiui ljjj piULClll 


\AAf\A£TIe* in 1<i8 om 


1317 


TPB002494C 


T^'prai'in TiiorTi cnlfiir R*7 nrntpin 
xVClaLlll, Iligll oUllul DZ piOLCIIl 


1 4 46 4 84p 10 1 Q8 04 1 
14,40 4.o4e-lz iyo-z41 


1317 


IPB002494C 

XX X_» WX*"TJ^"TV_<- 


TCprni*in Vtioi*i cnlfiir R9 nrnfpin 

lvul uLlllj lllgll oUUUl U^r piULClll 


1 4 46 H4p HQ 1 (YX 1 46 
1-4-.-4-0 D.KJ^tQ-yjy IVJj-140 


1317 

X X / 


IPB002494C 

XX X-^ vv**~y~ V-^ 


TCpratin liiffn cnlfiir R9 nrntpin 

1VW dtlllj lllglx OU11U1 LIZ piVJldll 


14 46 ^ IOp 0<\ 1 60 010 
1-t.tO J.IUC-ZJ IO7-ZIZ 


1317 

X X 1 


TPB002494C 


i^pratiTi Hitrli cnlfiir R0 nrntpin 
xVClaLlll, lllgll bUlxUl ljjL pi (IRCIII 


14 46 ^ 1 Op in 1 06 
14.40 D. iye-lU IDJ-IVO 


1317 


IPR002494r 


T^p-rai-in Tii trn cnlfiir R0 nrAfpin 
XVCl«xLlll 5 Ill^ll blxllUl DZ piULclH 


1 4 46 <I ^np 1 4 Q4 1 57 
1H-.40 D.DUQ- L*+ i?4-lj / 


1317 

X — ' X / 


TPB002494C 

XX X-r \J \J 1_X ~ 


T^prntiTi ni<rVi ciilfiir R0 nmrpi n 
XVCldHllj lllgll oUUUl J_>Z pivJLClll 


1 4 46 ^ ^np 1 8 85 1 06 
1<4\40 j.jUc-1o 0z>-1zO 


1317 


TPB002494C 


TCpmtin ViicrVi cnlfiir nrntpin 
xVCiallll, lllgll oUlXLU. DZ piULClll 


14 46 ^ ^np 0^ 68 111 
14.40 J.D u e-ZJ 05-111 


1317 

X «-» JL / 


XX XJV/V/Z*T_7'TV>< 


l^pratin ni crTi cnlfiir TIO nrntpin 
xVCiallll, lllgll oLlllUl OZ, piULClll 


14 46 ^ Qnp iA 0AA 087 
14. 4D 3.yi/e-14 z44-z5 / 


1317 

X X / 


IPB002494C 

xx xj y vii^^Tv-' 


PiTprfltin Viicrri cnlfiir RO nrntpin 
XVClctLlllj lllgll oullUl xjjL pivJLClll 


14 46 6 1 6p 1 0 01 5 0^6 
14.40 O.lOe-lz zIj-zjD 


1317 


IPB002494C 

XX XJV/ V*(~7~\^ 


l^prfltin liicrri cnlfiir R9 nvcxtt^in 

XVCldllllj 111^11 OlilXLll LIZ piULClll 


1 4 46 6 1 6p 01 014 0*?7 
14. 40 O. lOe-zl zl4-Zj / 


1317 


TPR007494P 


T^*prcitirt nioli onlfiir R0 nrntpin 

xvciaLiii, ni^ii ouiiur jjz. proicm 


1 /l A6 6 1 Qp 50 I^O OHO 

14.40 0. 1 ye- jz i j y-zuz 


1317 


TPB002494O 


T^pmtin TiicrV* cnlfiir R7 v\vr\ie^\r\ 

XVCldllllj 111^11 sUllUl DZ pi VJIC1I1 


14 46 6 0*Cp 05 174 01 7 
14. 40 O.zjo-zj 1 /4-zl / 


1317 

X -/ X / 


TPR002494P 


T^f^rotin rii crh cnlfiir* 15 O nmtpm 
xVCldllll, lllg,Il bUlJLUl DZ piULClll 


1/t ^ 1Q 050 OCT 

14.40 o.j^e-iy zz?y-z5z 


1317 


TPR0004Q4P 

XX XJV WZ*T J*T 


l^prcitm hi ct \\ ciilfiir 150 nrntflm 
XVCldLlIlj Illgll bUlllil DZ piULClll 


14 46 6 ^7p 1 6 78 1 OI 
14.40 0.D /e-10 /5-lzl 


1317 

X _/ X / 


TPR002494P 


TTprntin ntrrVi cnlfiir 150 nrr»tpin 
xvciaLiii, ni^ii biiixui D jC pi vjiciii 


14 46 6 74p 1 fi 10Q 170 

14. 40 0. /4e-io izy-i/z 


1317 

X w' X / 


TPR002494C 1 

XX XJ V UiT7"V^ 


T^prdtiti rii crn cnlfiir 150 nrntpin 
xVCiallll, Illgjll DU1XU1 JjZ pit* LCI 11 


1 4 46 6 Q 1 p 10 ^0 1 

14. 40 o.yie-iz oz-iuD 


1317 

X *J X / 


TPR0074Q4P 

XX UUVii^y i V/ 


l<^"prcitin TiifTri ciilfiir 150 nrntpin 
xvciaLiii, lllgll DtllXUl DZ piULClll 


1 A A(k & QAp 1 7 1 1 Q 1 ^O 

14.40 o.yoe-i / ny-ioz 


1317 


ir jl> v Lf iitj/nvy 


Tifpfntin hirrn ciilfiir 150 nrr\toin 

xvcraiiii, nign sunur dz proicin 


1/1 A£L H nfta 11 17Q OI ^ 

14.40 /.uue-li 1/3-Zlo 


1317 

X_> X / 


11^150004041^ 


xveraiin, nign sunur dz proiciri 


14.4o /.UUe-lz lzo-1/1 


1317 

X -J X / 


TPR0074Q4P 

XX IJUU^'TytLi 


PTpratin Viion ciilfiir T50 nrr\t*iin 
IvClaLiri, IHgll oUUUl DZ piULClll 


14.40 /. ize-lU zUo-zj 1 


1317 


1PB002494C 


Keratin, high sulfur B2 Drotein 


14 46 7 1 Op-OS 1 0A_ 1 67 
x— r."VJ / . x jc zc xz—r iu / 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.21e-16 3-46 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.30e-18 73-116 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.43e-24 134-177 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.50e-23 53-96 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.55e-10 32-75 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.64e- 11 193-236 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.73e-10 148-191 
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OT7" 

SEQ 

W 

INfJ! 


Database 
entry ID 


Description 


Results* 


1017 

1-51/ 


TPP. A AO A QA (~* 
lJrr>UUZ4y4t-' 


Keratin, high sulfur B2 protein 


HI A A SB ,a» A .a ^ a _^ ^a> i t 

14.46 7.75e-20 114-157 


1017 


lr\t>UUZ4y40 


Keratin, nigh sultur Bz protein 


^ A A J^* irt a^ a*V a*^ <fl /" I a**a a"*4 a«^ 

14.46 7.88e-23 189-232 


lOl/ 


TPP A AO /I Q /t Z*" 1 
lr'±JUUZ4y4C 


Keratin, nigh sulfur Bz protein 


-4 J J V - * >'f /~\ ✓— N a*^ a*X a# a*^ «H aa _i ■_ 

14.46 7.99e-20 109-152 


1717 
LD L 1 


lJrr>UUZ4y4L/ 


iveratm, high suliur Bz protein 


14.46 8.03e-12 163-206 


1717 

LD 1 / 


TPPAAO/dQ^r"* 
lJrUUUZ4y4v^ 


Jveratin, nign suliur Bz protein 


14.46 8.05e-31 149-192 


1717 


TPP. A A O/I Q/! 
Lr 13UUZ4y4L^ 


1 , ^ S — — •— 4 y * -■_ I — _■ ilia * 1 ^ * _ _ ~T^"^ _ 

Iveratm, nigh suliur Bz protein 


14.46 8.15e-22 28-71 


1717 


lir x5u uz4y 4^ 


Jveratm, nigh sultur hJZ protein 


14.46 8.22e-12 8-51 


1717 
10 l / 


lJrr>UUZ4y4V^ 


rLeratin, nign suliur Bz protein 


14.46 8.25e-23 164-207 


1717 
LD L f 


lJrr>UUZ4y4v^ 


Jveratin, nigh sultur Bz protein 


14.46 8.32e-21 23-66 


1717 

LD 1 / 


lJrr5UUZ4y4*w.' 


Jveratm, nigh suliur J3z protein 


14.46 8.81e-25 18-61 


1717 

LD 1 / 


ijrr>uuz4y4i_' 


Keratin, high sulfur B2 protein 


14.46 8.91e-09 77-120 


1717 
LD L I 


l_r JO U UZ4y4L,. 


Keratm, nigh sultur Bz protein 


14.46 8.92e-17 249-292 


1717 
1 J 1 / 


tppaai/i o/i 0 
iJruuuz4y4La 


Keratin, high sulfur B2 protein 


14.46 9.02e-17 88-131 


1717 
101/ 


ijr uuuz4y4Ly 


Keratin, high sulfur B2 protein 


( 14.46 9.04e-10 57-100 


1717 


TPRAAOziQjdT" 1 

lriJUUZ4y4V^ 


Keratin, nigh suliur Bz protein 


14.46 9.24e-09 19-62 


1717 

LD 1 / 


lJrrJUUZ4y4l^ 


Keratm, nigh sultur h5z protein 


14.46 9.32e-13 203-246 


1717 


IJr £> U UZ4y4 ^ 


Keratin, nigh sultur J3Z protein 


14.46 9.41e-20 63-106 


1717 
iO 1 / 


1 Jr J3 u uz4y 4 c 


Keratin, nigh sultur Bz protein 


14.46 9.43 e- 19 139-182 


1717 
LD L / 


IJr.t>UUZ4y4L, 


Keratin, nigh sultur Bz protein 


14.46 9.51e-22 38-81 


1717 
Ij 1 / 


tppaao/i O/i 
lJrxJUUZ4y4lw< 


Keratm, nigh sultur Bz protein 


14.46 9.73e-26 48-91 


Ij 1 / 


tppaa'-m QAr* 


Keratm, nign sulfur B2 protein 


14.46 9.74e-10 108-151 


1717 
LD L / 


lr D U UZ4y 4 


Keratm, nigh sultur Bz protein 


14.46 9.82e-ll 52-95 


1717 
LD L / 


TPPAAO/IO/1 z^ 1 

ljrr>UUZ4y4C 


Keratin, high sulfur B2 protein 


14.46 9.86e-28 219-262 


1717 
LD 1 / 


lJrJt>UUZ4y40 


Keratin, high sulfur B2 protein 


14.46 9.89e-16 93-136 


1717 
LD L I 


IJr\bUUZ4y4C 


Keratin, high sulfur B2 protein 


-| J A S~\ S~\ aTV -4\ A -4 r~\ A HA mm 

14.46 9.90e-14 104-147 


1717 
1-5 I / 


JrKUUo /OB 


Nematode metallothionein signature II 


7.66 9.76e-09 113-126 


1 71 8 
1 J 1 0 


TDT3 AAAAA/C 


Vertebrate metallothionein, family l 


13.41 1.00e-08 42-87 


1 71 S 
LD La 


tpraaaaa^ 
IiMjUUUUUO 


Vertebrate metallothionein, tamily 1 


13.41 2.10e-09 125-170 | 


1 71 R 
LD LO 


tpt^apiaaa/^ 

IJr JljUUUUUO 


vertebrate metallothionein, tamily 1 


13.41 2.74e-09 34-79 


1 7 1 S3 

LD L O 


TPI3AAAAA£ 
lJrrSUUUUUO 


Vertebrate metallothionein, tamily 1 


■i a^ A -\ *—* r~\ -4\ y^V afl af*— J af a-4 1 

13.41 3.59e-12 124-169 


1 7 1 R 

LD lo 


tpraaaaa^ 


Vertebrate metallothionein, tamily 1 


13.41 4.21e-12 119-164 ! 


1 71 R 


TPRAAAAA/:; 
lirJUUUUUO 


Vertebrate metallothionein, tamily 1 


13.41 4.58e-09 32-77 | 


1 7 1 £ 

LD L O 


TPT3AAfinA£ 
lI\tJUUUUU0 


Vertebrate metallothionein, family 1 


13.41 4.67e-09 94-139 


171 R 

ID L O 


lr\DUUUUUO 


vertebrate metallothionein, tamily 1 


13.41 5.04e-09 88-133 1 


1 7 1 £ 

LD L O 


TPPiAAAAAf? 
IJrrJUUUUUO 


vertebrate metallothionein, tamily 1 


13.41 5.13e-09 101-146 i 


1 71 r 

1 J lo 


TPQAAAAA/C 

IJrrJUUUUUO 


"\ T—--4.—'L.~ — J-~ . a-„71 „a-1-J X* • 1 ■* 

Vertebrate metallothionein, family 1 


13.41 5.16e-10 100-145 | 


1 7 1 R 
l j 1 0 


TPP2AAAAA£ 
IJrrJUUUUUO 


Vertebrate metallothionein, family 1 


13.41 5.20e-ll 109-154 \ 


1 71 R 

Ij 1 O ! 


TPTiAAAAA/^ 
IJrrJUUUUUO 


Vertebrate metallothionein, family 1 


13.41 5.40e-ll 120-165 | 


1 7 1 8 
1-5 1 0 


TPT3 AAAAA/C 

IJriJUUUUUo 


Vertebrate metallothionein, family 1 


13.41 5.65e-10 114-159 1 


17 1 R 
Ij 1 0 


TPRAAAAA/^ 
Lr JtJUUUUUO 


Vertebrate metallothionein, tamily 1 


13.41 5.68e-09 110-155 


171 R 


TPPiAAAAA^ 
IrrJUUUUUO 


Vertebrate metallothionein, tamily 1 


13.41 5.78e-09 104-149 


17 1 R 

IJ iO 


TPRAnAAA^ 
IJr UUUUUUO 


verteorate metal lotnionem, tamily 1 


*1 O >l "1 at* aTV aTV aT* *f aTV V 

13.41 5.96e-09 55-100 ; 


171 R 
LD 1 0 


TPT3AAAAA**; 

IrJtJUUUUUu 


Vertebrate metallothionein, family 1 


^4 >4% 4 a# a^V a**4 a# a^ a^V A a] a ■_ f 

13.41 6.03e-10 98-143 


17 1 R 

LD L O 


TPT3AAAAA»< 
IJrrJUUUUUO 


Vertebrate metallothionein, family 1 


•4\ A -4 af 1" a*V a4 af] B a 1 

13.41 6.50e-ll 35-80 j 


1 7 1 R 
LD L 0 


IJrrJUUUUUO 


Vertebrate metallothionein, family 1 


13.41 6.59e-12 115-160 


171 R 
ij j. 0 


ir^rJUUUUUO 


vertebrate metaiiotnionem, tamily 1 


"t A "t >f *T\ ft /*\ af\ 4 4 4 4 a f ^ 

13.41 6.97e-09 111-156 | 


1318 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 7.61e-09 43-88 


1318 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 9.23e-10 83-128 


1318 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 9.67e-13 93-138 


1318 


IPB000254 


Cellulose-binding domain, fungal type 


18.11 6.86e-09 94-124 


1318 


IPB000359B 


Cystine-knot domain 


19.26 3.25e-12 147-165 


1318 


IPB000359B 


Cystine-knot domain 


19.26 5.29e-13 127-145 


1318 


IPB000726A 


Glycoside hydrolase family 19 


14.41 9.88e-09 3-29 \ 
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SEQ 
ID 


Database 
entry ID 


Description 


Results* 


mo 

UlO 


LriD\JV\) /J.O& 


oiycosiae nyaroiase iamiiy iy 


J.jy 3.4Ue-10 lo-zo 


1 ^ 1 c 
1 J lo 


IrrJUUU /zoo 


Glycoside nyaroiase iamiiy iy 


C O A 1 /( A« 1AA 1A 

5.39 3.40e~10 9-19 


1 j 15 


rD"nn a 1 (\(\ir* 

i Lr iS\j\) LKjyj 


von w lUeDrand. iactor, type U repeat 


OA OA A IT, AA 111 1 O O 

20.89 4.27e-09 1 1 1-132 


1 J lo 


T"D13AA1 1/CQF 

lrt>UUl loyK 


integrin beta, C- terminus 


27.45 3.07e-09 11 8-160 


1T1 o 

13 1 o 


IrrSUUl loyK 


Integrin beta, C-terminus 


27.45 5.62e-09 25-67 


1 1 1 Q 

1j la 




Integrin beta, C-terminus 


27 A5 8.91e-09 35-77 


1 7 1 Q 


rnt) n a i a*7 1 

LrrJUUlZ / 1 


Mammalian derensin 


19.97 5.91e-09 142-170 


111 o 


TDDAH1 I 

lr.BUUlZ/ 1 


Mammalian defensin 


19.97 6.89e-09 107-135 


I'll Q 

1J lo 


lr\b>UU 1Z / 1 


Mammalian defensin 


19.97 8.41e-10 21-49 


1 7 1 Q 


TDT3AA1 '7^0 A 
IrVBUUl /OZA 


Disintegrin 


23.93 3.03e-13 112-152 


1715 
LD 15 


T"DTJ AA 1 7<CO A 


uisintegrin 


23.93 4.l8e-09 113-153 


1 7 1 Q 
LO 1 0 


IrMSUU 1 /OZA 


Jjismtegrin 


c /ta_ i m i r\o 1 yt o 

23.93 5.60e-l0 103-143 


1 7 1 8 
1 J 1 5 


i TDT3AA1 A 
Ir\fc>UUl /DZA 


JJismtegrm 


0,0 oo c „ 1A 1 OO 1 o 

23.93 5.60e-10 123-163 


1 7 1 Q 
U lo 


Ix^oUUZl /4A 


Furin-like cysteine rich region 


o/\ ci 1 n "7 „ i r\ oo t t a 

30.51 1.97e-10 83-114 


1 7 1 S 
Lo 1 O 


TDDAA01 1 A A 

LrrJUUZl /4A 


Furin-Iike cysteine rich region 


OA C1 o rtn „ in no i o /f 

30.51 3.92e-10 93-124 


1 7 1 Q 
1 J lo 


TDT5AAO 1 *7/1 A 

IrrJUUZl /4A 


Furin-like cysteine rich region 


OA C1 O OA_ AA 1 A >l 

30.51 8.20e-09 73-104 


1 7 1 Q 

lo lo 


LroUUZi /4A 


runn-liKe cysteine rich region 


OA CI O OO AA 1 'I A O 

30.51 S.8Se-09 17-48 


1 7 1 S 

1 J lo 


TDOAAOI HA A 

LroUUZl /4A 


Furin-like cysteine rich region 


OA C1 A O 1A O O/l 

30.51 9.27e-10 3-34 


1715 

lo lo 


TDT* AA1/I A/1 A 

lrr>UUZ4y4A 


Keratin, high sulfur B2 protein 


1A /lid 1 11— 11 1A1 10/1 

12.44 l.lle-11 101-134 


1 7 1 C 

lo lo 


TDOAAO/1A/I A 

lr£>U(JZ4y4A 


Keratin, high sulfur B2 protein 


1A /I /I 1 10 1T10A 1/"0 

12.44 1.13e-17 130-163 


17 15 

lo lo 


IrrS UUZ4y4 A 


Keratin, high sulfur B2 protein 


10 A A 1 01_ 1A11/^ 1 -1 /A 

12.44 1.31e-10 116-149 


1715 

10 lo 


TDDAAO/1Q/1 A 

lrr>U0Z4y4A 


Keratin, high sulfur B2 protein 


1A A A 1 _ 10 1AC lOO 

12.44 1.78e-12 105-138 


1715 
lO lo 


lFr5UUZ4y4A 


Keratin, high sulfur B2 protein 


1A A A O AA— 1A AC CO 

12.44 2.00e-12 25-58 


1715 
10 1 o 


TPTi AAA4Q/1 A 
11 Jt>UUZ4y4A 


Keratin, nign suiiur h>Z protein 


11 A A 1 /C AA 0 1 ZT/1 

12.44 2.16e-09 31-64 


17 15 
10 1 0 


Ir t>UUZ4y4A 


iveratm, nign suiiur r$z protein 


io /f/i i /to ^ ic i /in no 

12.44 2.63e-15 140-173 


1715 

lo lo 


lrr>UUZ4y4A 


Jveratm, nign suiiur Bz protein 


1A /1/f O OA— lO 11C I/IO 

12.44 3.32e-18 115-148 


1715 

10 lo 


lr*r>UUZ4y4A 


Keratin, high sulfur B2 protein 


1A A A O /t /I _ 1^" IOC 1 /T O 

12.44 3.44c- 16 135-168 


1715 
1015 


lrt>UUZ4y4A 


Keratin, nign suiiur dz protein 


1A A A A 11 - 1 s~ i /^a ICO 

12.44 4.21e-16 120-153 


1715 

lo lo 


TD"Q A AO /I CSA A 

lrr>UUz4y4A 


Keratin, high sulfur B2 protein 


1 O A A A AO 1A1 AA 1 O O 

12.44 4.48e-10 100-133 


1715 

lo lo 


lrr>UUZ4y4A 


Keratin, high sulfur B2 protein 


12.44 5.04e-13 91-124 


1715 

lo lo 


TT>"D AAO/1 A/1 A 

lrriUUZ4y4A 


Keratin, high sulfur B2 protein 


1A A A C 01 _ 1AOrt /^O 

12.44 5.81e-10 30-63 


1 7 1 C 

lolo 


TDDAAO/tA/1 A 

lrr>UUZ4y4A 


Keratin, high sulfur B2 protein 


1A A A f o o r\r\ ^ r\f iort 

12.44 6.32e-09 106-139 


1715 

lolo 


lrJt3UUz4y4A 


Keratin, high sulfur B2 protein 


12.44 6.67e-12 14-47 


1715 
10 lo 


lrJoUl>Z4y4A 


Keratin, high sulfur B2 protein 


1A >1/1 /T OA— 1AAA CO 

12.44 o.89e-12 20-53 


1715 

lo lo 


TDDAAO/IO/1 A 

irr> U UZ4y 4 A 


Keratin, high sulrur r>2 protein 


1 A A A 'H AO 1 A A ^7 1 A A 

12.44 7.03e-10 96-129 


1715 
10 1 o 


lrr>UUz4y4A 


Keratin, high sulfur B2 protein 


1 A /I /I *7 ^TC — 1A 1A1 IC/t 

12.44 7.65e-10 121-154 


10 1 o 


lrr5UUZ4y4A 


Keratin, nign suiiur r>z protein 


1A /f/1 O If. A A A1 C A 

L2A4 o.35e-09 21-54 


1715 
10 lo 


TDQAAO/IO/I A 

lJrJt>UUZ4y4A 


Keratin, nign suiiur r>z protein 


1A A A O OO— IC 111 1/1, 1 

12.44 8.88e-15 111-144 


1715 
lOlo 


TDT3 AAO/1 0/l A 

lrr>UUZ4y4A 


Keratin, high sulfur B2 protein 


1 A A A A AO — AA O /T" CA 

12.44 9.03e-09 26-59 


1715 
lO 1 o 


TT>"D A AT /I Qyl A 

LrJoUUZ4y4A 


Keratin, high sulfur B2 protein 


1A /i yl AAA 1A O 1lr\ 

12.44 9.22e-12 86-119 


1715 
lO 1 o 


lJrJoUUZ4y4A 


Keratm, nign sullui r>z protein 


1A /I /I A AC— 1 1 A A C7 

12.44 9.25e-ll 24-57 


1715 
10 15 


lrJt>UUZ4y4A 


Keratin, nign suiiur r>z protein 


1A A A ACT 1A 11A 1/10 

12.44 9.57e-19 110-143 


1715 

lolo 


TDT3AAO/1A/1 A 

LroUUz4y4A 


Keratin, high sulfur B2 protein 


12.44 9.73e-18 125-158 


1715 
Ld lo 


TDDAAO A A /I T5 

lJrJDVJt;Z4y4r> 


Keratin, high sulfur B2 protein 


10.58 1.51e-09 124-138 


1715 

lolo 


TDD AA1 /1 A/1 "D 

lrioUUZ4y4o 


Keratin, high sulfur B2 protein 


10.58 2.27e-09 159-173 


1318 


TPB002494R 


JXCIallll, Illgll oUllui JDx. piUtcin 


lu.jo j.33e-iz ijy-i3o 


1318 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 6.70e-12 129-143 


1318 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 8.45e-ll 149-163 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.27e-ll 114-157 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.27e-ll 99-142 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.41e-27 128-171 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.70e-10 104-147 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.08e-13 98-141 
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SEQ 

ID 

NO: 


Database 
entry ID 


Description 


Kesults* 


1 O 1 o 

lolo 


TDT3 A A 7 A A A 


iveratm, nign sultur BZ protein 


l^K^O J.^fOe- 1 o 11j"1jO 


1 1 1 o 

131o 


T"DT5 A AO A A A 


xveratm, mgn sultur t>z protein 




1 O 1 o 

131o 


TTJO A AO A n A 

lrr$L)Uz494C 


Jveratm, mgn suitur bz protein 


14 4^4flQp 11 119 1^^ 

iH-.^f o 4.uye- 1 1 iiz-ij^ 


loin 

131o 


lrt>UUz494L> 


Jveratm, nign sultur bz protein 




iJio 


TDDAAO /I A/1 

lJrr>UUz4y4L> 


Jveratm, mgn sunur bz protein 


1 d. 4^ 4 R/l^..? 1 10S_1^1 


1110 
Li lo 


Tt>T3 A AO A A A 

lrr>UUZ494C 


iveratm, nign sunur bz protein 


1 A A& ^ 10p 10 19^-1^/^ 


1Q1 O 

IJlo 


TDT3 A AO A A A 

lJrrJUUz494L- 


Jveratm, nign sunur bz protein 




111 o 

131o 


lrr$UUZ494C 


iveratm, mgn suiiui bz protein 


1 A A& ^ 77f» 17 1 fH-1 A_^ 


UIo 


lrl3UUz494U 


iveratm, nign sultur bz protein 


l<d ^ R9p» 11 1 HQ- 1 S9 
iH-.^fo d. oze- 1 1 It/y-IJZ 


111 o 

1318 


T"DD A AO A C\ A /~i 

lrrJ00z494C 


iveratm, mgn sultur bz protein 


1 A A& A Q7p* 1 "3 9^ 
14.40 0. 0 / e- 1J> Zj-OO 


1 1 1 o 

131o 


TUT) A AO A A A 

lro U0z494C 


iveratm, nign sultur bz protein 


1 A A& 7 C1a AO Qzl 1^7 


131o 


T"DD A AO /K1/1P 

lrJ30Uz494C 


iveratm, nign sultur bz protein 


14 J.^ 7 C/Lo 19 199 1 61 
14. 40 /.o^fe-iz IZZ-JLOJ 


1 1 1 o 

1318 


TT5T3 A AO ACi A 

lP13(J024y4C 


Keratin, nign sultur Bz protein 


'\A A& Q Afla AO AS 01 

14.40 o.4ue-uy 4o-yi 


1 1 1 o 

1318 


ll J B00z494C 


Keratin, nign sultur BZ protein 


1 A Q ?7o 00 11C 1^1 

14. 4o o.j /e-zy 1 lo-lOl 


1 1 "I o 

1318 


lrB002494C 


Keratin, higli suiiui BZ protein 


1A A^ Q i^Q<= 1A52R 1^1 
14. 4D cS.Oye-lU oo-1j1 


111 n 

1318 


TUTD A AO AnAf^ 

lrr>(JUz494C 


Keratin, nign sultur bz protein 


1 A Afk S SQa 19 119 1 f,C\ 

14.40 o.oye- iz 1 1 /- iou 


1 1 1 o 

1318 


TTJTD A AO /tCliin 

lJrBU(Jz494C 


Keratin, nign sultur BZ protein 


1A A& O ?.'Xa> AQ n^; 


111 o 

1318 


TOO A AO A A A 

lrt>U0z494C 


Keratin, nign sultur BZ protein 


144^Q71p 1^ 197 17H 


1110 
1318 


TDDAA1 r-71 -r> 

lrr>UU3j / lr> 


onaKe toxin 




1 1 1 o 

1318 


T>T> A AOO A /~* 


rungi-iV metallotnionem signature ill 


A ^7 O 7A«» 1 A G*\ 1 HQ 

4.j / y. /4e-iu yj-iuy 


1 1 1 o 

1318 


PRUUd /oB 


1 ____. -------- __L_ — jJ _p*k jfcifc jfcX _— ^ 1 1 _p*W _L 1 -m __| ji y MB _T*» ^ _UK _A _f~ B kP h ^^ _"^ ^h*« li V 1 

JNematode metallotnionem signature 11 


7 9 77o 1 A 1 07 1 9A 

/.ooz.z/e-iu 1U/-1ZU 


loin 

1319 


TT3TD AAAOOO 

IrBUOOozz 


Zinc tmgei, UZrlZ type 


1 A £7 ^ A$2^ 1 1 7*C 9AA 
14.0/ J.Uoe-lo 1 /D-ZUU 


1319 


TTJTD AAAOOO 


zmc linger, v,Zriz type 


1 A f>7 A 7<vf=> 1 Q 1 A7 1 79 


1319 


TT»*D AAAOOI 

IrBOOOoZZ 


Zinc linger, CZi-lZ type 


1 A £7 A QA» 1 A 1 1 Q 1 A A 

14.0/ 4.y4e-t4 iiy-144 


1 1 1 n 

1319 


IFBOuOezz 


Zmc tmger, CZBZ type 


1 A A1 7 Q9o 1 C 52 R 

14.0/ /,yze-io oj-oo 


1 1 1 o 

1319 


TDTD AAAOOO 

IrBUUUoZZ 


Zmc tmger, uzhz t3pe 


1A fH S 'Xfs.f* 91 9H^ 978 
14.0/ o.joe-zi ZUj-ZZO 


1319 


AAAOOO 

IrBUOUbZZ 


Zinc linger, CZHZ type 


1A fH 9. 91 01 1 1£ 

14.0/ o.ooe-zi yi-iio 


11m 
1319 


TT>D AAAOOO 


Zmc tmger, L2Hz type 


14. o/ y.jie-io jj-du 


1319 


T"DT3AA1 OOO 

LroUUlZZZ 


lrllo zmc riDDon domain 


9A fS\ A 7C\o HO 17^-91 1 


inn 
13 19 


1 nnnn mr 
IFBUUIZ/J 


iJM ujna uinamg aomam 


in 17-3 A^f* 1 ^ 7Q 1 1 5? 

1.7.1/ j.^ju-ij /y-no 


1319 


T"DT3 AA1 T7C 

lrr>UU12/3 


um iJiN A Dinomg aomam 


1 Q 1 7 A 77p 1 D 1 ^^-909 

izf, i / 4. //e-iu ioj-zuz 


1 1 1 n 

1319 


TT>T5AA1 T7f 

IJrBUUlZ/j 


iJM UNA binding domain 


1Q 17 ^ Qlo 11 <1 OA 

iy. 1/ o.yie-ii Ji-yu 


1 1 1 a 
13 19 


IrBUUlz/j 


_uivi iJJN A oinamg aomam 


in 17 7 1A P in 1 A7_. 1 A A 
Ly.L/ /.ZL/e-lU 1VJ/-140 


1319 


TTJTDAA1 T7C 

lrB0U1275 


UiVl UJN A binding domain 


1 Q 1 7 Q AOf» AQ 91 A9 

iy. i / y.uye-uy zj-oz 


1 1 1 o 

1319 


TTJT3 AA 1 0^7C 

1PJdUI)1275 


iJJVl una binding domain 


1 Q 1 7 O 1 /"W 14 1 7 A 

iy.i/ y.iue-i4 ioj-i/4 


inn 
1319 


TT>T3AA1 nc 

lrB001275 


1 Ojt "PL X 1 ^_l"^V. T* A T-V-« _rj -1 ma ^-w _hJ g -> -> /--k a__ 

UM UJN A binding domain 


1Q n q oQp 1 a iqi nan 
iy.i/ y.zye-ij5 iyi-zju 


1319 


li J BU02867C 


Cysteme-ricn domain (UoliCj 


1Q /l^ C 5/1 0 AQ "5.7 AQ 

iy.4o cS.o4e~uy oz-4y 


1319 


PR00048A 


^ ^ ' _f~ TT ^^fc J ___ _____ __. ___ j — — | ___k ^ _— ^ a ^ -fc __L_ -_ _ — _T 

CzHz-t^e zmc linger signature 1 


A O/l 7 Ola 1 A 9AA 91 Q 

y.y4 z.zje-iu zw-zij 


i im 
1319 


"DT> AAA /I O A 

PRUU 04 8 A 


CZrlZ-type zinc linger signature 1 


Q OA 1 1 1 Af\ 71 

y.y4 j>.o4e-ii ov-/j 


1319 


"T>T_> plAA/1 O A 


C2H2-type zinc linger signature l 


Q QA A 15o AQ 93 1 A1 

y.y4 4.joe-uy oo-iui 


111Q 

13 19 


T)T> AAA/1 O A 


C2rlZ-t3^pc zmc tmger signature l 


Q OA *s ^A<a 1A 19 A^ 

y.y4 j.jue-iu jz-4d 


1319 


T)T> AAA/1 0 A 

PRUU 04 a A 


CZMZ-type zmc tmger signature I 


O OA S 77i= 1 A 1 79 1 52^ 

y.y4 o.//e-iu i/z-ioj 


1 1 1 A 

13 19 


T>T> AAA/1 O A 

PRUUU48A 


CZrlZ-type zmc linger signature 1 


Q QA Q 1 Qi= 1 A 1 AA 1 -^7 

y.y4 y. i cse-iu 144- 1»>/ 


inn 
1319 


T>T_» AAA/f OTU 


C2H2-type zmc tmger signature 11 


_r <ro 1 flDo AQ 117 1A1 

roz l.uue-Uo i^z-141 


111 A 

1319 


TIT> AAA A OT> 

PRUU048B 


C2H2-type zinc finger signature II 


C CO 7 CAa AA AQ 

d.dz z.Due-uy 4oo / 


111 0 




f^OT-TO mp> r7inr» "fi r. cfi^V cifmoturo TT 

v^^_n.^~iypc ZrixiL' inigci aigiiai-urc li 


^9 ^ S7e-10 160-16Q 


1319 


PR00048B 


C2H2~type zinc finger signature II 


5.52 4.00e-ll 104-113 


1319 


PR00048B 


C2H2-type zinc finger signature II 


5.52 5.50e-09 76-85 


1319 


PR00048B 


C2H2-type zinc finger signature II 


5.52 6.14e-10 188-197 


1319 


PR00048B 


C2H2-type zinc finger signature II 


5.52 7.75e-ll 216-225 


1320 


IPB002041A 


GTP-binding nuclear protein Ran family 


11.12 9. 17e-10 4-48 


1320 


PR00449A 


Transforming protein P21 ras signature I 


12,48 1.29e-14 4-25 


1321 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 6.00e-ll 71-116 
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SEQ 

TT\ 
ID 

inu; 


Database 
entry ID 


Description 


Results* 


1 11 1 

1 jZ 1 




veneDrate metaiiotnionem, iamiiy i 


13. 41 y. /ze-0y 146-191 


1 11 1 

IjZI 


LriD\J\JU f ZOO 


oiycosiae nyaroiase iamiiy iy 


j.jy z.54e-U9 66-76 


1 11 1 
IjZ 1 


TDT3 fi Afl Q£ 9 1? 


z<inc linger iNr-Al type 


21. 06 1.24e-09 95-135 


1 11 1 


ijtJduu i i oyjv 


integrm oeta, c-termmus 


T*7 AC C TO« AO XT^ AO 

27.45 5.3oe~09 56-98 


1 11 1 


IJrJtSUUl /oz/v 


jL>isintegrin 


m fx OA*^ AA ^TA 1 l~\C\ 

23.93 9.29e-09 69-109 


1 111 
IDZ 1 


TPt3fin9/tO>1 A 
lrrJUUZ4y4A 


Jveratin, nign suliur r>z protein 


12.44 1.56e-12 161-194 


1 °.9 1 

I J/1 


±J J oUUZ4y4A 


Jveratm, nign suliur Bz protein 


12.44 2.06e-09 146-179 


1 11 1 
IjZI 


lrl3UUZ4y4A 


iveratm, ingn suliur r>Z protein 


11 /I yf A A/f~ AA 1 AO 1 O £Z 

12.44 2. 94e-09 103-136 


1 ^9 1 
IjZ I 


lJ^DUUZ4y4/\ 


Jveratin, nign suliur ±>z protein 


12A4 4.00e-14 ol-114 


1 11 1 


Ir\DUUZ4y4A 


Jveratm, nign suliur dZ protein 


1 A A A A 1 A-^ AA T1 1 A4T 

lz.44 4.lye-U9 /z-105 


1 19 1 


Tt>TM\Ci1AQA A 


Jveratin, nign sunur r>z protein 


n if/1 f AZT--V AA /C7 1 AA 

IZ.44 5.Z6e-09 6/-IUU 


1 19 1 


TPRn09Z^Qd. A 
IrDUuZ^^T-A 


iveratm, nign suuur r>z protein 


1Z.44 j.3oe-lj IjO-loy 


1 191 


1 r D \J \J z*+ y 1\ 


jveratin, nign suliur r>z protein 


11 AA C C< a 1/1 71 1 A/1 

1Z.44 D.5oe-14 /1-1U4 


1 19 1 

1 Jj£ J. 


J~PVlf\(MAQA A 
IJ^I3U\JZ4y4A 


jveratin, mgn sunur JdZ protein 


1A /1/1 C Q1« AA 1 ^T/T 1 AA 

iz.44 o.3ze-0y 166- lyy 


1 191 

IJZI 


fT>Vlf\r\1AQA A 


jveratin, nign sunur tsz protein 


10 /i/i sr r/r_ n loo u/r 

1Z.44 o.Doe-iZ 1Z3-10O 


1 191 
IjZI 


TT>Hnm AQA A 
lJrJjUUZ4y4A 


Jveratin, nign sunur JdZ protein 


n /i/i "~1 1 A ~ 1A 1A/f 1 

1Z.44 /.34e-i0 124-15/ 


1 191 


TDT±f\f\1AQA A 


Jveratin, nign suliur r>z protein 


1 A /I A H OHr, AA OA 11C 

lz.44 /.jye-Uy oz-llj 


1 11 1 
IjZ 1 


lJrr>UUZ4y4A 


Jveratin, nign suliur t>z protein 


1A A A T AO ^ AA "lO/l 

12.44 7.4oe-09 151-184 


1 11 1 
IjZI 


TPT*nH9 AQA A 
LroUUZ4y4/Y 


Jveratin, nign suliur I5Z protein 


1A /) /1 ^7 CO «. A A AA 

12.44 7.5oe-0y 57-90 


1 19 1 
IjZI 


lrI5UUZ4y4A 


jveratin, nign suliur £>z protein 


12.44 7.60e-14 118-151 


1 11 1 

IjZ 1 


IJrJb>lJUZ4y4A 


Jveratin, ingn suitur Bz protein 


1 A /I /I T A^7_ AA O S' 1 1 f\ 

12.44 7.97e-09 86-119 


1 19 1 

IjZ I 


lT>Tir\r\1 AQA A 
IJr\E5UUZ4y4/Y 


iveratm, nign suliur t>z protein 


1A /i>i o n„ ia 1 io 1/t^r 

12.44 o.33e-12 113-146 


1 19 1 
IjZ 1 


Ix^J3UUZ4y4A 


Jveratin, nign suliur r>z protein 


12.44 o.35e-09 62-95 


1 19 1 
IjZI 


IJt\DUUZ4y4/Y 


iveratm, nign suliur JbJZ protein 


1 A /I /I O A1„ 1 "7 /T 1 A A 

12.44 8.97e-16 76-109 


1 19 1 
1 jZ 1 


TDT1f\f\1AQA A 
IJr r>UUZ4y4A 


Jveratin, mgn suliur iiz protein 


1A A A A 11„ AA 1/1^7 lOA 

12.44 9.13e-09 147-180 


1 19 1 
IjZ 1 


rPHAA9zlQ^l A 
lr^r>UUZ4y4A 


Jveratin, nign suliur JdZ protein 


12.44 9.14e-ll 66-99 


1 11 1 
1 jZ 1 


TPT3 A A9 A QA D 

lr^oUUZ4y4x> 


Jveratin, nign suliur dz protein 


1AC0 1 11. A A HA O /i 

10.58 1.13e-09 70-84 


1 191 
IjZ I 


TPTJA f\1A Q/1D 
I Jr JtS U UZ4 y 4J5 


iveratm, nign suliur BZ protein 


10.58 3.53e-ll 122-136 


1 19 1 
IjZ 1 


TPT5AAO/1 0/ttJ 
IJrJDUUZ4y4x3 


Keratin, high sulfur B2 protein 


10.58 4.17e-09 142-156 


1 19 1 
IjZ 1 


IJrr>UUZ4y4r> 


Jveratin, nign suliur bz protein 


10.58 4.17e-09 22-36 


1 19 1 
IjZI 


TPDAA9/1Q/1TJ 

LrJ3UUZ4y4Jj 


iveratm, nign suliur Jh5z protein 


IA CO XT C ll AA 1 O'T 1 T1 

10.58 5.56e-09 137-151 


1 191 
IjZ 1 


TPT3 A A9 A QA T3 
IJrr> U UZ4y4Jh> 


Jveratin, nign suliur J3Z protein 


1A C O ^ CO— AA OA C\ A 

10.58 6.58e-09 80-94 


1 19 1 
IjZ 1 


TPT* A A9 

Ir^r5uuz*+y4r> f 


jveratin, nign suliur dz protein 


1 A CO iC TO« 1 A AC 1 AA 

10.5o 6.7oe-10 95-109 


119 1 
IjZ 1 


Lr I3l/UZ4y4L/ 


Jveratin, mgn suitur jdZ protein 


1/1 A£Z 1 rn„ i/i A1 /^/i 

14.46 1.50e-i4 zl-64 


1 191 


TPRnA9AQ/ir t 
ijrx5uuz £ fy*+i^ 


Jveratin, nign suitur ±5Z protein 


1 /I A/Z 1 CH« AA 1 1 O 

14.46 1.59e-09 /U-I13 


1191 




jveraiin^ niga sunur dz protein 


1/1 AfZ 1 O/C^ IA ^ ylO 

14.46 l.y6e-10 6-49 


1191 


TP"R 0 09 zlOzLr* 


jveratin, nign sunur dz protein 


1/1 /1^AA1« AO O^C 70 

14.40 Z.Ule-Oy DO- /y 


1 19 1 

IjZ I 


TPRA A9/I Q/1 P 1 
lJrr>UUZ4y4U 


Jveratin, mgn suliur r>z protein 


1 A A iC A 1A„ 11 T/1 i -I 

14.46 2.20e-21 74-117 


1 19 1 
IjZI 


IJrJ3UUZ4y4L^ 


Jveratin, nign suitur r>z protein 


1/1 /I ZT A T7 „ IA AA*7 

14.46 2.27e-13 164-207 


1191 
IjZ I 


iJrl3UUZ4y4L/ 


Jveratjn, nign suitur JdZ protein 


1 /I /I /Z A C -C ^ 1 1 O /T (TA 

14.46 2.55e-ll 26-69 


1 19 1 
IjZI 


TPRA CMAQA 
IJrr5UUZ4y4L^ 


Jveratin, mgn suitur az protein 


1 /I /1/T A HA« 1/11 C A AAO 

14.46 2.70e-14 159-202 


1 19 1 
IjZI 


TPRHA9yl QA O 
lJTJt>UUZ4y4U 


Keratin, high sulfur B2 protein 


1/1 /I /T" O GZT 1 O 1/1A 

14.46 2.86e-13 149-192 


1 191 
IjZ I 


TPRAA9 J 4Qyf r* 
LrJt>UUZ4y40 


Keratin, nign suliur BZ protein 


1/1 O 1 A AA 1 1 1 1 <~f yl 

14.46 3.10e-09 131-174 


1 19 1 
IjZI 


TPT3 A C\1 A QA 
IJrJt>UUZ4y4U 


Jveratin, nign suitur Bz protein 


14.46 3.10e-09 75-118 


1 19 1 
IjZ I 


lJrJt>UUZ4y4L, 


Keratin, high sulfur B2 protein 


14.46 3.30e-14 126-169 


1321 


IPB002494C 


T^f*rrj+in Tiicrli ciilfur T-?9 r»T*r\-f*at-i-k 

111 ^11 DU11U1 IJ^r (JI VJLC1I1 


1 id il£ 4. 1 Qe> 19 S^l 1 97 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 4.36e-17 116-159 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 5.36e-ll 144-187 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 5.61e-16 79-122 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 5.79e-09 145-188 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 6.06e-12 111-154 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 6.53e-12 11-54 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 6.91e-12 89-132 
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C IT 1 

111 

1VJTV 


JJatabase 
entry id 


Description 


Results* 

i 


1 19 1 


i.r.DUUZ*+y4v-' 


jveratm, mgn suiiur r>z protein 


14.40 /.lUe-lJ 04- tU/ 


1 191 
1 JZ1 


lJrr5UUZ4y4^ 


T/^" y— % V* J"l 4" ¥] 4 Mill 4^| 4 44 U '""I A 4- /— » 1 +Oi 

Jveratin, nign suitur Jdz protein 


lyl /f^ T "5 1 « K 111 1 CA 

14.40 /.jle-iD 121-164 


1 19 1 
1 JZ1 


TPRnn.9 AQ/1 

i"ouuz^fy z tv.. 


iveratin, nign suitur jdz protein 


1/1 7 oo„ 11 1 /c a mo 
14.4D /.oze-11 ioy-zlz 


1 191 


1 Jr Jt> U UZ*+y *f 


iveiann, nigri suitur rsz protein 


1/1 /i "7 c^^i io en im 
14.40 /.oJe-1 J jy-iuz 


1 19 1 
1 jZ 1 


iJrJt$uuz4y4i^ 


iveratin, nign suiiur r>z protein 


1 A A£L 1 flfln 1 A 1 ICS 1 Ol 

i4.4o /.yye-iu ijy-ioz 


1 19 1 
IJZ 1 


TPTinn9/i 0/1/^ 


iveratin, nign suitur hz protein 


1 A AiZ O H £. ~ 1 O 1 CA 1 (T7 

14.40 s./Oe-lo lj4-iy/ 


1 191 


TPP.^^9ilQi4^ , 

IJtxj U UZH-y ^ 


iveratin 5 nign suitur r>z protein 


1/1 /1/C o r\n ^ no 0*7 ta 

14.40 y.u /e-uy z /- /u 


1 19 1 
1 JZ1 


TPnnn9zioztr^ 
lr.DUUZ4y4l^ 


iveratm, nign suitur r>z protein 


1/1 /l/CA/f/1^ 11 "3 1 ^7/1 

14.40 9.44e-lz 31-74 


1 19 1 


TP 15^/19/10/40 
IJr JjUUZ4y40 


iveratin, nign suitur r>z protein 


1 /l AC A 1 O £T A 110 

14.40 y.00e-lo 09-1 lz 


1 191 
IjZ 1 


LrD U UZ4y 4 


i iveratin, nign suitur Jt>z protein 


1 /f /1/C A TO/, 111 A/C 1 /in 

14.40 y. /je~l 1 luO-149 


1 19 1 
IjZ 1 


TP"RAA9Q1 Q A 

irouuzy i y/v 


trypsin inniDitor-iiKe cysteine ncn domain 


Ij.jO O.j /e-Uy Z4-J0 


1 199 


TPTJAAA1 1 CP 


oranuim 


T A/1 1 in« 1 A OA C~l 

/.y4 l.jOe-lu zy-0 / 


1 197 


IJrrsUUU 1 1 <$V-J 


oranuun 


IOIOO OCft AA 1 O C€L 

12. lo 5.oje-Uy lo-Oo 


1 197 
IJZ / 


TPRfifin9ni a 


vjjro aoniain 


1 O Af\ A riA /< A "1 A 

lo.4u 4.0je-U9 44-74 


1 197 
IJZ / 


TP"Rnnn9ni A 


uI l) aomain 


i 0 Af\ c C7« fin o/t 1 i/T 

io.4U j.j /e-uy so- 110 


1 190 

i jzy 


ir jjuuuuuo 


verreDrate metai lotnionein, iamny i 


10 /II 1 *C /I „ AA CO 1 AO 

1J.41 1 .o4e-uy jo- 1(Jj 


1 190 


lJr.t>UUUUUO 


veneDraie metauotnionem, iamny i 


11/119 7A Q 1 1 C/1 AO 

1J.41 z. /ue-11 J4-yy 


1 19Q 


li X3UUUUUO 


verteDraie meiauotnionein, iamny i 


11/11 1 ooa no oa 1 1 c 

ij.41 j.jye-uy /u-iij 


1 190 

UZy 


1JTI5UUUUUO 


verteDrate metauotnionem, iamny i 


19/11 9 ££a f\Ci 1 OA 

ij,4i j.ooe-uy /j-izu 


1 190 
1 jzy 




verxeDraie metauotnionem, iamny i 


11/11 /I 07a 1 n C 1 A^ 

1 j. 41 4.0 /e-iu j i-yo 


1 190 

uzy 


1.T I3UUUUUO 


v erteorate metauotnionem, iamny 1 


19/11 C ?Qo AQ CO 1 AO 

ij.41 j.jye-uy j/-iuz 


1 19Q 

i jzy 


lr\DUUUUU0 


verteorate metauotnionem, tamuy I 


11/11 /Z AA« 1 1 CA AC 

ij.41 o.uue-ii ju-yj 


1 190 

uzy 


IrMoUUUUUO 


verteDrate metauotnionem, tamuy i 


11/11 /Z /IO/-. AA C 1 1 A 

ij.41 o.4ze-uy oj-iiu 


1 190 

uzy 


Lr.fc$UUUUUo 


verteDrate metauotnionem, tamiiy 1 


11/11 0 no= AA /I A 0 c 

Ij.41 7.yoe-0y 4U-oj 


1 190 

uzy 


l.rr>UUUUU0 


verteorate metaiiotnionein, iamny 1 


11/11 O AOo AA CC 1 n A 

ij.41 o.u /e-uy jj- iuu 


1 19Q 

i jzy 




verteDrate metauotnionem, tamiiy l 


11 /II O /10„ 11 /I O AO 

Ij.41 o.4oe-lj 4/-yz 


1 190 


lJr.t>UUUUUO 


verteDrate metauotnionem, tamiiy l 


11/11 OA/1^. 100/1 11A 

Ij.41 o.y4e-lU /4-liy 


1 190 

1 JZj7 




v^enuiose-Dinaing aomam, iungai type 


1 Q 1 1 O AAo AA /II Ol 

lo. 11 /.Uue-uy 4j- /j 


1 19Q 
uzy 


IJr 13UUUJ04A 


zre-zo rerreaoxin 


10 11 O Of\ft AO 1 1 A 

1 /. j 1 /.oue-uy i-iy 


1 190 

uzy 


TPRnnnsi 7 a 


rrion protein 


O 1 A 1 Of „ A A 11 CC 

o.j4 i.oje-oy Ij-jj 


1 190 


TPRnnnsi 7 A 

IrDUUUo 1 / J\ 


Prion nrntmn 

jriion protein 


O 1/1 /I 7Qo AO O C1 

0.J4 4. /ye-uy y-j i 


1 190 

uzy 


lPRnnnRi 7 a 
lr\t5UUUol /J\ 


r non protein 


0 i/i o7„ no 10 c/i 
o.j4 O.o /e-uy lz-54 


1 190 

uzy 


TPT^nnnc 1 7 a 

IJrrSUUUol /A 


Jrnon protein 


Ol/I O^T/1^. 11 AO 

o.j4 o.04e-ll 0-4o 


1 190 

uzy 


TPPifinna 1 7 a 

IJtxdUUUo I / /\ 


Jrnon protein 


O n A A A 1 ^ 1 A 1 A CO 

o.j4 y.ule-lu lU-jz 


1 19Q 

uzy 


IJr jtJUUUyo ID 


Z/inc linger jnjf-ai type 


1 A A O O 77-, AO OC 1 1 A 

iU.4z /. //e-Uy /j-lll) 


1329 


IPB001169K 


Integrin beta, C-terminus 


27.45 3.19e-09 57-99 


1 19Q 

uzy 




lntegrm beta, C-termmus 


OT /( r f] nr _ AA ^70 11/1 

27.45 8.05e-09 72-114 


1329 


IPB001169K 


Integrin beta, C-terminus 


27.45 9.15e-09 16-58 


1 lOQ 

uzy 


TT>r>Ani n 1 
Irr$UUlz/l 


Mammalian deiensin 


19.97 3.29e-10 66-94 


uzy 


TDDf\r\ 1 OO 1 

IrrsUU 12/1 


JVlammalian deiensm 


19.97 4.44e-09 73-101 


1 19Q 

uzy 


TPTlA.ni 001 

lt\t$UUlZ / 1 


Mammalian defensin 


19.97 5.42e-09 82-1 10 


i jzy 


IJrJJAJUlZ / 1 


Mammalian defensin 


1 A A*7 '"7 O C 1 A A A T~\ 

19.97 7.35e-10 44-72 


1 19Q 

uzy 


TDR^m /1/19 A 

lrjt>UU144zA 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 2.10e-09 4-56 


1 19Q 

uzy ! 


TPRnm A A 9 A 
IrrJUU 144ZA 


C-terminal tandem repeated domain m type 


f- -t r*. i~i OA— AA 1 iT 0 

26.12 2.80e-09 1-53 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 2.5Se-09 51-82 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 4.38e-09 34-65 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 4.97e~ll 2-33 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 5.28e-09 6-37 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 6.35e-10 26-57 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 7.62e-ll 30-61 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 9.27e-10 18-49 
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SEQ 

ID 


Database 
entry id 


uescription 




1 ^90 
i ozy 


TPR009 174A 

XX xjUUZV 1 / 'r.rY 


J. Ill III IlIYw vjr JlbJuw llwJl Ivglvll 


30.51 9.27e-10 22-53 


1 ^9Q 


TPR0H999 1 R 

xx JjUUZZZ 1 Jj 


WAP-tvnp TWliPv AriHic Prnteinl fbur- 
disulfide core domain 

UloUlliUw uUlLf UU 111 CI 111 


17 12 1 00e-08 63-84 


1 ^90 


TPRA094Q4A 


TCnratin hiprVi sulfur R 2 t)T*otein 


12.44 1.39e-09 44-77 


1 19Q 


TPRn09<lQ4A 


TCpratin hicrh «?ii1fnrR9 rvrotein 


12.44 3.23e-09 34-67 


1 39Q 


TPR0094Q4A 


TTpratin hicrlT Qiilfnr R9 rvrntein 


12 44 7.75e-10 85-118 


1329 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 8.67e-10 47-80 


i ozy 




rvClaLlll, iligii ouixui iJ^ piuLciii 


12 44 9 03e-09 40-73 


i ozy 


iFiDUUZ^y^t-Vy 


T£"f»t*ij-f in T-i-J nrVi oii1"fiirR9 "nfotpin 


14 46 2 31e-10 60-103 


1 ^90 


TPRnn9<k^i p 

IrDUUZJJ 1JT 


Prtt*rtna\/iTiio Q 1 crl \/c cwwdff^i n 

V_^UI Uild Vll US) iJ 1 gl J^IJIJI VJUClll 


13 91 7 68e-09 79-120 


1 T9Q 

1 j.£y 


xx xjUVJZ»OU / Ix 


^ y a LC<iiit< I iv^i J KX\jinaiix ^wiiv^y 


24 88 6 49e-09 53-84 


1 19Q 
i jzy 


TPR0n9Q1 OA 


Tr\/n<2in TnlriHitor-lilrf* fvstftinfi rich domain 


15 56 3 57e-09 67-79 


1 ^90 


rlvUUoJOD 


PnictQr*f=»nn mptnll Athinnpin ^ipnptnrR IT 

UblClL'Ca.lI llICLallwLIliUIl^JtXI algllfllUlv XX 


5 93 1 48e-09 55-73 


1 ^90 


PP0HR79 A 
riwv o / z/\ 


rVntfrn ^l"jro«;o"nV»i1n^ mpf pilot Viinnein 


5 17 7 41e-09 99-111 


1 ^9Q 


PP01 1 1 7 A 


PT C^-ft rhlnridp nhannpl sienature T 


7 79 9.47e-09 66-78 




PP fi 1 1 ^ 1 R 


Pnrifipviii "Xfx ^Py^^ Gitmnturp 1 [ 

V^UIillCAlll JU ^A.J \j j bigllcttuiv AX 


3 45 5 12e-09 42-54 


1 ^9Q 


PP01 998C 1 


"P crorcTi fA 1 nrntpi QionntTirp TTI 

xiggaiicii JJ>1 \JLG1X1 OX^IXCILLIX 1XX 


5 69 1 22e-10 17-32 


1 ^9Q 

i ozy 


PP01 99£r 1 


T-? ofcroTi f*l 1 nrntpin cionntnrf* TTI 

JJrggoJXdl [JUJICIII Otg,Hu.LUIU 1XA 


5 69 1 22e-10 21-36 


1 ^9Q 


PP01 99^^ 

x XVU 1 ZZO'U 


P ctctcTi p*1 1 nrrxtf^i n cionntnrp TTT 


5 69 1 22e-10 25-40 


1 ^9Q 

i jzy 


PP01 99RP 


P ctctcIt r»1 1 rviwifM n Qiormtiirp TTT 

U/ggJilV-'ll LJXVJltsllI oXgXXdLLI.lt/ XXX 


5 69 1 22e-10 29-44 


1 ^9Q 




Pcrcrcrip'll nrntpin ^iPTiature TTT 

X_/ggDlXwXX L/lV/lVlll OXgllCXLUlX^ XXX 


5 69 1.22e-10 33-48 


1 ^9Q 


ppftl 99RP 

XT XvU 1 ZZrOV^ 


PcrcrcVipll r>rntpin «?ionatnrR TFT 

XJ/ggol It'll OlgllClLU.1 t/ XXX 


5 69 1 22e-10 37-52 


1 ^9Q 

i j^x 


PRO! 99RC 1 


Ppfp-cripll nrotein ^ionature T! I 


5.69 2.34e-09 41-56 


1 ^9Q 

l jzy 


PP0 1 998P 
X XVU I z»z^o v_- 


porp-cripll rvrotpin sipnature TI 1 

Xwg,g01Xwrll JJIWIAa'IH OlgllCUtUl V 1.XA 


5.69 5.86e-09 18-33 


1 ^9Q 

j. jzy 


PR0199Sr 


poftrcripll nrotpiti ^ionature Til 


5 69 5.86e-09 22-37 


1 ^9Q 


PP01 998P 


Pcrcroripll r>rr>tpin ciifmatiire Tfl 


5.69 5.86e-09 26-41 


1 "39Q 
i jzy 


PP01 99RP 


P ixcrcTi p1 1 nrfitpi n <;i on at? ire TTI 

X-/ tiiiOliL/l 1 Ul Vlvlll OlgllOLUl V XXX 


5 69 5.86e-09 30-45 


1 19Q 

i jzy 


PP01 99RP 


Pcrorclipll "nrritpin ^icmafiirp Tfl 

X->^gollCXl JJlVJLt/lll OXglXClLlXX *^ XXX 


5 69 5 86e-09 34-49 


1 ^9Q 

i jzy 


PPfH 99RP 


PctctcjVipII ■nrntpin «iionaf*nrp Til 


5 69 5 86e-09 38-53 


1 ^9Q 


PP01 99RP 


pCTcrciTipll nrntpin ^ipnaturs Til 

X_>ggOXXvyXl piUlvlll OXgllCXLlXl V/ XXX 


5.69 6.48e-09 13-28 


IjjU 


TPR00099 1 


Prfitaminp P1 

X X VrLvUXXllXt/ X X 


5.48 1.00e-08 76-102 


IjjU 


TPRfinn99i 


PrfitnmirtP P1 

X I vJLClllllXiv/ X X 


5 48 1.00e-09 83-109 

» | X^ A ft) XX XX ^X >X «X ^X ^X W iX^ 


1 J JU 


TPR00099 1 


Prfitam i n p P 1 

X X VJLCUlXlll^i' X X 


5.48 1.46e-09 65-91 


iJJu 


TPR00099 1 


Prfitaminp P1 

X XVLcXXXXXXXw X X 


5.48 2.19e-ll 103-129 


1 

IjjU 


TPR00099 1 


Prntaminp PI 

X X UlClXlXXXIVs X X 


5.48 2.59e-ll 64-90 


1 ^f) 
IjjU 


TPR000991 

Xxr DUUUZZ 1 


Prrrfnminp PI 

X 1 U Lull 11 111^ X 1 


5 48 2.73e-10 118-144 


1 

IjjU 


TPR00099 1 


Protamine PI 

X X \JlCllXXXXlVx X 1 


5.48 2.97e- 12 74-100 


1 

IjjU 


TPR00099 1 

JUT DUUUZZ 1. 


Protnminp P1 

X 1 UlClllXXXXt/ X X 


5,48 3.31e-09 109-135 


1 

IjjU 


TPROnn99 1 


Pmtnminp P1 
x L Ulctllllllt/ X I 


5 48 3.31e-09 122-148 

p 1 XX »X • ftaX -*V- *X \X -X. Aml+m* X. p XX 


IjjU 


TPRfiflfi99 1 
Ix^jjUUUZZ I 


PrntfimiTiP PI 


5 48 3.91e-ll 78-104 

^X * f LV ■> !X^ A ^X X. -X- # V -A NX ft 


IjjU 


TP"RPlfifi99 1 
xx^xjUUUZZ 1 


Protaminp PI 
X 1 UlctllllUC x X 


5 48 4 70e-10 62-88 

m 1 XX m * I XX XX XX X^ XX 


IjjU 


IrOUUUZZ I 


PrntatniriP P1 
x lUlcllllxllC x x 


5 48 4 70e-10 94-120 

■«X • 1 v^/ • ■ / XX XX J- XX X 1 X \J . 


IjjU 


TPRnnn99 1 


Protpminp P1 

X 1 VJLtlllllllV-' X X 


5 48 5 15e-09 107-133 

^X hf J fcX a XL *X w* X. XX X X- * — ' 


IjjU 


TPRfMin99 1 


Protamine PI 

X 1 WlOXIXXXXw X X 


5.48 5.27e-09 52-78 i 

^X # 1 ^X ft mmW 9 %X #W # ^X 


1330 


IPB000221 

A. X * Xx \- Vx -»- 


Protamine P 1 


5.48 6.16e-ll 92-118 


1330 


IPB000221 


Protamine PI 


5.48 6.19e-09 116-142 


1330 


IPB000221 


Protamine PI 


5.48 6.43e-ll 99-125 


1330 


IPB000221 


Protamine PI 


5.48 6.88e-09 111-137 


1330 


IPB000221 


Protamine PI 


5.48 6.88e-09 97-123 


1330 


IPB000221 


Protamine P 1 


5.48 7.62e-ll 60-86 


1330 


IPB000221 


Protamine PI 


5.48 8.89e-10 101-127 


1330 


IPB000221 


Protamine PI 


5.48 9.30e-12 63-89 
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SEQ 

ID 

INO: 


Database 
entry ID 


Description 


lvcaUllo 


I DOKJ 


lr oUUUZ / 1 


JKJDOSOITld.i prOlCin IjJ^t 


15 87 9 78e-09 111-148 


toon 
1 DDV 


lrr>UUU4yzr> 


rrotarnine z vr^Kivizj 


^ 26 3 84e-09 75-109 


LDD\J 


TT>r> AH A/t OOD 

lrr>UUU4yzr> 


prAfimind 9 / , PP'^/^7^ 
i^roiarnine z 


5 26 5 88e-l 1 98-132 ' 




TPRAAAAQ9R 


Prntaminp 9 rPPlVI^ 


5 26 6 38c-09 94-128 




TPT300fiAQ9T1 
lrr5UUU4yZE5 


Prrvf-arninp 9 /'PR M9^ 


5 26 6 67e-09 107-141 i 


lo JU 


lr DUULW-yzri 


Prntdminp 9 /PPfVf9^ 


5 26 6 97e-10 103-137 ! 


LddKj 


tpt* a no4Q 9 n 


Prntaminp 9 fPUM?^ 


5 26 7 75e-09 77-111 




TP H A A A/1Q 9 TI 


rrOXallllliC Z ^JrxvIVLZ,J 


5 26 8 12e-10 106-140 


i iin 


TP '0000/10,913 


x rOlallllllC Z ^JTJtviVlZ^ 


5 26 8 34e-09 65-99 


1 


TP"R0004Q9 19 
ixi5VJUUH , yzi5 


Prntnmitip 9 rPPA/T9 i 


5 26 8 53e-10 105-139 


1330 


IPB000492B 


Protamine 2 (PRM2) 


5.26 9.06e-10 78-112 


IOTA 

133U 


Tnn A f\ A / G1 X5 


Dfrttomino 9 /PPA/f9\ 

r roiamme z ^r jvlviz^ 


S 96 9 1 102-136 


IOTA 

133U 


TDD AAA/1 OOTU 


rroLarnme z ^r^ivivizy 


S 96 9 35e-1 1 79-1 13 


1330 


IPB000492B 


Protamine 2 (PRM2) 


5.26 9.69e-10 100-134 


1330 


Irr300049zr> 


Frotamme z (rKjvuj 


^ 96 0 QOp-OQ 1 1 1 -1 45 


1330 


IPB003134F 


Repeat in HSl/Cortactin 


15.66 7.21e-09 183-231 


1330 


PR00055C 


T_TT"\ T T A T 1 ^nmni-n rM'rmnlm<a TTT 

111 V l A I domain signature ill 




1331 


IPB003006B 


Immunoglobulin and major 
mstocompatibuitv complex domain 


20.23 2.80e-10 54-91 


1332 


IPB002223 


Pancreatic trypsin inhibitor (Kunitz) family 


19 9 9^ 09 19A 

i / .oo z.oie-/j yz-izo 


1332 


PR00759A 


Basic protease (Jvumtz-type) lnnibitor 
family signature I 


14 GJ1 ^ 91#=» 10 7^ R7 

1^4-. o / j./ie-iu / j-o / 


1332 


Tin AA^ron 

PR00759B 


Hasic protease (Jvumtz-typej mniDitor 
tamily signature 11 


19^^ 1 9Rp 10 101-111 
iz,oZ) i.zcse-iu lui-iri 


1332 


rR0U759C 


rJasic protease ^J\.unitz-type^ mniDitor 
ramiiy signature 111 


19 4.^ 9 94e-1 1 1 1 1-196 


1333 


TPT3AnAQO0 
lrr>UUUoZZ 


z^inc linger, v_.z,riz type 


14 67 1 75e-19 201-226 


1333 


n>T3fiOOQ99 


"7i-r»<- -fiticr^r <^9 T-T9 fvnp 


14 67 1 75e-20 341-366 


1333 


TP19000S99 


Trno fin (jpr ^9^19 tvnp 


14 67 2 13e-15 369-394 


1333 


TPT3000899 


ZjinL linger, v^^iix, type 


14 67 5 50e-20 229-254 


1333 


iPT9onos99 

lJrlJUUUoZZ 


7inr fincrp>r (~^9 T-T9 tvnp 
z_>iiic iiiigei, vza/i type 


14 67 5 91e-21 285-310 


1333 


TPTinnn£99 


ZjIuc linger, \^£*r\jL, type 


14 67 7 00e-20 313-338 


1333 


TPnOOOS99 
lr JdUUUoZZ 


ZjIiil linger, v^^nz ^ypt' 


14 67 9 10e-22 257-282 


1333 


TPT3AA1 99^ 
IJrriUUlZ fj 


T^\/f T^TvT A KitininfT nAfnom 
U1V1 J_^lN/v Dllltllllg LHJIIldlll 


19 17 1 29e-13 245-284 


1333 


TPROOI 11 *\ 
lrxSUUlZ IJ 


T^N/T T^TsI A 111 W t~\ 1 O HrtTTICIITl 

Ulyl L^L\s\. UlllUlil^ LlUlIlctlii 


19 17 2 34e-12 329-368 


1333 


TPT9001 99<I 
IrrSUUlZ I j 


T~^\/f T"YM A V^i'tii^iTi it /^r\TYiQin 
JL/LVr -L'lN jTV Ulllvililg LiUIlldlll 


19 17 2 95e-10 217-256 


1333 


TPI3AA1 9"7^ 


Uivr rjrNjrv DinQirig cioiiidiii 


19 17 3 51e-09 357-396 


1 m 
1333 


TPRAA1 99^ 
lr JjUUIZ/3 


T^\/T T^"NJ A T-\i ti i~\ i tn cr /^Amnin 


19 17 7 00e-13 273-312 


117'} 

1333 


TPT3AA1 17^ 
lrrSUUlZ /D 


TlTV/T T~1^J A V\i nrlinfT atyi oiri 
JL^ivl l^lN-r\. UnHllIlg LMJllld.111 


19 17 7 15e-14 301-340 


1 999 

1333 


TPT3O01 QAO 

lr jduu iyuy 


JMvflD UVJA 


17 37 9 10e-31 8-42 


1 119 

1333 




f~ 1 9T-T9-t\/np 7111^ tincrpt* cicrncstTiTP 1 


9 94 1 00e-13 198-211 


1 ill 
1333 


PP OOA4R A 


^9T-T9-t\/np *71tip fincfpr ci cm si 1 1 rp 1 


9 94 1 64e-13 254-267 


1 111 

1333 


PP AAA/1 Q A 


/^*9T-T9_t\mp 7inA tincrpT cicrtidtiifp T 


9 94 1 64e-13 282-295 


1333 


TJP AAA/f O A 


v^znz-rype zinc linger aignarurc i 


9 Q4 6 21e-l 1 226-239 


1333 


PP f\f\f\A Q A 


v^z,rTZ- type ZiiiiK/ iiiigci bigiidLuic 1 


9 94 6 79e-13 366-379 


1 999 


PP 00048 A 


(~^9H2-tvne zinc finder signature 1 


9.94 9.36e-13 310-323 


1333 


PR00048A 


C2H2-type zinc finger signature I 


9.94 9.53e-ll 338-351 


1333 


PR00048B 


C2H2-type zinc finger signature II 


5.52 1.00e-09 270-279 


1333 


PR00048B 


C2H2-type zinc finger signature II 


5.52 2.29e-10 326-335 


1333 


PR00048B 


C2H2-type zinc fmger signature II 


5.52 2.50e-ll 242-251 


1333 


PR00048B 


C2H2-type zinc finger signature II 


5.52 7.00e-09 298-307 


1333 


PR00048B 


C2H2-type zinc finger signature II 


5.52 7.00e-12 354-363 


1333 


PR00048B 


C2H2-type zinc finger signature II 


5.52 7.50e-09 214-223 



J 



I 
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Nil- 


XrdldDdac 
Dtifnr 111 

eiiiry iu 


PCPfintl ATI 

UcHI ipilUll 


Results* 


13^6 


PR00806B 


Vinculin signature II i 


4.27 9.04e-09 333-346 




PR00806B 


Vinculin signature 1J 


4.27 9.64e-09 332-345 




P"Rfi1 917G 


Proline rich extensin signature VI! 


4.02 7.47e-09 399-424 


1337 


PR00806B 


Vinculin signature II 


4.27 9.04e-09 333-346 


1 '31'7 
Ijj / 




A/inrMiKfi cional'iirf* Tf 

V 1111/ UUll blgllu-LUl C XI 


4 27 9.64e-09 332-345 


1337 


PR01217G 


Proline rich extensin signature VII 


4.02 7.47e-09 399-424 


1 JJo 




l^y-O/U x rVJv QUI 11 dill 


13 24 8 33e-09 7-22 




pp Al 09/: a 


nxp all Mil blgllalUl C 1 


5 84 6 1 le-09 259-273 


1339 


IPB000729D 


PMP-22/EMP/MP20 family 


18.96 7.30e-10 182-209 


134U 


TD T5 A AT* 


PAvfP 00/TnA/fP /ft/TP 90 -Famib/ 

Jrivix^-zz/x^ivix^/ivix^zu iairiiiy 


1 8 96 7 30e-10 173-200 


1 1 /t i 
1J41 


TD"D AA1'2Q'"7Ti 


lVllLOUnOUtirid.1 lllipUlL 11111C1 lllCIilUiaiiG 
trnnclnrncp cnbiinit T'lm 1 7 

Ll allolUL/CtoC/ oUL'Uilll 1 1111 X / S 


21 18 9 79e-10 60-105 


1343 


IPB000353B 


Class II histocompatibility antigen, beta 

r\\ n|n \\ rvf o _ 1 (\ c\\y\ CI "in 
L-lltilllj OClar 1 UUlildlli 


19.16 4.94e-16 187-236 


1343 


IPB001003B 


MHC Class II, alpha chain, alpha- 1 domain 


14.72 9.90e-10 195-238 


10/10 

1343 


TD"DAACIAA£ A 


immunogiODUim anu major 
histocompatibility complex domain 


17 51 8 50e-16 208-230 


1344 


TDT5 A AO A QOXT 


iviajor vauix xrotein repeal 


] 2 73 9 47e-09 220-273 


1345 


PR00462B 


Fungal lignin peroxidase family signature II 


14.83 4.74e-09 34-52 


1347 


1PB0(J3UUoB 


immunogioDuiin ana major 

niSlOCOmpaXlDllliy CviiipiCA. ciuiiiani 


90 71 ^ 77e-09 202-239 


134/ 


T'D'DAA'2 AA£I3 

lJrr>UUoUUOr> 


immunogiODUim anu niajui 

liicf rmrvrrmsit'iVn litv f omnlpv dArnain 

llioLUl'Ulll}jaLlLrllliy UUlliaui 


20 23 8 83e-ll 306-343 


1347 


PR01236A 


Tumour necrosis factor beta (lymphotoxin- 

•al-nVin^ cionaturp T 
aipiia^ olgllctLLtlC X 


4.92 5.60e-09 3-19 


lJ>4o 




TmTn i in r\o~1 r^il^i t1 in nnrl tnfiior 
llillllUHL'glVJUU.llll ailU LllajyJL 

Vii<5tArAmnatir»ilitv comnlex domain 


20 23 1 36e-10 403-440 


1350 


PR01218B 


Pistil-specific extensin-like signature II 


8.47 4.88e-09 115-138 




r>o AAA1 Q A 


l^CUt/lIlC"IICIl icpcd-l OlgllClLtli W 1 


11 72 7 33e-10 193-206 


1356 


IPB002896F 


Herpesvirus glycoprotein D 


21.08 9.22e-09 327-362 


1356 


lPt5UU3UUors 


immunogiODUim ana major 

niSlOUOITipaLllJllHy t/\JiiipicA. uuiiiani 


90 9^5 1 5e-09 276-31 3 


i q ^7 


tpraaoi a 


i^n^6 fnmilv 


36 08 5 26e-39 258-31 1 


13j / 


TPTJAAO 1 ^Q"R 
LrD\jSJZlDyr> 


y^LJjD iamuy 


29 25 5 13e-34 334-379 


1 Q 




iinniuiiUgioLiuiiii aiiu niajui 
TiiQtofnnrnati Hilitv conrolex domain 

lUOlvvUlllL/utll/llli> Y Wlliuiwy\ uwiuuiii 


17 51 1 00e-10 160-182 

X / ■ *V 1 X » V/ V/ W XV/ X \S\J X VJlrf 




TpRnnino6R 


TmTYmn Aal aHiiIiti and maior 

±111111 LHlv>iilU U til 111 Clll\-i iiicijvi 

Vii<jtACAmr>fltir>ilitv comolex domain 

llloHJv/vJlllJJai.liyAlll'jf wiiiyiv/A uuiiiuiii 


20 23 6.09e-ll 206-243 


1 jUj 




PaViantin Qipnature }CI 


2.19 8.43e-09 873-895 


1 366 


TPB000885A 


Fibrillar collaeren C-terminal domain 


11.46 1.00e-15 200-237 


1 166 


±X JU>VVJV/0 0 J-rt. 


Fibrillar roll a fen C-terminal domain 


11.46 1.20e-10 167-204 


1166 


XX DvUwOOJA 


Fibrillar pAlla^en C-terminal domain 


11.46 1.25e-16 191-228 


1 166 


TPROOORR'SA 
xx JLJV/Uuoojri. 


Fibrillar rolla^en C-terminal domain 


11.46 2.00e-13 203-240 


1166 


TPFtflf)0RR5A 

IX DUUUOOJ/l 


Fibrillar collagen C-terminal domain 


11.46 2.20e-20 179-216 


1 166 

1JUU 


TPR000885 A 


Fibrillar cnllaeen C-terminal domain 


11.46 2.80e-18 176-213 




TPR000885A 


Fibrillar collaeen C-terminal domain 


11.46 4.29e-19 182-219 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 6.58e-10 209-246 


1366 


IPBO0O885A 


Fibrillar collagen C-terminal domain 


11.46 7.15e-09 164-201 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 7.85e-16 173-210 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 8.20e-20 185-222 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 8.99e-ll 206-243 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 9.03e-15 197-234 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 9.11e-13 170-207 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 9.37e-16 194-231 



* 
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SEQ 
ID 

TVTV\. 

JNO: 


Database 
entry ID 


I^/\f1 AWin^*1 Atl 

uescripuon 


Results* 


1 300 


TPtinnnRR^ A 

lr £>UUUo03rV 


Fibrillar rnllacren C-terminal domain 


11.46 9.71e-19 188-225 


1 ioo 


TPP2AAftRR*?R 

lr jjUUUoo 3D 


FiHrillnr onllap-pn (~!-terminal domain 


19.15 1.12e-19 174-227 


1300 


TPRAAftRR^T} 


FiKrillar rnllap^en C-terminal domain 


19.15 2.73e-20 186-239 


1.500 


1 IT 15 U U v o O J JD 


Fibrillar rollapren C-terminal domain 


19.15 3.59e-17 171-224 


1300 


TPRftftftRR^R 


Fibrillar collaeen C-terminal domain 


19.15 4,5Se-16 189-242 


1300 


TPRnnnRR^"R 

lr DUUUoO JJJ 


Fibrillar rollapen C-ferminal domain 


19.15 5.11e»10 153-206 


1 300 


TPROAORR^Fl 


Fibrillar collagen C-terminal domain 


19.15 5.31e-12 159-212 


1300 


TPRnnnss^Pi 

lr XjUUUooJIj 


Fibrillar rnllapren C-terminal domain 


19.15 5.77e-16 192-245 


1300 


TPPiftftftRR^R 


Fibrillar rollacrpn C-terminal domain 

A ILMlllCli V/vii(lgvll V-/ lv/1 lllllicvi uviiiuiii 


19.15 5.98e-15 162-215 


1 300 


TPTinnnRR^Pi 

lr JjUUUoo 


Fibrillar nnllaffen C-terminal domain 


19.15 6.06e-14 165-218 


1300 


TPTinnnRR 

lr J3UUUOo3Jj 


Fibrillar rnllafrpn C-teiminal domain 


19.15 6.18e-20 177-230 


1300 


TPT3nnriRR*k"R 
lr\DUUUo5Dr> 


Fibrillar pnllacr^n C-terminal domain 

r iULIUCLl LUllagvll lv^i liiiiicxi vivniict-iii 


19.15 6.46e-14 195-248 


1300 




Fibrillar mllacren C-terminal domain 

J/J. Lrl 1 1 1 dl w UilagVll l.v-1 iiiiiitii uuiiiuni 


19.15 6.66e-10 201-254 


1360 


1 Jr jj U UU o o D r> 


Fibrillar pollacren C-terminal domain 

r^lU/llildl \r* 1 1 1 flr.V 1 ' ^— LV/1 111111C11 UV/111U111 


19.15 6.87e-13 198-251 


1300 


TPT^nnnRR^R 


Fibrillar pollacTpn C-terminal domain 

J/1UI11ICH VvlllCl^V'll \s 111111C11 UV/illUlii 


19.15 7.36e-19 168-221 


1-500 ! 


rPRftftftRR^P; 


Fibrillar rnllaapn C-terminal domain 

X LLJ± llltll v/vJIXd.&t'll v^" l\jL lllll lul u\/iiiwiii 


19.15 7.55e-13 156-209 


1300 


TPPiftftftRR^'R 


Fibrillar rnlla^en C-terminal domain 

r ILfllllCU L/V/llClSi,V/ll V-> LvlllllllUi v*vx*ifc*m 


19.15 8.78e-12 150-203 


1300 


TPRftftftRRSR 


Fibrillar rnllapen C-terminal domain 

IUL 111CU v» v'llCl/i^V'ii uviiiinivn wviiiuiii 


19.15 9.16e-24 180-233 


1300 


TPRftftftRRS"R 


Fibrillar rnllao'en C-terminal domain 

J7 IUL illAi V'V/lltl^V'll V^ UV/lllAJLllCll UV111U111 


19.15 9.60e-22 183-236 


1300 


TPTJAftl (YTX A 
IJtjdUUIU / 3/\ 


i^rtmnlf^TYif^nt Pin nrntpin 


22.14 1.00e-13 202-236 


1300 


TP"Rftft1 0/7^ A 
IJtXjUU 1 U / 3/V 


Pnmnlpmpnt 1 n nrntein 

v^-UIlJ.piCll italic v>> l vj lyivji-viii 


22.14 1.46e-16 193-227 


1300 


TPTiftftl fi71 A 
IxMjUU 1 U / 3/V 


Pntnnlpmpnt C1 n rirotein 


22.14 2.76e-15 196-230 


1300 


TPTJftftl 0/7*3 A 


PAmnlpmpnt f^l n rirntpin 


22.14 3.77e-16 190-224 


1300 


TPRAA1 ft7^ A 
lr J3UU 1 U / 


Pnmnlpmpnt C1 n rjrntein 


22.14 3.90e-10 208-242 


1300 


TPRftftl (YT\ A 
lr J3UU 1 U / J^rV 


Pnmnlpmpnt CI n nrotein 


22.14 4.69e-16 175-209 


1300 


TPT3ftft1 (YTX A 


Pnmnlpmpnt CI n nrntfiin 


22.14 5.83e-15 178-212 


1300 


TP"Rftft1 ft7^ A 


i^rkTirnlpmpnt C1 n nrotein 


22.14 6.18e-19 184-218 


1300 


TPRftft! 071 A 


Cnmnlpmpnt CI a nrotein 

V^-vJlll JJiv/lllv/iii. v^>- 1 vj piwiviii 


22.14 7.51e-13 181-215 


1366 


IPB001073A 


Complement Clq protein 


22.14 8.02e-15 205-239 


1300 


TPT2ftft1 CYTX A 
lr jt5UU 1 VJ / jA 


PAmnlpmpnt P 1 n rirntein 


22.14 8.40e-14 199-233 


1366 


IPB001073A 


Complement Clq protein 


22.14 8.78e-10 172-206 


1366 


TTi"D AH1 ATJ A 

IrrJUUlU /3A 


V^OmpieiTlCIlL V>1L[ piULClil 


22 14 8.85e-13 187-221 


1366 


IPB001073B 


Complement Clq protein 


20.88 7.26e-29 281-315 


1366 


TT1T"»A/M AT) /~* 

IPB001073C 


Complement d q proxem 


13 07 1 87e-14 349-368 


1366 


IPB001073D 


Complement Clq protein 


7.60 8.20e-13 383-392 


1366 


IPB001442A , 


C-termmal tandem repeatea aomain in type 
4 procollagen 


06 19 1 44e-lS 169-221 


1366 


TTlTlA f\ "1 A A *\ A 

IPB001442A 


terminal tanoem repeatea aomain in xype 
4 procollagen 


26 12 1 52e-23 181-233 

X* V7 . JL law/ J—rV*> ^*mJ 1 VJ 1 -s 


1366 


IPB001442A 


C-termmal tanaem repeatea aomam in type 
4 procollagen 


96 12 2 08e-14 160-212 


1366 


IPB001442A 


C-termmal tanaem repeatea aomain in type 
4 procollagen 


96 12 1 00e-18 166-218 


1366 


IPB001442A 


C-termmal tanaem repeatea aomain in type 
4 procollagen 


96 19 ? 9 He- 1 6 202-254 


1366 


IPB001442A 


C-terminal tanaem repeatea domain m type 
4 procollagen 


96 19 ^ 9^p-12 1^7-209 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 3.72e-13 196-248 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 3.93e-17 193-245 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 3.97e-12 199-251 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 


26.12 4.08e-21 175-227 
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4 procollagen 




1366 


TIYTiAA 1 A A O A 

iPBQ0144zA 


C-terminai tandem repeated domain m type 
4 procollagen 


96194 4Se- 1 6 1 90-242 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


Ofi 1 9 4 74e-1 9 1 84-236 


1366 


tt»ti r\r\ a a a*^ A 

IPB001442A 


C-termmal tandem repeated domain in type 
4 procollagen 


96 IT OIp-99 179-994 


1366 


TTITIA/M A A o A 

IPB001442A 


C-terminal tandem repeated domain m type 
4 procollagen 


Ofs 19 6 94p-0Q 1S4-206 


1366 


IPB001442A 


c-termmal tandem repeated domain in type 
4 procollagen 


96 19 ft 40p-?2 178-230 


1 1 i 

1366 


TT>T> Afk 1 A A O A 


i^-termmai tanoem repeated uoinciiii in type 
4 procollagen ! 


26 12 6 54e-ll 148-200 


1366 


TDDAA 1 /I /1 1 A 

1PB0U1442A 


i_,~terminai tanaem repeated domain m type | 
4 procollagen 


96 19 6 86e-17 187-239 


1366 


IPB001442A 


C-termmai tandem repeated domain m type 
4 procollagen 


96 1 9 6 97e-1 6 1 63-21 5 


1366 


IPB001442A 


C-terminal tandem repeated domain m type 
4 procollagen 


96 19 8 4.7p-19 151-90^ 


1366 


IPB001442B 


C-termmai tanoem repeated domain m type 
4 procollagen 


19 ^8 1 57p-1 1 909-229 


1366 


IPB001442B 


C-termmai tandem repeated domain m type 
4 procollagen 


19 ^8 9 ^4p-09 189-902 


1366 


TTJDAA1 yl /I ID 

IPB001442B 


C-terminai tandem repeated domain in type 
4 procollagen 


19 ^8 9 89p_09 188-208 ' 


1366 


TT%T"»/'\/\ AAA IT) 

IPB001442B 


C-terminai tandem repeated domain m type 
4 procollagen 


19 ^8 ^ ^^p-09 179-109 


1366 


IPB001442B 


O-termmal tandem repeated domain m type 
4 procollagen 


19 18 A ODp-1 1 906-996 

LA. DO ^T. uUC 1 1 ZUU -faAU 


1366 


IPB001442B 


C-terminai tandem repeated domain m type 
4 procollagen 


1 9 IS A A7p_1 0 101-911 

LA. Do Hr.*+ / C- lu 171"il 1 


1366 


IPB001442B 


C-termmai tandem repeated domain m type 
4 procollagen 


19 18 S 01p-09 901-99^ 

LA. DO J.vlC ^vJ"j-aJ 


1366 


TTJDAA1 A A ^1 TI> 

1PB001442B 


O-termmai tanoem repeated domain m type 
4 procollagen 


19 18 S 24e-10 194-214 


1366 


IPB001442B 


^-terminal tanoem repeated domain m type 
4 procollagen 


19 18 7 10p-10 200-220 


1366 


inn aa 1 ,1 /iin 

IPB001442B 


t^- terminal tanoem repeated doinam m type 
4 procollagen 


1 9 18 7 57e-l 1 197-217 


1366 


lrBUU1442B 


\_--termmai lanQcm repc«itcQ uoiiidiii in type 
4 procollagen 


12 38 7 93e-09 176-196 


Uoo 


IDDAA 1 AAOTi 


l 1 farminal tcinHf^m ff^t^f^at^H Hr^TYinin in tvnp 
l^~lCriIllIlctl LdllliCIll iCpCcttCU. ULIlilalil 111 ly^J^/ 

H prueuiidgcn 


12 38 8 42e-09 185-205 


lOOO 


TT>T3fkfi1 A/10T3 
IJrtSUU 144Zr> 


V_.»»LC1111111<11 LdllUClll ItlJt^ClltLl VlCllldlll ill 

A. nrnrnl 1 fi p"rti 

*T |J1 VJwvJllclgwll 


12 38 8 66e-09 212-232 


1 166 


rivuuuu / a 


Pnmnlpmfnt C!1 O domain ^ipnnture 1 


20.64 6.54e-20 274-300 


1366 


PR00007B 


Complement C1Q domain signature II 


15.63 3.13e-14 301-320 


1366 


PR00007C 


Complement C1Q domain signature III 


16.13 2.62e-15 349-370 


1366 


PR00007D 


Complement C1Q domain signature IV 


9.66 1.95e-12 381-391 


1367 


IPB001427A 


Pancreatic rib onucl ease family 


12.97 1.41e-10 31-40 


1367 


IPB001427B 


Pancreatic ribonuclease family 


31.44 4.30e-29 45-95 


1367 


IPB001427C 


Pancreatic ribonuclease family 


24.33 8.07e-20 83-113 


1367 


PR00794A 


Pancreatic ribonuclease family signature I 


15.68 1.00e~ 18 46-65 


1367 


PR00794B 


Pancreatic ribonuclease family signature II 


17.91 7.55e-16 66-85 
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1367 


PR00794D 


Pancreatic nbonuclease family signature IV 


1 Q HA & AAo OH 0,4-1 1 £> 

iy. /4 o.4ue-zu no 


1368 


IPB000171A 


Bacterial-type phytoene dehydrogenase 


TO OQ Q 70s 1 1 S7 87 

ly.zo y./ze~ii d/-o/ 


1368 


IPB000927A 


D-amino acid oxidase 


Q "I Q1 a. AO fcJl 


1368 


IPB002135A 


3-hydroxyacyl-CoA dehydrogenase 


oa iq o <rip-in ^R_R^ 
zu.3y z.jue-iu jo-oj 


1369 


IPB000171A 


Bacterial-type phytoene dehydrogenase 


in no q 70p 1 1 ^7_R7 

ly.zo y. /ze-11 j/-o/ 


1369 


IPB000927A 


D-amino acid oxidase 


ojo /.y ie-uy j j-o / 


1369 


IPB002135A 


3-hydroxyacyl-CoA dehydrogenase 


zu.3y z.3ue-iu jo-oo 


1371 


PR00762A 


Chloride channel signature I 


1 /I "7<C H Q^o 1 Q 1 AS 1 O 1 ^ 

14. /o /.yDe-iy iuo-izd 


1371 


PR00762B 


Chloride channel signature II 


11 C/1 O Q7o 1/^1 IQ 1 

11. 84 z.y/e-io uy-ijo 


1371 


PR00762C 


Chloride channel signature 111 


A 1 cio OA OA1 OOO 

y.4z 3.3 /e-zu 2.\J3-zzz 


1371 


PR00762D 


Chloride channel signature IV 


ii 1A A 1 Q AAA-A^O 
1 1.34 4.D*fe-iy 'f'RJ-H'OU 


1371 


PR00762E 


Chloride channel signature V 


11 O/I O /l^o 1 £ A7^ AQ1 
1 1.Z4 o.40e-lO q-fj-'+y l 


1371 


PR00762F 


Chloride channel signature VI 


ir on /I 1 Oo 1 ^ AQ1 ^10 

13. zy 4. i oe- 1 j iz 


1371 


PR00762G 


Chloride channel signature VII 


1 a *7 iQo lO <11 ^A^ 

13. ->3 /. lye-iz D3 


1371 


PR01117A 


CLC-6 chloride channel signature I 


*"7 70 1 AQ<=» 1 Q 0 1 A 

/./y i.uye-iy z-i*t 


1371 


PR01117C 


CLC-6 chloride channel signature III 


9.38 1.19e-17 54-66 


1371 


PR01117D 


CLC-6 chloride channel signature IV 


£L 1 C 1 1 A^ 111 1 

6.15 l.lUe-13 oy-ioo 


1371 


PR01117E 


CLC-6 chloride channel signature V 


-j iq 1 in 0 11 1 on 1 oo 

/.zy l.iue-i3 i/u-i/y 


1372 


IPB000276A 


Rhodopsin-like GPCR superfamily 


11 1 <Oo 11 1 170 

11. do i.oye-13 ijy-i/u 


1372 


PR00245A 


Olfactory receptor signature I 


1 r\ ao i *zf\a 11 111 1 AA 
11). 9o Z.3tle-13 133-14^ 


1372 


PR00245B 


Olfactory receptor signature II 


1<J to c 'Tfia 1 O 1 7A 1 QO 
13. Id 0. /Ue-lZ 1 /U-loZ 


1372 


PR00534A 


Melanocortin receptor family signature I 


12.77 8.04e-09 92-104 


1372 


PR00642C 


EDG-1 sphingosine 1 -phosphate receptor 
signature III 


10 0' , 7 A ^ o „ OA 1 A1 

13.87 4.43e-09 oy-lUl 


1374 


IPB000109A 


PTR peptide transporters (PTR2) 


10.85 3.79e-15 44-oZ 


1374 


IPB000109B 


PTR peptide transporters (PTR2) 


29.23 4.18e-23 67-119 


1374 


PR00308C 


Type I antifreeze protein signature III 


2.79 2.78e-09 20-29 


1374 


PR01471B 


Histamine H3 receptor signature II 


12.38 9.63e-09 24-42 


1375 


IPB000169A 


Eukaryotic thiol (cysteine) proteases active 
site 


10.17 1.00e-14 132-141 


1375 


IPB000169C 


Eukaryotic thiol (cysteine) proteases active 
site 


8.11 2.93e-ll 274-284 


1375 


IPB000169D 


Eukaryotic thiol (cysteine) proteases active 
site 


8.56 9.44e-19 295-311 


1375 


PR00704C 


Calpain cysteine protease (C2) family 
signature III 


H.35 6,0le-09 132-148 


1375 


PR00705A 


Papain cysteine protease (CI) family 
signature I 


11.22 2.80e-21 132-147 


1375 


PR00705B 


Papain cysteine protease (CI) family 
signature II 


10.02 1.45e-10 276-zoo 


1375 


PR00705C 


Papain cysteine protease (CI) family 
signature III 


13.28 1.00e-09 295-301 


1376 


IPB000276A 


Rhodopsin-like GPCR superfamily 


i H| ^ S f\ f\ F I /*\ 1 O ^ 1*1/" 

11.56 9.05e-10 125-136 


1376 


IPB000276B 


Rhodopsin-like GPCR superfamily 


4.97 5.74e-09 201-212 


1376 


" PR00237B 


Rhodopsin-like GPCR superfamily 
signature II 


12.45 7.87e-09 66-87 


1376 


PR00237C 


Rhodopsin-like GPCR superfamily 
signature III 


14.77 8.50e-12 111-133 


1376 


PR00237E 


Rhodopsin-like GPCR superfamily 
signature V 


13.03 4.21e-ll 193-216 


1376 


PR00237F 


Rhodopsin-like GPCR superfamily 
signature VI 


14.34 4.55e-ll 397-421 


1376 


PR00237G 


Rhodopsin-like GPCR superfamily 
signature VII 


19.23 6.09e-09 436-462 
. 
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1376 


PR0H03B 


Adrenergic receptor signdiuic n 


13.72 2.50e-10 139-147 


1378 


IPB000203A 


(jfr'tj domain 


18.40 9.74e-09 209-239 


1378 


IPB000203B 


(jrFo domain 


13.98 3.25e-15 325-346 


1378 


IPB000832C 


O-protein coupieo receptors* idmuy ^ 
^secretin-iiKe^ 


19.53 1.00e-10 325-354 


1378 


PR00249C 


oeCreLin-llKe vjrjr *^rv aupciiciiiiii y signaLui 
III 


15.44 3.32e-09 327-350 


1378 


PR01380A 


oiauoin-j signdiuic i 


5.16 8.79e-09 265-273 


1380 


IPB002190A 


jyLA.ijrr!/ Tamil y 


31.63 4.40e-18 1-37 


1380 


TT"l"E> ArtO 1 OAT) 


IVLrVvJi-/ Ialil 11 y 


27.39 L61e-26 109-153 


1380 


IFBUUziyUC 


A/1" A rT-Tn -fVj v-|-» i 1 -\ / 
IV-LA-vJI-/ Icinill V 


33.01 9.12e-31 179-233 


1380 


IPB002190D 


MAGE family 


18.35 1.00e-40 239-288 


1381 


PR00169G 


Jrotassium cnannei signaxure vn 


11 30 6 38e-09 265-287 I 


1381 


PR01320B 


inward rectiner ' cnamiei bupciiaiiiiij 
signature 11 


12 73 7.46e-ll 162-184 I 


1381 


PRO 133 3 A 


I wo pore domain ivt cnainici aiguaiuic ± 


18 74 8.11e-28 168-196 


1381 


PR01333B 


Iwo pore domain jv i cnurtiici biguauuic ±± 


10 39 2.58e-09 171-180 


1381 


PR01333B 


iwo pore domain iv • t/Xidiiiici oignaiuic xx 


10 39 7.10e-10 280-289 


1381 


PRO 1 463 F 


fc/ACj/ tSLdSJiilvVJ pOXaSSlUm C-Ilalliici istmiky 

signature vi 


4 09 3 42e-09 268-285 1 


i o O 1 

1381 


PRui4yyA 




8.28 7.99e-12 21-34 


1381 


T-|T) A 1 /I f\OD 

rKU14yyt> 


TV/*!/- T<T 4- r'Vicirrn/^l CTT1 atl 1T"R TT 


9.88 8.15e-21 68-85 


1381 


nTJAI A C\C\f~^ 

FKUi4yyL. 


T*«<ilr T<T 4- r>Vi cin n f»1 oionsi'hirp TT F 
1 XGK. IvT OlldllllCl olgllaxixi w xix 


9.14 7.07e-14 105-117 


1381 


rKU14yyiJ 


T't-p'V r-l-innnr^l Qionptiire TV 
1 1 Civ iVi c-iiaiiiici oig,ii«itni ^/ ± y 


8.87 9.05e-20 119-137 


1381 


rKU i 4yyti/ 


T-r/alr T<f 4- r > TiannF k 1 ci on atl lTfi V 


14.43 2.73e-14 146-158 


1381 


rRUi4yyr 


T-aV I/' 4_ r>l>cinnp1 cionntiirp VT 


10.57 1.00e-24 196-215 


1381 


t>"d a 1 a oon 


TVp>V T<r4- orinnnpl «?ipnature VT' 


12.38 7.80e-22 216-235 


1381 


rKU14yyrl 


r TVtf»1/ r IV 4- r'lidTinF'] cicrnatlirft VTI I 


9.00 6.71e- 14 237-248 


1381 


PKui4yyi 


TV^V T<T4- nrtannp! cicmnturf! I X 


11.15 4.73e-20 293-308 


1381 


PR01499J 


Trek K+ channel signature X 


16.62 l.lle-16 315-326 


1381 


PR01499K 


ireK j\t cnannei bignaiuic yvx 


11.81 2.96e-20 328-344 i 


1381 


PR01499L 


Trek K+ channel signature XII 


13.04 4.80e-17 364-378 


1382 


IPB000920C 


jviyeiin r'u proiem 


15 78 6.5 0e-09 245-297 


1382 


IPB003784 


BioY 


27.20 1.22e-09 263-313 


1382 


PR00213E 


ivlyelin FU protein signature v 


5 51 8 97c-09 263-287 


1382 


PR01228C 


Eggshell protein signature III 


5.69 5.14e-09 306-321 


1385 


IPB001039A 


Major histocompatibility complex protein, 
Class I 


17 17 8 89e-ll 19-72 i 


1388 


IPB002130A 


Cyclophilm-type peptioyl-proiyi cis-trans 
isomerase 


1^ R7 1 53e-18 58-79 


1388 


IPB002130B 


Cyclophilm-type peptidyl-prolyl cis-trans 
isomerase 


91 90 2 93e-37 88-126 


1388 


IPB002130C 


Cyclophilin-type peptidyl-prolyl cis-trans 
isomerase 


1fi 09 9 ^Oe-34 138-175 


1388 


PR00153A 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature I 


14. 60 1 00e-15 64-79 






PvrlniYhilin nentidvl-Drolvl cis-trans 
isomerase signature II 


12.94 5.24e-17 93-105 


1388 


- 

PR00153C 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature III 


10.79 1.00e-17 136-151 


1388 


PR00153D 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature IV 


10.73 7.75e-16 151-163 


1388 


PR00153E 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature V 


8.39 5.15e-10 164-179 
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1390 


IrhJOOOz /Dd 


Kiioaopsin-iiite vj.r superidiiiiiy 


4 97 6 06e-1 0 1 02- 111 


1390 


IPB000276C 


Rhodopsin-like GPCR superfamily 


8.03 8.50e-ll 162-173 


1390 


TIT* nA^1 * IT\ 

PR00237D 


Rnodopsm-iike CxrCK superiamiiy 
signature IV 


y, /o *t.joc-U7 jo- /y 


1390 


PR00237E 


Knodopsm-lilce urLK superiamiiy 
signature V 




1390 


TIT* 

PR00237F 


Rnodopsin-like OrtK superiamiiy 
signature VI 




1390 


PR00237G 


Rnodopsin-like LrrCK superiamiiy 
signature Vll 


1 0 9^ ^ 70<=-riQ 1 09-9 1 R 


1390 


r ROOD 29 L) 


Oonadotropnm releasing normone receptor 
signature IV 


99 f^O 7 9-17 


1 ^ aa 
1390 


"DTD AACOOC 

rK00D29r 


oonaaotropnin releasing normone recepiui 
signdiure vi 


S 71 S f)Qe-1 0 7^5-89 


i ion 

1390 




vjonavioiropiilll IClCciolllg iiuhiiuiig ic^^^juji 
bJH,llctlUl C V 111 


9 64 9 43e-l 1 176-196 


1392 


IPB003662A 


General substrate transporters 


18.97 8.76e-13 196-228 


1393 


TI>T5AAA97£ A 


isJiQUopbiii~iiive vja "oxs. bu|Jcii<tiiiiiy 


11 56 1 69e-13 157-168 


1393 


PR00237C 


Rhodopsin-like GPCR superfamily 
signature 111 


14.77 3.84e-10 143-165 


1393 


rK0Uz45r> 


diiactory receptor signature 11 


1 ^ 1% 6 54e-l 1 1 68-1 80 

LJm I -J v 1 . J'T'C 1 1 1 VJ U LOW 


1 aAO 
1393 


i-'KOU^D U 


vjiiactory receptor signature in 


14 6S ^ 75e-l 1 215-231 


1393 


rK0UZ4Dii 


vjiiactory receptor signature v 


S Q6 7 91e-l0 322-333 


1 mi 

1397 


TT5TD A AO A A *C"D 

1PB003000O 


immunoglobulin ana major 
nisiocoiiipatiDiiity tompicA. uuiiiam 


90 9^ 6 SSe-09 294-331 


1397 


IPB003006B 


Immunoglobulin and major 

nibLUUUllipaLliJiiiLy uuiupiCA. uajhicuh 


20.23 S.92e- 10 487-524 


1 ^07 






20 65 2 31e-09 646-668 


1397 


PR00014C 


Fibronectin type III repeat signature III 


14.47 8.71e-09 585-603 


139 / 


x^K.UUU14JJ 


riuroneLim type 111 rcpcdi bigiiaiuic i v 


15 12 2 96e-09 1 093-1037 


139 / 


rK0UU14U 


rioronectin type 111 repeal biguaiurc i v 


15 12 9 22e-09 701-715 


140z 


TDT5AAA1 QCT3 

lrr>U0U19!>J3 


ixau o/\Jr/ i r>i^ aomdin 


14 35 5 15e-09 38-50 


1402 


PR00250G 


Fungal pheromone mating factor STE2 
ur v^xv signature v 11 


9.58 9.75e-09 21-42 


1409 


IPB001359H 


Synapsin 


22.58 9.63e-10 53-103 


1409 


lrB00300o.B 


immunogioDuim ana major 
histocompatibility complex domain 


90 1% 1 4^p-1 ^ 907-^^4 


1409 


mDAAi coin 

IPB003531C 


onort nematopoietin receptor iamny i 




1413 


PR01131B 


Connexin36 (Cx36) signature II 


3.45 2.41e-09 500-512 


1 A 1 A 

1414 


IPB001993A 


Mitochondrial energy transrer proteins 
(carrier protein) 


1AQ1 7 OOp 11 14^-1^7 


1 A 1 A 

1414 


mnAAi r\c\^ A 

IPB001993A 


Mitocnonariai energy transier proteins 
(carrier protein) 


10. yi /.jUc-10 40-/1/ 


•t A H A 

1414 


IPB001993B 


Mitochondrial energy transfer proteins 
(carrier protemj 


A AA C 1 1/11 QO 1 Q«: 

y.uu D.i oe-14 loz-i^j 


1 A 1 A 

1414 


PR00927E 


Adenine nucleotide transiocator l signature 

V 

V 




1416 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 1.00e-09 6-43 


1416 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 7.92e-09 88-125 


1417 


IPB000315A 


B-box zinc finger superfamily 


24.51 3.25e-10 30-53 


1417 


IPB001S41 


RING finger 


10.69 1.56e-10 42-51 


1418 


IPB001285A 


Synaptophysin/synaptoporin 


12.19 1.00e-28 41-87 
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1418 


IPB001285C 


Synaptophysin/synaptorjorin 


15.24 9.89e-23 125-165 


1418 


IPB001285D 


Synaptophysin/synaptoporin 


1? 74 4 03e~13 166-200 


1418 


IPB001285E 


S ynaptophysin/synaptop or in 


9.55 2.98e-24 201-247 


1418 


PRO022OA 


Synaptophysin/synaptoporin family 
signature I 


n rQ7 60^-96 38-60 


1418 


PR00220B 


Synaptophysin/synaptoporin family 

• * TT 

signature 11 


l/.UO J jjc-Zj Oii«-0/ 


1418 


PR00220C 


Synaptophysin/synaptoporin tamily 

' i TTT 

signature III 


ii R^p-?^ 1 1 7-141 

11,00 Z,OJC"A(J 11/ J.TA 


1418 


PR00220D 


Synaptophysin/synaptoporin tamiiy 
signature 1Y 


10 60 6 S4e-25 149-172 


1418 


PR00220E 


Synaptopnysin/synapioporin iamny 
signature v 


4 56 4 60e-23 216-234 


1418 | 


PR00489C 


rrizziea prorem signaiure in 


8 19 2 36e-09 216-240 


1419 


IPB000729B 


jrJXdr-ZZ/JDlVIr/lvil^zu idmiiy 


13 56 4.86e-ll 49-59 


1419 1 


IPB000729C 


rMr-Zz/JiMr/iVirzu iarniiy 


37 83 9 01e-15 80-132 


1419 


IPB000729C 


"JYLa -ZZ/lilvix^/ivrrzu ituniiy 


37 83 9 72e-10 87-139 


1421 


IPB001905A 


Ammonium transporter idmiiy 


22 03 8.47e- 14 69-97 


1421 j 


IPB001905B 


Ammonium tTansponer iamny 


19 59 6 08e-17 127-159 


1421 i 


IPB001905C 


Ammonium transporter idmiiy 


15 52 5 35e-18 178-203 


1421 


IPB001905D 


Ammonium transpoiLcr iarniiy 


11 89 3.61e-19 217-240 


1421 


IPB0019U5r 


Ammonium tranbporici ictniiiy 


17.91 1.00e-18 287-313 


1421 ! 


PRQ0342B 


rCneSUS DIOOQ group piUlClll oigiiatuit> xj. 


14.04 5.57e-13 61-78 


1421 


PR0Q342C 


ivnesus Diooo group pruicin 5i^n<tim^ aia 


11.52 5.03e-ll 90-107 


1421 


T»Ti AA1 A O T A 

PR00342D 


IviieSUS D1O0Q group prOLCUi ai^,iiaiuic x v 


11 IS 8.69e-13 128-144 


1421 


PR00342b 


KneSUS DIOOQ group pruLCUi bigTiciLUic v 


17 96 3 74e-10 151-174 


1421 


PR00342F 


Dttnntin V«Trti-\<-J rrt*/-\nr\ r\r , r\i"^i'ri 01 OTIil'f'1 1T"P A/T 

Knesus Diooo group prutciix ssi^iittLuic v x 


8 73 3 88e-10 185-200 


1421 


PR00342G 


KneSUS DIOOQ group proxcm aigiiatuiu > xx 


10 10 2.70e-20 220-238 


1421 


PR00342H 


JtMiesus uiooa group proicin Mgnaiuic v xxx 


8 86 1 76e-12 250-272 


1423 


IPB001427A 


Jrancreatic riDonuciedac idimiy 


12 97 5 09e-10 36-45 


1423 


IPB001427B 


Pancreatic ribonuclease family 


31.44 6.04e-34 50-100 


1423 


IPB001427C 


Pancreatic ribonuclease iamny 


24 33 2 17e-13 106-136 


1423 


J PR00794A 


Fancreatic riuonuciease iamny signaiuie x 


15 68 6 31e-13 51-70 


1423 


PR00794B 


Pancreatic ribonuclease family signature II 


17.91 6.34e-13 71-90 


1423 


| PR00794C 


Pancreatic ribonuclease iamny signature in 


16 87 8 07e-14 96-114 


1423 


PR00794D 


Pancreatic ribonuclease iamny signature i v 


19 74 2 37e-09 117-139 


1426 


IPB000038A 


Cell di vision/ Cj IF binding protein 


16 77 9 89e-09 447-488 


1426 


IPB001984C 


ATP-dependent serine proteases, Lon 
family 


12.76 3.32e-09 158-194 


1426 


PR00318A 


Alpha G protein (txansaucmj signature i 


6 88^1 9e-10 469-484 


1426 


PR00830A 


Endopeptidase La (Lon) serine protease 
(bio) signature 1 


8 5?? 14e-1 1 165-184 


1426 


PROllOOA 


oniKimate Kinase iamny signature i 


8 30 1 00e-08 470-485 


1427 


IPB0004o3 


Leucine ricn repeat L>-terrnind.i uomdin 


11 18 6 85e-13 164-178 


1427 


IPB003006B 


ImmunogloDuim ana major 
nistocompatiDiuiy compicA uomdin 


20 23 8 20e-10 259-296 


1427 


PR00019A 


Leucine-rich repeat signature I 


11.72 3.18e-09 52-65 


1427 


PR00019A 


Leucine-rich repeat signature I 


11.72 5.64e-09 77-90 


1427 


PR00019B 


Leucine-rich repeat signature II 


11.42 1.27e-09 25-38 


1427 


PR00019B 


Leucine-rich repeat signature II 


11.42 4.82e-09 49-62 


1427 


PR00019B 


Leucine-rich repeat signature II 


11.42 6.32e-ll 74-87 


1430 


PR00462B 


Fungal lignin peroxidase family signature II 


14.83 4.74e-09 34-52 


1435 


IPB002610A 


Rhomboid family 


5.73 9.50e-10 185-194 


1437 


| IPB001627A 


Sema domain 


16.97 9.34e-09 138-153 
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1437 


IPB001627C | 


bema domain 


21 13 7.1 le-09 201-232 1 


1437 


IPB001627D 


isema aomain 


16 04 9.47e-17 249-264 


1437 


IPB001627F 1 


bema aomain 


22 0^ 8 50e-27 293-326 


1437 


IPB001o27Cj 


aema aomain 


21.49 5.36e-22 346-379 


1437 


TnT» AA1 /'onT i 

IPB00162/J | 


bema oomain 


11 43 2. 97e- 14 462-478 


1437 


IPB001627K 


Sema domain 


13.76 4.00e-14 540-552 


1437 


IPB002165A 


Plexin repeat 


15 07 3 73e-12 138-153 


1437 


IPB002165B 


Plexin repeat 


13.59 3.43e-12 175-184 


1437 


IPB002165C | 


Plexm repeat 


1 8 49 6 ^?2e-34 293-325 


1437 


IPB002165D 


Plexin repeat 


14.72 2.85e-15 540-552 


1437 


IPB002861B 


Reeler domain 




1437 


IPB002861B 


Reeler domain 


10.50 9.76e-12 644-672 


1437 


PR01303D 


Plasmodium circumsporozoite protein 
signature IV 


in o AO 708-79 


1438 


PR00170A 


Voltage-gated Na+ channel alpha subunit 
signature I 


52 ^ ^ AzL*=» in R00-S19 


1438 


PR01097A 


Transient receptor potential family 
signature I 


1 1 S 06p-09 799-820 


1439 


PR00170A 


Voltage-gated Na+ channel alpha subunit 
signature I 


O.jj «j.v i +c-iV/ ooi y\j\j 


1439 


PRO 1097 A 


Transient receptor potential family 
signature I 


1 1 ^4 ^ n^-09 SS0-Q01 


1442 


IPB000483 


Leucine rich repeat C-termmal domain 


1 1 1 S f\ 7^p-1 ? 1 86-200 


1442 


PR00019A 


Leucine-rich repeat signature I 


11.72 1.33e-10 106-119 


1443 


IPB000135A 


High mobility group proteins HMul ana i 
HMG2 


1 1 £Q 9 ^0/^ ^1 ?A_RS 

11. Dj? Z.07C0 1 oo 


1443 


IPB000135B 


High mobility group proteins HMLrl ana 
HMG2 


11 9 A 1 78^_94 RR-119 


1443 


IPB000203A 


GPS domain 


1040 R 41p-09 1 26-1 S6 


1443 


IPB000203B 


| GPS domain 


13.98 4. 95e-12 286-307 


1443 


IPB000832C 


G-protem coupled receptors family 2 
| (secretin-like) 




1443 


PR00249C 


Secretm-like GPCR superfamily signature 
III 




1443 


PR00249D 


Secretin-like GPCR superfamily signature 
IV 


u in Q 44e_09 ^97-^^9 


1443 


PR00886A 


High mobility group (HMGI/HMCjz; 
protein signature I 


1 n ^7 R 99^-1 7 17-^SQ 


1443 


PR00886B 


High mobility group (HMG1/HMG2) 
protein signature II 


10.07 8.97e-13 60-80 


1443 


PR00886C 


High mobility group (HMG1/HMG2) 
protein signature III 


11.20 1.40e-ll 104-122 


1448 


IPB002000D 


I Lysosome-associated membrane 
I glycoprotein (Lamp) 


5.87 5.75e-09 419-432 


1448 


PR00308C 


j Type I antifreeze protein signature III 


2.79 9.64e-09 758-767 


1449 


PR00180B 


Cellular retmaldehyde-binding protein 

1 ClfTTldtllt*^ 11 


ID. /o O.Ujc-U!? ZjD-ZDU 


1452 


IPB001993A 


Mitochondrial energy transfer proteins 
(carrier protein) 


16.91 8.65e-10 33-57 


1452 


IPB001993A 


! Mitochondrial energy transfer proteins 
(carrier protein) 


16.91 9.39e-09 130-154 


1452 


IPB001993B 


Mitochondrial energy transfer proteins 
(carrier protein) 


9.00 4.21e-09 78-91 


1 1452 


IPB001993B 


Mitochondrial energy transfer proteins 


9.00 7.14e-ll 173-186 
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(carrier protein) 




1452 


PR00926D 


ivlitocnonariai carrier protein Mgiieuuic iv 


10 86 7 92e-17 41-59 


1452 


PR00926E 


Mitochondrial carrier protein signature V 


11.83 5. 74e- 19 89-107 


1452 


PR00926F 


Mitochondrial carrier protein signatuie vi 


17^11 00e-09 35-57 


1452 


PR00926F 


Mitochondrial carrier protein signature VI 


17.31 5.76e-24 132-154 


1452 


PR00927A 


Adenine nucleotide translocator 1 signature 
I 




1452 


PR00927B 


Adenine nucleotide translocator 1 signature 
II 


1^15 1 44p_19 164-185 


1452 


PR00927D 


Adenine nucleotide translocator 1 signature 
IV 


in 89 ^ 7Qp-1^ 26-39 


1452 


PR00927E 


Adenine nucleotide translocator i signature 
V 


14- 99 6 32e-l 1 162-183 


1452 


PR00927E 


Adenine nucleotiae translocator 1 signature 
V 


14. go Q 25e-22 67-88 


1452 


PR00927F 


Adenine nucleotiae translocator l signature 
VI 


1? 47 S ^Oe-14 128-144 


1452 


PR00927G 


Adenine nucleotide translocator i signature 
VII 


10 67 5 14e-15 178-193 


1454 


IPB001993A 


Mitochondnal energy transier proteins 
(carrier protein) 


16 91 8 75e-ll 83-107 


1454 


IPB001993B 


Mitochondrial energy transier proteins 
(carrier protein) 


9 00 1 72e-10 242-255 


1455 


PR00320A 


G protein beta WD-41) repeat signature i 


1^ 15 1 96e-09 128-142 


1455 


PR00320C 


G protein beta WD-40 repeat signature III 


12.32 3.67e-09 128-142 


1456 


IPB000494A 


Epidermal growth-factor receptor (isorKj, 
L domain 


99 80 £ 5Qp-?S 122-154 


1456 


IPB000494B 


Epidermal growth- factor receptor (bufKj, 
L domain 


9A 47 0f)p-1 8 928-748 


1456 


IPB000494C 


Epidermal growtn-tactor receptor ^riLirrt;, 
L domain 


94 40 9 45e-33 1099-1 145 


1456 


IPB000494D 


xipidermai growtn-iactor recepiur {CKjris-j, 
jl domain 


19 44 1 00e-40 1162-1211 


1456 


IPBO 00494b 


xipiaermai growtn-iaCLur rc^cpLui v. J -' VJ - L xv /s 
j_r aomam 


21 65 3.53e-27 1218-1253 


1456 


TDD a a 1 a/""* 
lrrJOOlUoUO 


r^^ll rli\Hcir\Ti rnntrnl nrntpin 15 TCDC15^ 
v^cii aivisioii cuiiii^ji pivjLcin ij 


26.16 L00e-09 1183-1202 


1 A C S" 

1456 


lrBU01240A 


1 yrOSine KlllabC L-aLaiyilW UUUlalU 


22.45 4.27e-19 1105-1145 


1456 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 6.54e-29 1165-1203 


1456 


IPB001772C 


isJ.nase associa.teu. cioiiid.111 i 


20 66 1.75e-10 1100-1130 


1456 


IPB002174A 


Furin-like cysteine rich region 


30.51 4.41e-13 126-157 


1456 


IPB002174B 


Furin-like cysteine ncn region 


11 91 7 00e-10 1183-1193 


1456 


IPB003961B 


Fibronectin type III domain 


12.29 7.43e-ll 1184-1194 


1456 


PR00014D 


Fibronectin type 111 repeat signature i v 


15 12 6 09e-09 893-907 


1456 


PR00109A 


Tyrosine kinase catalytic domain signature I 


12.56 5.34e-ll 1066-1079 


1456 


PR00109B 


Tyrosine kinase catalytic domain signature 

II 


1 1 07 1 47p-90 1112-1130 




pr nni oqp 


TvrnQinp kinase ratal vtic domain signature 

III 


11.86 9.25e-13 1164-1174 


1456 


PR00109D 


Tyrosine kinase catalytic domain signature 
IV 


17.61 4.50e-24 1183-1205 


1456 


PR00109E 


Tyrosine kinase catalytic domain signature 
V 


12.99 4.48e-16 1227-1249 


1457 


IPB002889B 


WSC domain 


11.76 8.41e-09 1558-1604 


1462 


IPB000130 


Neutral zinc metallopeptidases, zinc- 


5.86 4.00e-12 238-248 
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binding region 




1462 


IPB001818A 


Matrixin 


1 A f\C\ 7 1 1p-9Q 01-170 


1462 


IPB001818B 


Matrixin 


96 & 0/1f*-11 117-17K 


1462 


IPB001818C 


Matrixin 


OA IS <1 OQp-19 90-6^ 


1462 


IPB001818C 


Matrixin 


OA 1R 7 zIIr-IS 1 R9-997 


1462 


IPB001818D 


Matrixin 


1 A Q1 1 OR^-17 919-96^ 

J.uoCO / Z*DZi~Z,\}*J 


1462 


IPB001818E 


Matrixin 


o o/; 7 i Qp i< 971-98.6 


1462 


IPB001818F 


Matrixin 


11 1Q ^ ^fip 11100-190 


1462 


PR00138A 


Matrixin signature I 


12.54 1.64e-16 111-124 


1462 


PR00138B 


Matrixin signature II 


14. o4 ->.Zie-iU lOO-l/l 


1462 


PR00138C 


Matrixin signature III 


OA A'? 1 7Q 0 16 1 8fl 70.Q 


1462 


PR00138C 


Matrixin signature III 


OA AT O O Q a 1 C 1 O /1_6 

ZU.U/ o.Joe- ID 15-40 


1462 


PR00138D 


Matrixin signature IV 


i /I <n < cfta n 71S 761 


1462 


ER00138E 


Matrixin signature V 1 


n 1A Q ^lo ic 071 7R6 

/.1U o./ie-iJ z/j-zoo 


1462 


PR00480B 


Astacin family signature II 


14.35 1.16e-ll 233-251 


1462 


PR00997G 


Fragilysm metallopeptidase (M10C) 
enterotoxin signature VII 


11.64 o.Ule-10 Z44-Z0U 


1466 


IPB000008D 


C2 domain 


14.83 5.92e-09 288-306 


1466 


PRO 147 IE 


Histamine H3 receptor signature V 


c /i 1 o /iirt in /Tin /^oc 

5.41 Z.43e-lU 01U-OZD 


1468 


IPB000228D 


RNA 3'-terminal phosphate cyclase 


1 o a on„ 1 A OA £J~I 

33. lo 4.oye-iy 3U-o / 


1468 


IPB000228E 


RNA 3-terminal phosphate cyclase 


^/^Ol o ^ 1 - AA 1 AC 1 ,/1 

26.21 o.43e-L)y 11O-140 


1468 


IPB002029A 


Aspartate and ornithine 
carbamoyltransferase family 


20.73 7.86e-09 78-105 


1470 


IPB003874C 


CDC45-like protein 


5.49 9. 16e-10 315-326 


1473 


IPB002568A 


Carlavirus putative nucleic acid binding 
protein 


12.57 9.66e-09 9-34 


1473 


IPB002999B 


Tudor domain 


7.50 5.09e-09 29-3 / 


1473 


IPB003482B 


Transcription factor WhiB 


t /\ rt/^ ^ no An 1 A OO 

10.09 5.73e-09 10-22 


1474 


PR01547I 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature IX 


■t c o t ■— 1 c\ A ^ OO /CA A A 

15.37 7,94e-28 69-99 


1474 


PR01547K 


Saccharomyces cerevisiae 175.8kL>a 
hypothetical protein signature XI 


ii £:i oaa^ 1^Q*% 11/1 
13.61 /.UUe-1 j yj-i. 14 


1474 


PR01547L 


Saccharomyces cerevisiae l/D.oKua 
hypothetical protein signature Xil 


in ic £ o On 10 11Q 11^ 

1U.3D o.zye-iy iiy-133 


1477 


IPB000822 


Zinc finger, C2H2 type 


1 A £LH O <?Qo AA A 1 1 AAA 

14.0/ z.ooe-uy hzi-^d 


1477 


PR01407E 


Butyrophylm C-terminal DUF signature V 


11 OA 1 C\Q ^A_RO 

13. zy 1 ,o_>e-uy jo-ou 


1478 


IPB001925C 


Eukaryotic porin 


Ol 1 952 R 61 
Z3.DD l.o4e-Zo O-Dl 


1478 


PR00185D 


Eukaryotic porm signature IV 


1/1 11 Q ^Op> 19 1 1 1 -198 


1482 


PR00449E 


Transforming protein P21 ras signature V 


11 1Q 1 9Q/--1 1 

lj.jy 1.Z7C-11 HD^-tou 


1489 


IPB003S59B 


Metazoa galactosyltransferase 


77 S 97^-18 90^-9^7 

Z/.OJ) O.Z / C'lO Z.UJ Z.J / 


1489 


IPB003859C 


Metazoa galactosyltransferase 


10 ^1 1 74<=-1n 101-^^5 


1489 


IPB003859D 


Metazoa galactosyltransferase 


26.76 6.25e-27 349-382 


1491 


IPB003191M 


Guanylate-bmding protein 


1U.35 /.ooe-zo zo-jo 


1491 


IPB003191N 


Guanylate-binding protein 


a ai £. 1 60 91 61 Q1 

9.33 o.ioe-zi 01-yi 


1491 


IPB003191O 


Guanylate-binding protein 


1/111 777/aOQ 07 1 17 
14.31 /.ZZc-Zo yZ.~i.DZ, 


1491 


IPB003191O 


Guanylate-binding protein 


\ A 0.1 Q 71<= AO R1-171 
14.31 O./jo-W 01-1Z1 


1493 


PR00502B 


NUDIX hydrolase family signature 11 


u co 4 1 Sp-09 148-^63 
i'+.oZ) *+. 1 jc \jy jto jvj 


1494 


IPB000117D 


Kappa casein 


10.18 8.71e-09 117-151 


1497 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 3.00e-15 88-125 


1498 


IPB001395E 


Aldo/keto reductase family 


20.87 7.39e-23 256-293 


1498 


IPB001395F 


Aldo/keto reductase family 


25.03 2.45e-23 317-362 


1498 


PR00069D 


Aldo-keto reductase signature IV 


19.00 3.92e-20 260-289 


1499 


IPB000237B 


GRIP domain 


30.66 8.24e-09 543-593 
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1499 


1PBUUUZ.3 /r> 


VjxviJr uoiiidiu 


30.66 8.34e-09 547-597 


1499 


tf» t> aa a a n^TD 


/"M of ViTMn 1 "i rrh f pin dm 1 

i^iainrin ngQL ciiam 


20.25 7.55e-09 541-593 


1499 


IPBOOloUOD ! 


Xjipoproiein, typ^ o 


22.73 9.11e-09 558-604 


1499 


IPBOOloUOD 


l_»ipoproiein, type t> 


22.73 9.70e-09 563-609 


1499 


T"p»T"> rvnol A1 TJ 
IPB0021U1.D 




14.58 2.28e-09 545-596 


1499 


IPB002906A 


Ribosomal protein S27a 1 


11.48 3.00e-09 561-593 


1499 


TTin f\f\ , ^n^\f\/Z A 
IPBOO29O0A 


KiDOsoruai proLcin oz./a L 


11.48 9.81e-10 558-590 


1499 


IPB003121A 


jdAxOUD uornain 01 inc o vvijj l-uiii^isa [ 


13.89 9.46e-09 552-577 J 


1499 


IPB003134P 


K^epeat In rlo 1/ v_-orid.LLiii i 


15.66 8.49e-09 552-600 


1503 


tit> r\f\f\ a 

PRO 0 049 u 


W lim S LUiiiuur pruiciii aigiicitui^ j. v i 


0.00 9.57e- 10 65-79 


1503 


Ti"T> A 1 O 1 7D 

PRU121 /r> 




4.82 8.55e-09 71-87 


1504 


IPB000436B | 


OnoVii rlrtmain / ^f^R TPnf^at / 0(~!P modllle i 
oUSm QOruairi / ol_/lv icpcai, / muuui^ t 


8.70 8.71e-10 307-318 


1504 


PR01544D 


a^Im'^/m-icio +V»alicin5i 1 ^0 WDa VivnotHetical 

ATEDlQOpSlS tnd.ild.ricl UU. / IvJ-^ct iiyjj^niv^ui^*** 

proxem signdiurc i v [ 


14.54 9.13e-10 373-394 


1511 


IPB000822 


Zinc linger, v^znz xype i 


14.67 3.77e-18 343-368 


1511 


IPBOOOoZZ | 


Ziinc linger, \~>z.nz. Lyp c 


14.67 9.25e-19 371-396 


1511 


IPB001275 


JJ1VI U1NA DinUing UUlllalll 1 


19.17 3.51e-09 331-370 


I5ll 


IPB001275 I 


TY\/f T^"\T A KinHincr HAm?*in i 


19.17 5.67e-09 359-398 


1511 


TTlTIAAl AAA 1 

IPB001909 


T7"P A D Unv 1 

KKAd dox 


17.37 4.00e-19 230-264 j 


1511 


PRO 004 8 A j 


vyZrlz-xype zmo iirigcx oigiiatuic x i 


9.94 5.76e-12 368-381 


1511 1 


T4TJAAA/10 A 1 

PR00048A | 


V_>rZri.Z~type ZjIIIL' xlUgd oigiiai.uiv^ ± j 


9.94 7.75e-16 340-353 


1511 


r>T> AH A/1 OT3 1 

rxCUUU4or> | 


/~ i 9T-jo_t\mp 71'np finder siienature It ' 


5.52 7.43e-10 384-393 


1517 


TTYTIAAAOII i 

IPBOOOoZZ 1 


Z/inc linger, wnz «-yp c i 


14.67 1.00e-17 1066-1091 


ir in 

1517 


lFrJUUUcSZZ j 


7\r\r> fincrpr P9H2 tvne 


14.67 1.00e-19 281-306 


1517 


TT»n f\ r\ aoi O 
IPBOOOoZZ 


z<inc linger, l^zjtiz, iypc 


14.67 1.60e-16 393-418 


1517 


rnnAAAOO 
IPB0008ZZ 


Z#mc ringer, ^/-nz iype 


14.67 1.60e-16 954-979 


* t~ t '-7 
1517 


IPdUUUoZZ 


ZjIIIU rillgCl, V-^Z^i. ix. i,y^J^ 


14.67 1.69e-18 477-502 ~1 


1517 


TTinAAAOlO 

IPBOOOoZZ 


Ziinc ringer, v^znz type 


14.67 2.13e-14 1038-1063 


1517 


IFdUUUoZZ 


T^nr* fincrpr P9R9 tvne 


14.67 4.79e-12 1094-1119 


1517 


IPBOOOoZZ 


z<mc linger, wnz. L yp c 


14.67 5. 50e-19 421-446 


1517 


IPBOOOoZZ 


Ziinc linger, v~^rrz< type 


14.67 5.50e-20 870-895 


1517 


lPBOUUoZZ 


Ziinc linger, ^z,riz. Lypc 


14.67 6.73e-21 926-951 


1517 


TT»T*> A A AO TO 

IPBOOOoZZ 


fincrpr P9H9 tvtlP 

| z*mc ringer, v^z,n^ iype 


14.67 7.00e-20 309-334 


1517 


lPBOUUoZZ 


Tinr* -fincrpr l~ 1 9T- : r9 tvne 

Zjinc iingei, L yp c 


14.67 8.36e-21 337-362 


1517 


IPBOOOoZZ 


Vi-rii- fincrpr ^9 149 tvnp 

Ziinc linger, v^zriz, type 


14.67 8.36e-21 982-1007 ] 


1517 


TT1TD A A C\ Q 


7"inp fincrpr (^9142 tvne 


14.67 8.50e-20 449-474 


1517 


lPBOUUoZZ 


•Zinr* fincrpr (^9TT9 tvne 
Lini liiigci, v^z.i ix- tjpu 


14.67 9.25e-20 898-923 


1517 


IPB000822 


j Ziinc linger, tznz type 


14.67 9.31e-18 1010-1035 


1517 


IPB000822 


] Z/inc linger, Lznzr type 


1 14.67 9.40e-16 365-390 | 


1517 


IPB000822 


| Zrinc ringer, ^^nz type 


14.67 9.44e-15 505-530 


1517 


IPB001275 


1JJV1 LiiN/V Dinuiiig UUllldiii 


j 19.17 1.12e-10 970-1009 | 


1517 


IPB001275 


I L)JV1 iJiN A Dinamg aumam 


19 17 3.55e-10 297-336 


1517 


IPB001275 


j iJJVl UJNA DinCling UOIlldlll 


19.17 3. 69e-12 437-476 


1517 


IPB001275 


f UJVl UiNA Dinamg uuiiiairi 


19.17 4.19e-09 886-925 


1517 


IPB001275 


1 UJV1 UiNA umuing UUIlld.111 


19.17 4.19e-09 942-981 ! 


1517 


IPB001275 


1 DM DNA binding domain 


19.17 6.71e-13 325-364 


1517 


IPB001275 


1 DM DNA binding domain 


19.17 6.72e-10 1082-1121 


1517 


IPB001275 


DM DNA binding domain 


19.17 6.72e-10 914-953 


1517 


IPB001275 


DM DNA binding domain 


19.17 8.06e-09 858-897 


1517 


IPB001275 


1 DM DNA binding domain 


19.17 8.18e-10 998-1037 


1517 


IPB001275 


1 DM DNA binding domain 


19.17 9.10e-ll 1026-1065 


1517 


IPB001275 


DM DNA binding domain 


19.17 9.88e-10 465-504 1 


1517 


IPB001909 


j KRAB box 


17.37 8.03e-18 675-709 
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Results* 


1^17 

LD L / 


TPROOI 909 


KRAB box 


17.37 9.00e-32 76-110 


1^17 
LD 1 / 


PR0004R A 


f^T^-tvnp 7inp fin per ^ionatlire 1 


9.94 1.69e-14 306-319 


1^17 
LD 1 / 


PRnnn4R a 


f^9T-T9-tvnp 71'nr fin per Qionature 1 


9.94 1.90e-15 446-459 


1 *>1 7 
LD L / 


PR0004R A 


C9TT9-tvne zinc fin per signature I 


9.94 2.42e-ll 474-487 


1^17 
LD L / 


PR0004R A 


(~ 1 9TT9-rvnp zinc finder ^ipnature I 


9.94 2.93e-13 502-515 


1^17 
LD L l 


prooo4r a 


f~ , 9TT9-tvnp zinc fin per ^ipnature 1 

LmXIa l_y|JW Zj1J.Iv/ xlll£^Vvl Olg, licit Hi. v/ x 


9 94 3.08e-14 1035-1048 


1^17 
LD L I 


PR0004R A 


(^91-^9 -tvnp zinc fin per ^ipnature 1 

^>j£rXxX* tVIJV/ Z/111V/ J-iilg,VJl OlgllOlWlv X 


9.94 4.38e-09 1007-1020 


LDL 1 


PR flHfklR A 


f^9T-T9-tvnp zinc fin per ^ipnature 1 


9.94 5.50e-13 334-347 


LD 1 / 


PR AAH4R A 


r^9TT9-tvnp yinr fin per ^ionfltllTP 1 


9 94 5 5 0e- 13 895-908 


1^17 
LD L I 


prooo4r a 


r^9W9-tvnp 7i nc fin per <?ienature I 


9 94 6 14e-13 923-936 


1^17 
LD L 1 


PR0004R A 


(^9Hf9-tvnp 7\ nr finder signature I 


9.94 7.00e-09 867-880 


1^17 
LDL 1 


PR 00 OAR A 


r^9H9-tvnp 7\x\c fincer sifTiatnre T 


9.94 7.35e-12 362-375 


1^17 
LD L ! 


PR on OAR A 


r^9T-T9-tvnp 7inp finder ^ipnature ] 


9.94 7.55e-10 390-403 


1 S1 7 
LD 1 / 


PR 0004R A 


(~ , 9T J T9-tvne 7inc finder signature I 


9.94 8.11e-ll 979-992 


1^17 
LD L I 


PR0004R A 


(^91^9-1^?, 71 nc fin per signature I 


9.94 8.58e-ll 418-431 


1 *»1 7 

LD L f 


PR0004R A 


P9T-T9-tvne 71 nc fin per sienature I 


9.94 9,05e-ll 951-964 


1 S1 7 
LDL/ 


PR 00048 A 


C!21-r2-tvne zinc finper sienature I 


9.94 9.36e-13 1091-1104 


1^17 
LDL/ 


PR0004RR 


C^^M^-tvne zinc fineer signature 1J 


5.52 1.00e-09 91 1-920 


1^17 
LDL/ 


PR0004RR 


P9T-T9-tvne zinc "fin per signature 11 


5.52 2.29e-10 939-948 


1 *\1 7 
LD L / 


ppnno4R"R 


r^9T-T9-t\mp zinp finppr Qicmatnrp TT 


5 52 2 50e-09 995-1004 


1517 


PR00048B 


C2H2-type zinc finger signature II 


5.52 2.93e-10 350-359 


1^17 
LD L / 


ppnnn4RP» 

rLyjJKJUHolJ 


f _, 9TT9_t\mf i v 1 n p fincrpr ciicmntnrp TT 


5 52 3 25e-ll 294-303 


1517 


PR0004SB 


C2H2-type zinc finger signature II 


5.52 3.25e-l 1434-443 


1M / 


t>t» Ann 4 qti 


^Zxiz-iype zinc linger signd-iurc 11 


S 52 1 25e-ll 883-892 


LDL / 






5 52 3 57e-10 1023-1032 


1517 


PR00048B 


C2H2-type zinc fmger signature II 


5.52 6.00e-09 462-471 


i r i n 

1517 


PR00048B 


czriZ-type zinc linger signature 11 




1518 


IPB000353B 


Class II histocompatibility antigen, beta 
cnam, ueta-l aoniain 


19.16 2.63e-13 21-70 


1518 


IPB000353C 


Class II histocompatibility antigen, beta 
cnam, beta-i aomain 


20.11 1.41e-10 71-125 


1518 


IPB001003B 


JVlov^ oiass 11, aipna cnam, aipna- 1 Qomain 


14 79 4 4^e-0Q 9Q-79 


1518 


lrB00300oA 


lmmunogioDuim ana major 
nistocompaxiDiiity complex uomam 


17 SI 1 ODp- 1^49-64 


1518 


IPB003006B 


Immunoglobulin and major 
nisiocompdii Din iy coinpicx uomdiii 


20.23 9,40e- 19 78-115 


1518 


PR01436D 


NADH-ubiquinone oxidoreductase chain 2 
signaxure iv 


4.56 2.66e-09 6-18 


1521 


IPB001618 


Coenzyme A transferase 


37.23 1.86e-09 56-109 


1 CO 1 


TP"Df\n 1 ^1 8 

JLtvoUU 10 lo 


coenzyme /v TTdnsiciaac 


XI 9^ 5 75e-32 355-408 


1521 


IPB001618 


Coenzyme A transferase 


37.23 6.92e-31 196-249 


1521 


lrB001839A 


transforming growtn tactor l \jrr ) oeta 
lamily 


19 ^7 ^ R1p-17 6^7-67^ 


1521 


T13DAA 1 OOnt> 

IrJbJOOlooyB 


l ranstorming growm ractor ^ i <jt j oeta 
iamny 


1 1 1 S 9 17p-19 704-71 S 


1 CO 1 

1521 


IDT3AA 1 QOfiP 


iransiormmg growm racior ^ i vjr j oeta 
famil v 

nil y 


19 SO 9 ROe-16 794-741 


1521 


PR00438A 


Growth factor cystine knot domain 
signature I 


11.01 4.00e-09 665-674 


1521 


PR00669E 


Inhibin alpha chain signature Y 


14.95 5.39e-12 640-657 


1523 


IPB000034A 


Laminin B 


22.21 8.56e-09 661-696 


1523 


IPB001762A 


Disintegrin 


23.93 4.21e-21 453-493 


1523 


IPB001762B 


Disintegrin 


10.06 7.30e-13 500-510 


1523 


IPB002870A 


Reprolysin family propeptide 


12.22 6.54e-09 109-125 
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Results* 


1523 


IPB002870B 


Reprolysin family propeptide 


24.73 9.38e-26 154-192 


1523 


IrrJUUZo /UJJ 


iveproiysin idmuy propeptide 


16 31 6 33e-l 1 323-338 


1 zroo 

1523 


TOT5 A AO O T AC 


jveproiysin iamuy propepuue 


11 90 7 57e-12 355-367 


1 coo 

1523 


TTJT3 A AO 0 1 AC 


iveproiysin iamiiy propepuue 


18 81 6 85e-19 398-422 


1523 


TTJT3 AAO OC/1D 

lrr>UU3o54r> 


vjiDoerejiiin rcguid.teu proiein 


19 26 4 69e-09 470-508 


1523 


T>T>AAOOn A 


uisiiiiegnn signature i 


14 29 2 55e-14 469-488 


1523 


T|T) AnO OOTD 

FKUU25yi5 


T"\-t r-i -■ 4* ror»-* i ft nl /~r »n of 1 it* A TT 

uisintegnn signature 11 i 


1 1 74 7 27e-10 498-510 


i .co /t 
1524 


TTJT3 AAAO O^T 


1 yrosme specmc protein pnoopnataac auu 
dual specificity protein phosphatase family 


10 77 5 00e-ll 186-196 


1 CO /I 

1524 


TDDAH1 f\T2 A 


oompiemeni i^iq protein 


22 14 5 50e-09 249-28^ 


1524 


TTITDAAI AT^ A 


oonipienierit Vn/iq proieiii 


^>2 14 5 50e-09 436-470 


"1 CO /I 

1524 


TDDAA 1 

lrr>UUloy5l^ 


i_>ysyi oxiaase 


12 69 2 64c-09 391-415 


1 CO A 

1524 


nn AA"7AAT\ 


r rotein tyrosine pnospiidiaoc bigiiaiuic i v 


12 83 5 79e-10 183-201 


1 CO c 

1525 


IrBUUUZ /oL 


KalOaopsin-iiKe on^^iv supeiid.uniy 


8 03 4 52e-09 5-16 


1 CO c 

1525 


TnDAAA07^r\ 


Knoaopsm-iiKe \jrJr^rv superiamiiy 


9 40 5 24e-14 45-61 


1 CO c 

1525 


FRUU23 /(jr 


Kxioaopsin-iiKe urUi\ superiamiiy 
signature VII 


19 23 8 77e-17 35-61 


1 CO c 

1525 


pjT) r\f\ A 1 ^ AT? 


Auenosme receptor signature v i 


8 75 5 34e-12 35-45 


1 CO c 

1525 


"DTD AACC/1T7 

rKUU554r 


Auenosine azd receptor bignatuic vi 


8 86 1 75e-16 24-36 


1 CO c 

1525 


rKUU554(jr 


Aoenosme azd recepxor aigiictiuic vn 


14 18 7 29e-12 69-88 


1 C0^7 

152 / 


TDT5AA1 A/1 <IX5 

Ir -DOU 1 U45o 


opermiame synuid.be 


18 40 1 35e-12 314-356 


1 coo 


TRT3 AAOA/1 Q 

lrr>UUZU4c5 


Ljir -nana iamiiy 


7 914 32e-09 338-350 


1 coo 


"DT> A1 OO^T7 

rKUxZooil 


v_jrpnan nuciedi icccptui i> wrv i bi^iictiuiv v 


5 27 6 65e-09 6-27 


1 CO o 

152o 


nnni o/TOTJ 
rKU13o2rl 


riageiiar caicium-Dinuing piuLein 
^caniagm^ bigridLuic viii 


4 86 7 13e-09 150-166 


1 -CIO 

1 D jZ 


Ir r>UU 1 OUZ/v 


T l-nnVtar nr*i-t*ri r 7F k r\ t^rr^tpin "Fa tn 1 1 v T IPP0047 
LJnt/llctlclL'tCl [Jl Ultsi.il Xdillliy \S 1 l \j\j~t i 


15 98 8 94e-10 119-137 


1 *Z1A 

1534 


lJrr>UU3oD 1J3 


CiH protein 


9 06 2 96e-09 1051-1065 


1 CO A 

1534 


TjT) AAAO 1 A 


omau proime-ricn pioteni bigiia.uiic i 


3 314 55e-09 1044-1056 


1535 


lrr>UU135yxl 


oynapsm 


22 58 6 59e-09 39-89 


i roc 

1535 


r KUUo / 1 L, 


inniuin octd. j_> ujiaiu biguaLuic xix 


4 18 9 69e-09 44-63 


153o 


TPHAAAAQ^T? 


PATT-Krw /P91 -T?ViA-hin diner 


16 47 3 95e-09 146-200 


1 <1 Q 

153o 


lJrx>UUUoo 1U 


PK'TsJ/rnri-nViilin/rlintP'ViTi rhn-hinrHnp - Tftneflt 
i J>k.lN/IUO}JIllllIl/ illULCivlli iiiu L/iiiu.iiig i\jyj<j<x\. 


13 73 9.15e-ll 144-193 


153o 


lr DV\J\JyZ>yLJ 


V^CW (~} Vi/'w rlnr^l ir*^!"/ 1 */^! rr* oi oti 
lULU PUa UUpiiL'dl^/U ltrglVJil 


27 01 6.72e-09 196-248 


153o 


TDD AAAO^i 1 


Pfr\t<a>i-n Vinci cp f^— tprmitipl HnmJiin 
JrlOteill ivllldbC iciiiiiiicti uuiiidiii 


21 23 2 00e-14 138-179 


1 coo 

153o 


TPT3AA1 A 


Tirrncinp Irinoop r'dtdl'vtir* H AtTl 3 in 

l yrubiue jviiidbc L/cticuyLi^ uuuia-iii 


22 45 6 63e-16 82-122 


i coo 
153o 


TDDAA1 O/I CO 


i yrosme iviriaoe v^dLcuyLiv^ uuuictui 


21 68 7 55e-17 142-180 


153o 


TDDAA1 ll'jr^ 


JSJlldbC aooUk/lalCU UOlIlalli 1 


20 66 9.73e-16 77-107 

XW V/ ft X** ft / A. *ft# 9 4 -*- ff 


1 roo 
153o 


IrrSUUl / /21J 


T<T i n o c <=» ciconpi q+pH Hr\rn?iin 7 
iVIIldbC aooUOlaLCU UUlilalli 1 


21 67 4.67e-09 149-188 


1 c o o 

1538 


TDDAA1 '7'70 H? 

IroUUl / /21i 


fl'vin CO IPC Af*1 O "t"^r^ r\ AWl Q 1 Tl f 

JtVlIldbe dbbvJL/ldLCU UUUld.111 1 


24 88 2 67e-10 203-242 


1 C50 

153o 


lr J3UU35Z /U 


IVlAr KindbC 


21 53 2 04e-09 135-176 

4_ ^ mm* ^m* mrnrn^J ft V/ i _ft. A- ft 


1 coo 
153o 


lrr>UU3DZ /Lr 


TV/TAP Vino cp 
IVirt-l ivllldbC 


17 26 2.38e-09 215-252 

1 f m ^mmf ^mf mSmW ft V ^> Aft A. mm* m*mmr 


i con 
153y 


nn AA^AOTl 

x^KUUOUzri 


INUJLJJL/v IiyUiOldbC IdUlliy olgllalUlv ll 


14 82 7.53e-09 ni-146 

X T • * 4 ft *v w' ftft< V BftT A. maS A. -ft. ■ 


1 C A A 

154U 


IroUUU / / /A 


linveiope giyL-upiuLciii vji l 


1? 67 9 67e-09 6^2-665 


154U 


IroUUUoZZ 


7inr > "Fin opt i^s T-T/ tvnp 

ZjIUL/ iiiigdj type 


14 67 9.57e-10 729-754 

X 1 ft V ft — *^ ft ■ *- ft MftT »^ * I 


1 C/1A 

154U 


lrr>UU5U30O 


r^rdor P^O pr\t*p cVipll nrotpin 
vJdg rjv i^uic bixwii pi uiciii 


11 53 4 86e-09 799-815 


1540 


PR00049D 


Wilm's tumour protein signature IV 


0.00 1.79e-10 813-827 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 1.00e-12 537-574 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 2.52e-l 1534-571 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 2.56e-09 543-580 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 5.85e-ll 546-583 


1541 


1PB000885A 


Fibrillar collagen C-terminal domain 


11.46 6.69e-09 555-592 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 7.27e-ll 552-589 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 7.36e-10 540-577 
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ivesuiis* 


1541 


i"DT3AAAQC*I A 


JTlDTlllar COllagen ^✓-ICIIllluai UAJiJiain 


11 46 8 04e-10 528-565 


1541 


T"D"QAAAOC J x A 


riDnuar collagen ^-leruiiiiai uumaui 


11 46 9 12e-10 549-586 


1 C A 1 

1541 


lr\BUUUoo5A 


jriormar collagen v^-ieririiiiai uumani 


1 1 46 9 41e-10 5^1-568 


1541 


1"DT3AAAQQ^T5 


T7iKfil1ot« 1 a <tp»« tprmmnl <"1rHTI£11"n 
PlDriildr COlia^eH V*»-ICI1I111 iai VH-Uiiam 


19 15 1 77e-10 520-573 


i ^ a i 
1541 




"KSKrillar <r<=»n f^-tprminfll domain 
JTlDrillal COllaJ^Cil ICllIllllcll v4W111l4.hi 


19 15 3 70e-09 547-600 


1 C A 1 

1541 


TTJDAAAQQ <T5 


rlDrlllaT COlld-gCll \^-LCIlIllila-l UvJiiiciiii 


19 15 4 lle-09 538-591 


1 C A 1 

1541 


TDDAnnCQ<I3 

lrx>UUlJoo5i5 


Jrioriiiar coiid.gcn v^-Lcrmiiicii uunidii i 


19 15 5 01e-09 544-597 ! 


1 -C A 1 

1541 


TDDAAAOO 

lPBUUOoo5Jb5 


T?« U v { 11 l1r nnllo iTcm P„lpr m ? n ci 1 /^r%tn Clin 

rlDnilar collagen v^-iciminai uornaiii 


19 15 6 49e-10 517-570 


1 C A 1 

1541 


TDDAAACC^D 

IPoUUUooDjo 


irlDrlllar collagen i^-icrniinai uoiiiani 


19 15 8 lle-10 523-576 


1 iT A 1 

1541 


TDDAAnOQ<t3 

IFjdUUUoojjd 


jrlDilllm collagen i_>-Lerriniiai uuiiiain 


19 15 9 67e-09 541-594 


1 C A 1 

1541 


TDDAAACQ C/" 1 

ljrJDUUL)oo5C 


FlDrlliar collagen \^-Luriniiiai uuiiiain 


16 79 1 00e-40 639-678 


1541 


IPB000885D 


Fibrillar collagen C-terminal domain 


18.84 4.75e-32 728-765 


1 C ,4 1 

1541 


IroUUUoojJti 


ri Drill ar collagen ^-Lcrnnnd.i uomain 


91 1 00e-40 800-841 


1541 


IPB001073A 


Complement Clq protein 


22.14 L00e-08 554-588 


1541 


ryiTlL AA1 AH^ A 

IPB001073A 


Complement Clq piotem 




1541 


IPB001073A 


Complement Clq protein 


22.14 7.32e-13 548-582 


1541 


TTV"r\ A A 1 f\^f^ A 

IPB001073A 


Complement Clq protein 


ZZ. It- o.uJ?e-vJ'i' j*tZ.~J /O 


1541 


IPB001073A 


Complement Clq protein 


22.14 9.50e-ll 545-579 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


20.1/ Z.55e-lZ D4O-0UU 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


ZO.lZ 3.U0e-lU 551-0U5 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


zo. 12 5. 54e-uy 5Z/-J /y 


1541 


IPB001442A 


C-termmal tandem repeated domain in type 
4 procollagen 


zo. lz o. ize-iu j45-jy / 


1541 


IPB001442A 


C-termmal tandem repeated domain in type 
4 procollagen 


o^: n o i A,, i a ^ao ^qa 
Z0.1Z c5. lue- 1U D^fZ-jy^ 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


zo. iz cs. jye-uy 5z^f- j /o 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


ZO. lz a.yoQ-\)y JjO-joo 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


zo. lz y.34e- 1 u Dzi-j id 


1541 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


1 0 15 1 lOu AQ *szl^ ^<^^ 

iz.5o i. ize-uy j^+o-joo 


1541 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


iz.oo 1. /oe-iz j4y-joy 


1541 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


lZ.5o 1. / /e-iu JJZO /z 


1542 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 3.88e-ll 1025-1035 


1542 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


o.o0 4.uye-iu yzz-y^fz 


1542 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


o q^: o nA 0 1Q 1A0A 1A^^\ 

o.oo o.zue-io iuzu-iU3j 


1 ^49 




Protamine 9 fPRM2^ 

JT IU I dill HJLC £* -i-^~i-V 


5 26 9.27e-10 430-464 


1542 


IPB001862F 


Membrane attack complex 
components/perforin/complement C9 


29.39 5.39e-13 901-948 


1542 


IPB001881A 


Calcium-binding EGF-like domain 


8.72 3.40e-09 916-925 


1542 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 7.00e-16 1020-1031 


1542 


IPB003886D 


Extracellular domain in nidogen 


13.91 2.77e-10 1020-1039 


1542 


PR00010A 


Type II EGF-like signature I 


12.91 2.64e-09 912-923 


1542 


PR00010C 


Type II EGF-like signature III 


6.98 1.58e-10 1025-1035 
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1542 


PR00010C 


Type II EGF-like signature III 


6.98 4.07e-09 932-942 


1542 


PR00011D 


Type III EGF-like signature IV 


12.12 2.54e-09 929-947 


1542 


PR00764F 


Complement C9 signature VI 


15.74 1.60e-10 918-938 


1542 


PR00907B 


Thrombomodulin signature II 


11,50 2.68e-10 1016-1032 


1543 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 1.00e-ll 88-125 


1544 


IPB001359H 


Synapsin 


22.58 2.49e-09 244-294 


1544 


IPB001394A 


Ubiquitin carboxyl-terminal hydrolase 
family 2 


12.64 8.00e-19 478-495 


1544 


IPB001394B 


Ubiquitin carboxyl-terminal hydrolase 
family 2 


17.87 3.16e-19 684-713 


1544 


IPB001394C 


Ubiquitin carboxyl-terminal hydrolase 
family 2 


8.52 1.00e-09 771-780 


1544 


IPB002000D 


Lyso some-associated membrane 
glycoprotein (Lamp) 


5.87 5.58e-09 241-254 


1544 


IPB002999B 


Tudor domain 


7.50 7.55e-09 244-252 


1544 


IPB002999B 


Tudor domain 


7.50 7.55e-09 245-253 


1544 


IPB002999B 


Tudor domain 


7.50 7.55e-09 246-254 


1544 


IPB003861B 


E4 protein 


9.06 1.00e-08 246-260 


1544 


IPB003861B 


E4 protein 


9.06 9.61e-09 251-265 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0.00 1.29e-10 233-24/ 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0.00 2.68e-09 243-257 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0.00 4.05e-09 237-251 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0.00 4.05e-09 238-252 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0.00 4.05e-09 244-258 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0.00 5.92e-ll 242-256 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0.00 6.75e-ll 236-250 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0.00 7,38e-12 239-253 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0.00 8.01e-13 235-249 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0.00 8.93e-10 241-255 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0.00 9.39e-l 1 240-254 


1544 


PR00211B 


Glutelin signature II 


0.86 7.50e-09 238-258 


1544 


PR01102M 


5-hydroxytryptamine 6 receptor signature 
XIII 


11.13 6.32e-09 229-251 


1544 


PR01217G 


Proline rich extensin signature VII 


4.02 5.58e-09 233-258 


1544 


PR01471E 


Histamine H3 receptor signature V 


5.41 3.35e-09 235-250 


1544 


PR01471E 


Histamine H3 receptor signature V 


5.41 4.23e-09 236-251 


1545 


IPB001359H 


Synapsin 


22.58 3.80e-09 277-327 


1545 


IPB001359H 


Synapsin 


22.58 8.89e-10 437-487 


1545 


IPB001359H 


Synapsin 


22.58 9.83e-09 458-508 


1545 


IPB002999B 


Tudor domain 


7.50 2.64e-09 1028-1036 


1546 


PR00019B 


Leucine-rich repeat signature II 


11.42 4.33e-10 317-330 


1546 


PR00416B 


Eukaryotic DNA topoisomerase I signature 
II 


15.23 3.03e-09 389-408 


1547 


IPB000801A 


Putative esterase 


15.56 7.75e-09 522-533 


1547 


IPB003533C 


Doublecortin 


10.25 5.14e-10 627-670 


1549 


IPB001698A 


F-actin capping protein beta subunit 


11.94 2.89e-23 47-100 


1549 


PR00192A 


F-actin capping protein beta subunit 
signature I 


8.23 1 .47e-27 47-67 


1551 


IPB002469G 


Dipeptidyl peptidase IV, N-terminus 


26.76 9.24e-ll 362-400 


1551 


IPB002469H 


Dipeptidyl peptidase IV, N-terminus 


21.17 6. 14e- 16 407-442 


1551 


IPB002469I 


Dipeptidyl peptidase IV, N-terminus 


10.99 4. 86e- 16 452-470 


1551 


IPB002469J 


Dipeptidyl peptidase IV, N-terminus 


8.97 3.52e-12 534-550 


1551 


IPB002471B 


Prolyl endopeptidase family serine active 


24.90 3.66e-l 1439-470 
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site 




1552 


IPB000183D 


Orn/OAr/Arg aecarooxyiases iamny 


14 78 6 76e-12 38-57 


1552 


IPB000183E 


Orn/UAF/Arg decarooxyiases iamny 


17 25 2 67e-09 106-129 1 


1552 


IPB000183G 


Orn/DAP/Arg decarboxylases family 2 


21.16 1.00e-17 171-191 


1552 


IPB003873C 


Nonstructural protein JNio/smaii envelope 
protein E 


7 Q7 8 03e-10 159-172 


1552 


PR00163B 


Rubredoxin signature II 


1^17 4 Q0e-09 143-159 ! 


1552 


PR01 179E 


Ornithine/diaminopimelate/arginine (ODA) 
decarboxylase family signature V 


16.24 5.06e-ll 180-193 


1552 


PR01182G [ 


Ornithine decarboxylase signature VII 


11.35 7.00e-14 167-180 


1558 


IPB001331C ] 


Guanine-nucleotide dissociation stimulators 
CDC24 family 


16.09 2.40e-12 1104-1129 


1558 


IPB001478B 


PDZ domain (also known as DHR or 
GLGFl 


6.12 7.19e-09 198-207 


•i rr r C\ 

1558 


IPB002532J | 


Hantavirus glvcoprotein G2 16.97 8.37e-09 1129-1165 


1558 


TTXT^k f\f\C\ a c\~v*\ 1 

PR00049D 


Wilm's tumour protein signature IV 


0.00 4.81e-09 1318-1332 


1558 


PR00049D J 


Wilm's tumour protein signature IV 


0.00 5.73e-09 259-273 1 


1558 


t»t» r\r\ c c A i "> 1 

PR00554B 


Adenosine A2B receptor signature II 


12.52 8.85e-09 917-925 | 


1558 


PR00834F | 


HtrA/DegQ protease family signature VI 


11.11 5.24e-09 194-206 | 


1559 


IPB000095F 


PAK-box /P21-Rho-binding 


16.47 4.5 le-17 374-428 | 


1559 


IPB000861D 


PKN/rhophilin/rhotekin rho-binding repeat 


13.61 6.28e-09 148-184 


1559 


IPB000961A 


Protein kinase C-terminal domain 


16.82 4.60e-13 150-184 


1559 


IPB000961D 


Protein kinase C-terminal domain 


21.23 5.64e-09 366-407 


1559 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 2.38e-13 370-408 


1559 


IPB001772A 


Kinase associated domain 1 


13.64 7.21e-10 148-179 


1559 


IPB003527A 


MAP kinase 


17.00 3.49e-09 157-182 


1560 


IPB003104B 


Formin Homology 2 Domain 


18.83 6.87e-21 785-814 


1560 


IPB003104C 


Formin Homology 2 Domain 


20.33 L27e- 14 957-984 


1565 


IPB000590A 


Hydroxymethylglutaryl-coenzyme A 
synthase 


17.12 1.00e-2o DU-/4 


1565 


IPB000590B 


Hydroxymethylglutaryl-coenzyme A 
synthase 


16.61 L67e-37 75-109 


1565 


IPB000590C 


Hydroxymethylglutaryl-coenzyme A 
synthase 


17.39 1.00e-40 110-156 


1565 


IPB000590D 


I Hydroxymethylglutaryl-coenzyme A 
1 synthase 


13.35 5.50e-21 165-196 


1565 


IPB000590E 


j Hydroxymethylglutaryl-coenzyme A 
synthase 


20.64 3.08e-31 306-359 


1565 


IPB000590F 


1 Hydroxymethylglutaryl-coenzyme A 
synthase 


16.80 4.27e-32 384-412 


1565 


IPB000590G 


Hydroxymethylglutaryl-coenzyme A 
synthase 


30.71 1.00e-40 448-502 


1568 


IPB003016 


2-oxo acid dehydrogenases acyltransferase 
j component lipoyl binding site 


20.69 2.17e-20 103-137 


1572 


IPB002043C 


Uracil-DNA glycosylase 


14.49 2.24e-09 666-676 1 


1574 


IPB000637A 


HMG-I and HMG-Y DNA-binding domain 
(A+T-hook) 


12.35 7.32e-l 0 275-295 


1574 


IPB000840C 


Matrix protein (MA) 


15.25 S.46e-09 245-290 


1574 


IPB000949H 


ELM2 domain 


21.43 9.41e-09 218-264 


1574 


PR00049D 


Wilm's tumour protein signature IV 


0.00 4.66e-09 254-268 


1574 


PR00456E 


Ribosomal protein P2 signature V 


3.08 5.60e-09 184-198 


1575 


IPB002567G 


Cell fusion glycoprotein K 


17.37 9.32e-09 231-270 


1577 


PR00756B 


Membrane alanyl dipeptidase (Ml) family 
signature II 


15.53 3.70e-13 232-247 
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ivesults 


1 577 

LD I 1 


DD n0756f" < 


lVyforv* V»T*c*n ^ 1 o fi \ /1 /IttAO'rt+i /4q oo /"TV/I" 1 ^ "fciTntlv 

ivieniDraiie aianyi uipepLiuabe ^ivii^ laiiiiiy 

MgllalUlC 111 


11 ^9 1 R4p_11 ^10^90 

1 1 .Jji 1 . O'tC 1 1 J 1 \J J JL\J 


1 570 

ld / y 


TPR0fln991 


TD t*i"\+ q m inp PI 
Jr lUiallllllc J: I 


5 AS Q 01e-l 1 6^0-656 


LD / y 


TPRnnn99 1 


PrA+QtYiitiP P1 
L: KJLallllllC Jl L 




1 570 
ld / y 


TPR0fi099 1 


PrAtamiTip P1 
JT lULalillllC x L 


5 48 9 65e-09 640-666 


1 570 

i d /y 


TPRnnnA09R 


i lULaiiiiiic z ysrss-ivLZtj 


5 26 5 70e-09 662-696 


1 570 

ld /y 


TPRnnnAQ9R 

Lr Lj\J\J\JHy 


Prntaminp 7 ^PP\/f9^ 
JT lULaiiiiiic Z ^X^JVLVIZ/ 


5 96 7 ^6e-09 62^-657 


1 570 
ld / y 


TPR0D0S99 




14 67 7 38e-00 586-611 


1579 


IPB003868A 


Herpesvirus UL31-like protein 


9,31 6.11e-09 640-654 


1 5RO 


TPR00n91 5 A 


oerpms 


1^015 01p-1 7 6?-86 


1 5sn 


TPR0nn9 1 5R 
lJrJ3LrUUZ L DD 


ocipins 


0 R7 9 ? Op-HQ 1 66- 17S 


1 5R0 


TPRnnn9 i 5f* 


uCl pill a 


1 ^ 90 4 ^2e-1 4 1 96-21 0 


1580 


IPB000215D 


Serpins 


15.35 4.41e-14 407-433 


i roi 

IjoI 


lrrJUUUDJ ItS 


caiponin iamny repeat 


1 7 9S3 1 11 1 1 A 1 90 

i/.zo i.Oje-ii iiu-izi? 


1 CO 1 


IrBUUl / IDA 


calpomn nomoiogy (Orij domain 


lO 74 7 Q/^<=» HQ 1 \A 19/1 

iu. /h /.ooe-uy llht-lzh- 


i roi 

l!>ol 


T>r> HACQCP 


omootn muscle protein/ caipomn iamny 
signature 111 


11 59 Q in 1 If! 195 


1 COI 

LDO L 


DT> AAQQQ"D 


i^aiponm signature 11 


o 54 o nop no 191-1 


1 ^Q9 
LDoZ 


TPTIAAA/IQO, 

lr£>UUU4oo 


Leucine ncn repear \^-ienriiiiai uomain 




1 5S9 1 
LDO J. 


JUr t> UU j U U 0r> 


ininiunogioDuim a.ua mdjor 

IlloLUCOIIipaLlDlllty UUIIipiCA LlUHlcUXl 


90 9^ ^ ^5p-1 1 475-519 


1 5 89 


ppnnni o a 




1 1 79 9 OOp-OO 05-1 OS 


1 5R9 


ppnorn or 

Jr J\uU v J. y LD 


T pi i pi n p_n fli rpnpof oi crti tt"^ T! 
I^CUL'IIIC ILKsil ICpCdL olgllctlUIC 11 


1 1 42 1 00e-10 140-1 5^? 


1 5S9 
13 OZ 


ppnnm or 




1 1 49 5 67e- 1 0 99- 1 05 


1 589 
1D0Z 


ppnnm qr 


L-eucine-ncn repeal signdiure 11 


1 1 49 6 45p-00 1 1 6-1 90 


1582 


PR00019B 


Leucine-rich repeat signature II 


11.42 9.18e-09 164-177 


LDOO 


ppm 95^R 


L/txi udnscripuon iacior signature 11 


5 09 7 57f>-00 ^^-45 

D.yZi f.D/C~\jy DD-t+D 


1 5Q7 
LDO 1 




iviiN kj linger 


m ^o 1 no^-DQ 18^-105 


1 527 
lOo / 


T"D"Rn09 C£7T5 


L^ysteine-ncn aomain ^v^ori^^ 


99 50 4 15p-15 1525-904 


1 5QA 


IJr l> \J UZ o o y£> 


aomam 


1 1 1£> S RR^-OQ 08-144 
1 1. /o o. ooc yjy y o 1*+*+ 


1 5on 


TPROn988QR 


Wi3V> QOITldln 


1 1 76 0 70p-10 100-146 


1 509 

id^z 


TPROn^RRSP 
IrDUUjOOOIl 


r i -ncn uomain iN-teirninus 


1 0 SO 0 00p-1 0 661 -71 0 

Ly.O\J y.\J\J\j LKJ \J\J L / L\J 


1 50^ 




r^yriciine nut/icoiivJ.c-uibuipiiiu.c 

UAiUUlbUULLajb tlUOo 11 


11 55 6 0Qp-1 0 747-767 


1 5Q^ 


TPR0008R5A 


l iuiiiii.il v\Jiiag^ii v^- niiiiai vjwiiiaiii 


11 46 2 17e-10 481-518 


1593 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 6.14e-09 475-512 


1 50^ 

LDyD 


TPRfifift8fi5R 


riorinar collagen ^-iciininai uomain 


10 15 7 ^Op-ftQ 600-669 
Ly.LD / .D\jc-\jy \j\jy ooz 


1593 


IPB000885B 


Fibrillar collagen C-terminal domain 


19.15 9.57e-10 320-373 


1 COl 

LDyD 


IroUUlU/ JA 


uompiement uiq protein 


99 1AA90<* in 517 

zz. in ^i-. zue- iu 'foj-j i / 


1593 


IPB001073A 


Complement Clq protein 


22.14 5.50e-09 486-520 


1593 


1"DT>AA1 f\Tl A 

lrr>UUlU/oA 


Complement Clq protein 


ZZ. 14 D.4oe-Uy /o 


1593 


IPB001073A 


Complement Clq protein 


22.14 7.19e-09 726-760 


1593 


TriD AA1 A A O A 

1PB0U1442A 


C-termmal tandem repeated domain m type 
4 procollagen 


O/T 1^ 1 AA« AO /I OA COO 

26. 12 1.0Ue-0o 4oU-53z 


1593 


TUn AA1 /I >l O A 

IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 3.43e-09 46o-520 




TT)t>rkA1 A An A 

!J J hJUU144zA 


C-termmal tanaem repeated domain m type 
4 procollagen 


zo. iz 4.ooe-uy o i j-oo3 


1593 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 5.21e-10 462-514 


1593 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 6.17e-09 483-535 


1593 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 7.81e-09 459-511 



WO 03/054152 PCT/US02/39555 



Table 3B 
433 



SEQ 
ID 

NO: 


Database 
entry ID 


Description 


Results* 


1593 


TOT") AA1 A A TT) 


C-terminal tanaem repeated aomain m type 
*f procollagen 


19 9 'XAn. DO ^A/7 
IZ.jO Z. JH-e-Ui? /"DV 1 


1 <oi 

iz>y3 


TDPjAAI /1 71915 

IrMSUU 144Zri 


Lv-termmai tanaem repeatea aomain m type 
t procollagen 


1Z.JO O.l IC-U7 ^ol-jUl 


l jyj 


tptiaai 449~r 


L^-terniinai tanaem repeatea aomain in type 
*+ procollagen 


19 7 60p-0Q 4R4-504 


i jyj 


lr J3UU It'J I 


OdLLcIltll LI dllblCl aoC IlCAcipcpilU-C ICpCdL 




1 sen 




OaL/Icridl Li allblOl abC IlCActpcpLlUC ICpcaL 


18 07 5 14e-10 550-585 




TPRfiOl ASI 
Lrr>\J\J LH-D 1 


JDd.L/LCIld.1 11 allolcl aoC llCAa.pcpLluc icptaL 


IS 07 f? 90p-10 55fS-5Q1 


1 SQ^ 


tPRAA9/t74 A 


vjni" id.Tniiy pcrmcd.hc 


90 01 7 QRp-OQ 170-91Q 




Jr XV.U U U O A 


1 ypC 1 cLllLlil VJUZ/C piULClll oi^iLclLUI O 1 


3 72 1 00e-08 527-541 


1 5Q^ 




' 1 \rnp T cj , n+"i"Ft*P»^ r 7/= i T^Tr^+pin ci onci+iiT"/ 3 T 
1 ypU I dilLlll CCZiC JJIULC1I1 blgildLUl C X 


3 72 ^ 77e-09 371-^85 


1593 


PR00308A 


Type I antifreeze protein signature I 


3.72 6.13e-10 372-386 


1 SQl 




i ype i anTiireeze proxem signaiure i 


3 79 Q HQ 756-770 


1 ^Q^ 
1D;0 


pphaqaq a 


1 ypC 1 dIlLlllCCZ<C piULClll olg.IIcLLUlC 1 


7 7? Q 4Sp-1 0 759-766 


1 SQ^ 




i ype i antirreeze protein bigndiure 111 


7 70 9 ^0p-00 ^75-^S4 


id^o 




i ype i annrreeze proxem signaxure 111 


9 70 5? R9p_0Q ^94-^^^ 


1593 


PR003O8C 


Type I antifreeze protein signature III 


2.79 9.53e-09 372-381 


l jyi 


rKUU3UoC 


i ype l antirreeze protem signature in 


L, fy y .DyG- L\J j / j-joZ 


1593 


PR00350E 


Vitamin D receptor (11 nuclear receptor) 
signature v 


11.57 4.21e-09 13-32 


1593 


PR01286E 


Orphan nuclear receptor NOR1 signature V 


5.27 9.02e-09 365-386 


1593 


Tir> A1 r AAn 

rR015UUB 


Tropoelastin signature II 


9 AO 1 TAa 9/^ TOO O/IQ 


1593 


PR01500C 


Tropoelastin signature III 


6.02 1.3 le-24 306-324 


1593 


PR015HD 


Kvl.4 voltage-gated K+ channel signature 
IV 


3,91 3.47e-Uy 3 /3-3o3 


1593 


PR015HD 


Kvl.4 voltage-gated KH- channel signature 
IV 


3.91 5.o5e-09 326-336 


1593 


PR015HD 


Kvl.4 voltage-gated K4- channel signature 
IV 


3.91 5.85e-09 757-767 


1595 


T"n"P» AAA 1 <^ A 

IPB000130 


Neutral zinc metal lopeptidases, zinc- 
binding region 


C O ^ T1 „ AO 1 O'T 1 0*7 

5.86 5.71e-09 187-197 


1595 


PR00756C 


Membrane alanyl dipeptidase (Ml) family 
signature III 


11 ^i/i oi a n ki 

1 l.jZ 4.ole-lZ iDl-lol 


1595 


PR00756D 


Membrane alanyl dipeptidase (Ml) family 
signature iv 


1 A HQ O ^Co 1Q 1 QH 


1 ^o^ 


rKUU /DDJti 


iviemDrane aianyi aipeptiuase ^ivii j ramny 
signature v 


1 A ^7 9 49p_1 9A6 91 R 


iOUU 


iJr £5UUU / 1 ou 


i iiyrogiouuiiii iypc-1 repeat 


1 5 4Q 7 67p-1 ^ Q6-1 1 0 


1601 


PR00342A 


Rhesus blood group protein signature I 


13.65 4.75e-21 12-30 


lOUO 


TP13 AA99 1 Q"R 


jrnorooi esters/ aiacyi glycerol Dinning 
aomain 


19 Q 1 ^p_0Q 9^n_94S 


1 ^A£ 


TPT3AA9C^7 A 


v^-^vyv^.- zinc rmger 


14 60 4 8£p-91 6R O0 


1OU0 


TPT5AA7QQQr^ 


1 uaor uomdin 


10 ^ 0 1 Sp-00 540-54Q 


luUu 


TPT3 AA9/C/1 QT3 

lroUlo04y.D 


rJ-oox v_,-terminai aomain 


99 1 6 7 57p 1 0 77 
LjL. 10 / .J /c- 1U JO- / / 


1 £:a£ 




uisease resistance protein signature i v 


14 SO 1 00p OS 7^0 74^ 


1613 


TPR002004A 


Pol V— P)nfiTi vlfitf* V>inrli"n sj nrntRin nniniip 

A \Jljf auvnjlalC L/XllVlHig, jJlULwlllj LilliULlV 

domain 


18 93 5 36e-10 150-197 


1614 


PR00258B 


Speract receptor signature II 


7.94 7.75e-13 64-75 


1614 


PR00258C 


Speract receptor signature III 


9.05 2.72e-09 79-89 


1614 


PR00258D 


Speract receptor signature IV 


14.29 6.47e-10 110-124 


1614 


PR00258E 


Speract receptor signature V 


14.06 8.78e-13 133-145 


1615 


IPB003780E 


Cytochrome oxidase assembly 


15.53 7.40e-14 59-83 


1616 


IPB000135D 


High mobility group proteins HMG1 and 


2.13 1.00e-12 318-342 
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HMG2 




1616 


TT^Tl AAA1 ^ f ~T~*\. 

IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 UUe-lU 31 /-341 


1616 


IPB000135D 


TT* 1 1 '1 '±. , TTA — J 

High mobility group proteins HMG1 and 
HMG2 


2.13 z.loe-lD 3ZZ-34d 


1616 


IPB000135D 


T T • 1 1 '1 'a. a. T TT\ ,T / — 1 1 J 

High mobility group proteins HMG1 and 
HMG2 


2.13 3.35e-ll 3Z/-3j1 


1616 


IPB000135D 


TT* 1 1 • T • , _ j < T TA / 1 ^ J 

High mobility group proteins HMGl and 
HMG2 


O 1 1 1 O O rt ftfl Oil 1T7 

2.13 3.oze-Uy 313-33 / 


1616 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


2.13 4.18e-0y 316-340 


1616 


ttm^aaa i *y ^ "T"X 

IPB000135D 


TT" 1 1 * "1 * i j * T TX jT A 1 J\ 

High mobility group proteins HMGl ana 
HMG2 


2.13 4.4/e-lZ 3iy-343 


1616 


IPB000135D 


High mobility group proteins HMGl ana 
HMG2 


2.13 4.53e-14 320-344 


1616 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


2.13 4.58e-l 1 328-352 


1616 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


2.13 4.58e-12 326-350 


1616 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


2.13 5.1 le-15 324-34o 


1616 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


O 1 1 *7 AArt. AO OTA OC/I 

2.13 /.U0e-Uy 330-354 


1616 


TTin AAA1 *-* ^"T"V 

IPB000135D 


High mobility group proteins HMGl and 
HMG2 


2.13 7.o9e-14 325-349 


1616 


IPB000135D 


XT* 1 -| ■ 1 • j j T T> iff 1 3 

High mobility group proteins HMGl and 
HMG2 


2.13 7.85e-10 329-353 


1616 


IPB000135D 


TT* 1 1*1 * a. j ' T TTV >f / — I -1 J 

High mobility group proteins HMGl and 

TTA /~"'*» 

HMG2 


O 11 O 1 A „ 1/C in IvIT 

2.13 8.12e-16 323-34/ 


1616 


IPB000135D 


High mobility group proteins HMGl and 

T TA At — 1 '*"Ti 

HMG2 


2.13 8.91e-09 315-339 


1616 


IPB000135D 


High mobility group proteins HMGl and 

TTTV vr^i 


2.13 9.80e-10 312-336 


1616 


IPB000135D 


High mobility group proteins HMGl and 
rlMLrZ 


Z.13 y.o /e-lD JZ1-J4D 


1616 


1PB000533E 


Tropomyosin 


1 1 11 K 1 A/a HQ 1%f\ A1 7 

1 1.3Z j. ioe-uy jou-41 / 


1616 


1PB000637B 


HJVlu-1 ana HMu- Y JL/JN A- Dinamg domain 
(^a+ i -nooK j 


14.Z1 0.4De-Vjy jZ4-^4Z 


1616 


lrB0U0o3 /.B 


rlMvj-l ana riiviLj- Y urNA-Dinaing domain 
i -nook ) 


1 A 01 Q 1 So HQ ^ZlA 


lolO 


TOT} A A 1 .lOOA 

IroUU 14ZZL/ 


rseuromoauiin ^uAr-H-jj 




lOlO 


lrr>0U14ZZU 


JNeuromoaunn ^orAr-'4o ; 


10. oz z.uie-vj!7 jio-jjj 


lolo 


lr\t>UU 14ZZl^. 


rseuromoauiin ( i uAr-*tjj 


1A R9 9 SOp-DQ ^90-^SS 


1 A1 £ 
lO 1 0 


llr rJUU 14ZZ^ 


iNeuromoauiin ^i_rAJr-*4o^ 


1A R9 ^ ^^p-00 ^91-^^fi 


1 A1 A 
lOlO 


Lr JdUU 14ZZl»> 


iMeuromoQuiiu i^vj^a.i -h- o j 




lOlO 


IJtJdUU 14ZZl^ 


rseuromoauiin ^vjr^\jr-*+j^ 




161 6 




Neuromodulin fGAP-43^ 


16 82 7 33e-13 319-354 


1616 


IPB001422C 


Neuromodulin (GAP-43) 


16.82 7.63e-10 317-352 


1616 


IPB001422C 


Neuromodulin (GAP-43) 


16.82 7.64e-09 315-350 


1616 


IPB001422C 


Neuromodulin (GAP-43) 


16.82 9.29e-10 314-349 


1616 


IPB001580F 


Calreticulin family 


2.93 4.94e-10 325-334 


1616 


IPB001580F 


Calreticulin family 


2.93 4.94e-10 326-335 


1616 


IPB001580F 


Calreticulin family 


2.93 4.94e-10 327-336 


1616 


IPB001580F 


Calreticulin family 


2.93 4.94e-10 328-337 
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1616 


IPB001580F 


Calreticulin iarnily 


2 93 6 85e-09 331-340 


1616 


IPB001580F 


Calreticulin family 


2 9i> 7 75e-10 336-345 


1616 


IPB001990C 


Granins (chromogranin or secretogranin) 


33.59 8.59e-09 309-356 


1616 


IPB002172 


Low density lipoprotein (UD-LJ-receptor 
class A (LDLRA) domain 


1,3 1 . _7\J\J 11 *J"T 


1616 


IPB002172 


Low density lipoprotein (UJl^-recepior 
class A (LDLRA) domain 


7^7 5 ?0p-1 ? 05 


1616 


IPB002360C 


i* i * 

Involucrin 




1616 


PR00194D 


Tropomyosin signature IV 


9.54 8.25e-09 323-346 


1616 


PR00261A 


Low density lipoprotein (LDL) receptor 
signature I 


1 S AO 4 IQa 10 dl-fid 


1616 


PR00261A 


Low density lipoprotein (LDL) receptor 
signature I 


1 S zlQ A ^Op-OQ R4-1 05 


1616 


PR00261B 


Low density lipoprotein (LDL) receptor 
signature II 


ic 17 9 SfWIO R4.-105 


1616 


PR00261B 


Low density lipoprotein (LDL) receptor 
signature II 




1616 


PR00261C 


Low density lipoprotein (LDL) receptor 
signature III 




1616 


PR00261D 


Low density lipoprotein (LDL) receptor 
signature IV 


i ^ C7 /to^o i n Szi i ns 

10.0/ ^.ZDe-lU OH-1UJ 


1616 


PR00261D 


Low density lipoprotein (LDL) receptor 
signature IV 


lO.O/ o.*f3e-il ^O-OH- 


1616 


PR00261E 


Low density lipoprotein (LDL) receptor 
signature V 


lo.OZ /,jZc-U7 oh-iuj 


1616 


PR00261E 


Low density lipoprotein (LDL) receptor 
signature V 


lO.OZ /.OJC-U7 tj-ut 


1616 


PR00261F 


Low density lipoprotein (LDij recepior 
signature VI 


1S 46 ? 65e-l 1 43-64 


1616 


PR00261F 


Low density lipoprotein receptor 
signature VI 


1 5 46 6 91 e- 10 84-105 


1617 


IPB002554A 


Protein phosphatase 2 A regulatory r> 
subunit (djo iamily; 


10 05 9 9Qp-27 165-194 


1618 


IPB003029B 


SI RNA binding domain 


9.42 8.80e-09 407-418 


1619 


1 IPB000001D 


Kringle 


11 31 2 88e-12 55-71 

1 1 ■ J 1 Ztf. UUv ii< /x 


1619 


IPB000126A 


Serine proteases, V8 family 


11.75 6.60e-09 55-70 


1619 


| IPB001254A 


Serine proteases, trypsin tamily 


o qq 7 /;5p_14 5^-71 
y.yo /.ujc / i 


1619 


PR00722A 


Chymotrypsin serine protease family (SI) 
signature I 


12.06 9.36e-14 56-71 


1619 


PR00839B 


V8 serine protease family signature II 


11.20 4.95e-09 55-72 


1620 


IPB000934C 


Serine/threonine specific protein 
phosphatase 


on 9^ ^ nn^-^4 71 -1 14 


1620 


IPB000934D 


Serme/threonine specific protein 
phosphatase 


1/1 oro SSi= 17 1 1 6-1 ^S 
14.00 y._oe-i/ l lo-ijj 


1620 


IPB000934E 


Serine/threonine specific protein 
phosphatase 


or 1 fZ o OOo 0*7 1 OA 99^ 

2j. lo /.ooe-z/ lou-zzo 


1620 




Qfari-n^/tVirRnninf* "nhn^nViatase "Fainilv 

signature IV 


11.97 4.60e-24 77-103 


1620 


PR00114E 


Serine/threonine phosphatase family 
signature V 


17.50 7.63e-23 106-133 


1620 


PR00114F 


Serine/threonine phosphatase family 
signature VI 


16.38 2.93e-15 170-190 


1620 


PR00114G 


Serine/threonine phosphatase family 
signature VII 


15.51 9.57e-17 198-214 
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1623 


PR01008D 


Flagellar L-ring protein signature IV 


9.99 8. 13e-17 35-49 


1623 


PR01008E 


Flagellar L-nng protein signature V 




1623 


PR01010G 


Flagellar P-nng protein signature VII 




1623 


PR01010H 


T"»1 11 Trv • i < ■ j "\ 7TTT 

Flagellar P-rmg protein signature VIII 


Iz.Io o.ooe-lo 114-13z 


1623 


PR01010I 


Flagellar P-ring protein signature IX 


*7 O/l C 1 1a 1 1 1 *70 1 OA 


1624 


IPB002195C 


Dihydroorotase 


11 /I A O O/C^ 1 O AO 1 AO 

11. 4y o.zoe-iz yo-iuy 


1624 


IPB002604F 


Chlorohydrolase 


1 1 00 1 QXa OO T3A lOQ 


1624 


IPB003764D 


N-acetylglucosamine-6-phosphate 
deacetylase 


no C/1 o nc* AO 1 1 a i 

o.Uje-Uy 114- ISO 


1625 


IPB000319C 


Aspartate-semialdehyde dehydrogenase 


12,28 6.63e-10 61-77 


1625 


IPB00031 9D 


Aspartate-semialdehyde dehydrogenase 


1 1 OO Q C\C\a. 1 A 01 1 1 O 


1625 


IPB000319E 


Aspartate-semialdehyde dehydrogenase 




1625 


IPB000319F 


Aspartate-semialdehyde dehydrogenase 


12.86 4.96e-10 238-251 


1625 


IPB000319G 


A '1111111 

Aspartate-semialdehyde dehydrogenase 


14.74 9.79e-16 25o-2/o 


1625 


IPB000534A 


Semialdehyde dehydrogenase 


15.79 7.88e-ll 64-81 


1628 


IPB000847 


Bacterial regulatory protein, LysR family 


15.44 8.67e-15 21-54 


162S 


IPB001S04D 


Isocitrate and isopropylmalate 
dehydrogenases 


24.39 9.18e~09 208-235 


1631 


IPB000121C 


PEP-utilizing enzyme , 


12.96 5. 15e-15 96-108 


1631 


IPB002192G 


Pyruvate phosphate dikinase, PEP/pyruvate 
binding domain 


23.23 3.77e-16 78-109 


1631 


PR00811C 


Bacterial general secretion pathway protein 
D signature III 


10.47 6.20e-09 9-19 


1632 


IPB002029C 


Aspartate and ornithine 
carbamoyltransferase family 


27.21 8.14e-21 401-439 


1632 


IPB003462C 


Ornithine cyclodeaminase/mu-crystalhn 
family 


8.98 7.00e-09 500-513 


1632 


PR00101C 


Aspartate carbamoyltransferase signature III 


15.49 8.93e-10 410-427 


1632 


PR01375B 


Salmonella/Shigella mvasin protein B 
signature II 


7.54 9.67e-09 201-217 


1632 


PR01507H 


Melanm-concentrating hormone 1 receptor 
signature VIII 


11.93 6.53e-09 354-373 


1634 


IPB001568B 


Ribonuclease T2 family 


11.65 1.00e-17 101-112 


1643 


IPB001014 


Ribosomal L23 protein 


22.21 o.03e-20 217-249 


1645 


PR01303D 


Plasmodium circumsporozoite protein 
signature IV 


10.57 6. 8 8 e- 09 486-503 


1647 


IPB001518B 


Argininosuccinate synthase 


12,16 6.87e-12 58-70 


1647 


IPB001518C 


Argininosuccinate synthase 


27.71 3.57e-27 72-114 


1647 


IPB001518E 


Argininosuccinate synthase 


20.16 5.24e-14 172-188 


1651 


PR00453A 


Von Willebrand factor type A domain 
signature I 


11.78 3.89e-13 88-105 


1651 


PR00453B 


Von Willebrand factor type A domain 
signature II 


13.84 6.10e-13 125-139 


1651 


PR00453C 


Von Willebrand factor type A domain 

• ^ i ITT 

signature III 


i i o/l i Z.Ha 1 a Oyi/r o^/l 
11. o4 3.!)/e-LU Z40-234 


1654 


IPB000222B 


Protein phosphatase 2C subfamily 


id.ou o.ooe-uy y4-iu4 


1 

lc04 


IJrrJUUUzzzC 


rrotem pnospnaiase suoiamny 


U. J mO\J\S 11 11 O IZl / 


1657 


IPB000917A 


Sulfatase 


9.52 4.79e-10 47-58 


1658 


IPB000063 


Thioredoxin 


9.89 1.69e-16 346-359 


1658 


IPB000063 


Thioredoxin 


9.89 2.38e-16 481-494 


1658 


PR00421A 


Thioredoxin family signature I 


10.35 1.00e-10 481-489 


1658 


PR00421A 


Thioredoxin family signature I 


10.35 2.13e-ll 346-354 


1658 


PR00421B 


Thioredoxin family signature II 


10.23 1.00e-ll 354-363 


1658 


PR00421B 


Thioredoxin family signature II 


10.23 7.75e-12 489-498 
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1658 


PR00421C 


Thioredoxin family signature III 


11.02 1.00e-09 394-405 


lODo 


tjp aaao i 
JrKUUHZlO 


T'Vi -r r\yc*r\ n\'in •fcnYiilx/ Ol <*r tl il til TP 1 IT 

i nioreaoxm idmny aigiidiuic a±a 


11 02 3 84e-10 533-544 


1661 


IPB003134F 


Repeat in HS 1/Cortactin 


15.66 9.68e-09 101-149 


loo2 


IrrJUU/jOUL/ 


mvoiucnn 


15 36 9 00e-09 56-97 


1662 


T)T)A1 T7f\ A 

FR01Z/UA 


riisione Qeaceiyiase superidiiniy M^ucum c i 


14 72 6 85e-10 1235-1258 


1662 


TIT* A1 OTATD 

PRO 1270 Jo 


Jnistone aeacetyiase supcridrniiy oigiidiiuc 

TT 
11 


0 70 5 76e-18 1269-1284 


looZ 


PKOiZ /OU 


xllSIOne QodL't/Lyiabe oUpcilalllliy oigjiiaLUi \J 

TTT 
All 


10 89 3 57e-09 1^60-1370 


loo J 


TJT> All A/1 A 

rKU 1 1 U4 A 


^r\Ila.piiyid.lOAIIl UIlCIIUJLd^ Lio i cucljlui 

CI (m 51 "fx IT*** T 
blgllCtlUl C A 


13 83 8 50e-09 83-94 


1666 


IPB000737 


Serine protease inhibitor, squash family 


13.14 8.31e-14 85-104 


1 £££ 
iooo 




CotYi -I fi o 1 t rani r»1 pi iTPrttpin T TPtl P 5lT C: 
jClnlllol VColCIC piULClll 1 lcptcuo 


9 00 1 00e-40 31-67 


iooo 


TDT5AAOAQB 


Odllillctl VCbldC piUldll l lCL»caia 


9 00 1 43e- 16 43-79 


IOOO 




QfiTnincil i/pciflf* nrrrfpin T T"t a nt a ft'f'^ 
Oclilillcxl VvolvlC pi L7LC111 J. ivptaio 


9 00 2 54e-17 19-55 


1000 


tpt^aaoaqs 


Oeillllldl VColl/lC piULClll X ICL/CCltO 


9 00 3 79e-15 37-73 


Iooo 


TD"D AflOAQQ 


oemindi vcaicit; piutcm i icpcctia 


9 00 7 15e-18 25-61 


1000 


TTJ'DAAOAQ.Q 

IrrJUUZUyo 


Qfaminal iTvaoij"*!^ T^frtt^lTl T Ti^nfJl'fQ 
vjeillllicll VColdC pi ULClll 1 UPpt'ClLo 


9 00 9 83e-10 49-85 


Iooo 


l.r oUUZZZ 1 D 


w/vir ~typc ^vviicy /t.ciu.i^ r lutciiiy iuui 

rliciil'Rrli 3 r»AT*f* rlATTidm 
UloUlllLlC L/U1C LHJ111U.111 


17 12 3 14e-20 92-113 


IOOO 


ppnAnrn a 


A-fxi Qii1r\Vii r?f* f*Arf=» ciicmixturf* T 
*+- uioUlLylllUw vvic oxgiia.LU.JL v x 


14 31 7.00e-09 69-78 


IOOO 


i J\u U U \J D \s 


zl-HiQiilnri iflp f*orp ^ipnatnre 1 1 1 

*T VllOLll JJlllU-t/ wvlv OlgllClLlXXV 111 


7.81 7.63e-12 98-107 


IOOO 




4«.Hiciii1nV\iHp pnrp Qipnatiife TV 

*T LllSUlLyillUw vvlv Olgllu-1.141 w 1 V 


9.63 8.50e-09 108-116 


1667 


IPB001909 


KRAB box 


17.37 1.00e-32 108-142 


10 /o 


TT>unn^ i ^nr 1 
Lrr>\J\)j LDVKs 


JvTTv J-'l>liA — UlllU-lllg UVJlllctlll 


25 34 7 08e-09 248-294 


1674 


IPB001322A 


Intermediate filament tail domain 


30.52 7.88e-ll 63-116 


lo /4 


tdt> ai nno A 
r'K.OiUOZA 


.riageiidr proiein ngj oigiiaLuic i 


10 02 9 65e-09 261-281 


1676 


IPB001865A 


Ribosomal protein S2 


35.08 8.94e-ll 53-106 


1678 


FROU/sOC 


Kyanoame receptor signaiure 111 


6 10 7 92e-09 1 07- 131 


1684 


IPB001998A 


Xylose isomerase 


15.52 3. 05e-09 214-227 


1684 


IPB002198 


JSnort-cnam denyarogenase/reauctase 
(SDR) superfamil^^ 


in 79 Q SOp-90 1 ^^-1 6R 


1684 


PR00080C 


oriort-cnain aenyarogenase/reauctase 
(SDR) superfamily signature III 




1684 


PR00081A 


Glucose/ribitol denyarogenase lamiiy 
signature 1 


1 A A7 S 9fip 1 0 


1684 


PR00081C 


Glucose/ribitol dehydrogenase family 
signature ill 


12.71 7.97e-09 127-143 


1684 


TiT") AAAO 1 T"\ 

PR00081D 


Olucose/riDitoi uenyarogenase iaimiy 
signature iv 


1 S 9? 4 71e-10 153-172 


1 /TO /I 

16o4 


"DO A1 1 

rKU 1 1 0 /r 


mseci aiconoi uenyurugeiidoe lamiiy 
signature vi 


1 1 22 2 34e-09 159-170 


IOoj 


Lri5\J\j 1 J ZO^ 


i->y-0/ U~F /\1N- UOIjUdlll 


13 04 2 03e~09 197-212 


1685 


PR01514F 


Kv2. 1 voltage-gated K+ channel signature 

VI 


2.00 5.29e-09 220-231 


1686 


IPB000095E 


PAK-box /P21-Rho-binding 


17.62 7.35e-13 679-724 




IPB000961C 


Prntein kinase C-terminal domain 


15.48 5.50e-12 678-712 


1686 


IPB001245A 


Tyrosine kinase catalytic domain 


22.45 8.88e-16 671-711 


1686 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 8.64e-09 736-774 


1686 


IPB003527C 


MAP kinase 


14.70 5.30e-09 663-711 


1686 


IPB003605C 


GS motif preceding kinase domain in TGF 
beta receptor 


14.92 2.91e-15 570-617 


1686 


IPB003605D 


GS motif preceding kinase domain in TGF 
beta receptor 


12.41 4.39e- 16 667-708 
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looo 


IrDUUooUDJD 


Uij rnoiir preceairig lvuidae uonidiix m a ^j-l 
beta receptor 


21 14 3 61e-15 710-749 


iooo 


irJaUUoowjr 


uj rnoiir preceuing Kiiia&e uwiiiaxix in i wi 
beta receptor 


23 21 2 56e-12 826-879 


loot) 




ACiivin type ii receptor bignaiuie 1 v 


12 01 9 76e-12 697-718 


1686 


PR00653E 


Activin type II receptor signature V 


14.33 2.93e-16 871-890 


1 /TOT 

1687 


xrBUUUoozA 


vj-protein coupiea recepxors iamiiy z, 
^secrenn-iiive ) 


1^ S1 ^ 25e~1^ 94-109 


loo / 




VddUuLLlVC llllCoLllldl pcpilUC Id/t/JJUJi 

oianatiifp 1 
olgllciLUIC 1 


12 62 2 98e-09 109-120 


loo / 


PI? 011 54 A 


VtfloW«C'LlVC lillCoLillctl jj c L) U uo l oucj^ iui j. 

<5ip"n?itiire 1 


11 05 3 84e-29 58-79 


1 6R7 
loo / 


PR01 1 54R 

.XXVU X UtD 


Va«:nartivp intestinal nentide recent or 1 
sipnatuTe 1 E 


12.24 1.39e-22 95-111 


1 6R8 
x uoo 


PR 00401 A 


SH2 domain signature I 


12.88 6.29e-15 214-228 


1 6S£ 

1UOO 


PR00452A 


SH^ domain sienature I 


9.57 2.35e-10 150-160 


1 688 


PR00452R 


v^i-f^ domain signature I' 


11.47 4.91e-12 164-179 


1 688 


PR00452C 


Sii-f^ domain sisnature ITi 


10.63 4.60e-09 181-190 


1 688 
1 uoo 


PR00499D 

J. xw ✓ J—' 


Neutrophil cvtosol factor 2 signature IV 


11.47 8.29e-10 152-172 


1688 

1UOO 


PR 01 260 A 


Na+/Ca2+ exchanger isoform 1 signature I 


9.46 2.75e-09 5-17 


1 601 


PR00019A 


T pneine-rich reneat signature 1 


11.72 1.27e-09 660-673 


1 601 

1U7J 


PR00019A 


T^eiicine-ricli reneat signature I 


11.72 1.55e-09 614-627 


i 6on 

1U7J 


PR 0001 9R 

x XNA/WW X 3? XJ 


T eucine-ricb reneat signature 11 


11.42 7.55c- 11 527-540 


1 601 


PR0001 QR 

I IW)\J\J X 27 XJ 


T piirinp-rirh reneat signature 11 


11.42 8. 91e-09 611-624 


1 604 


PR004Q9R 


TsJeiitronhil cvtosol factor 2 signature K 


9.48 4.73e-09 34-53 


1 605 


TPR001 ^59H 

XJ^XJWU 1 _7X X 


Svnansin 


22.58 4.58e-09 150-200 


1704 

1 / W *T 


TPR00097? 
ix xjwww^# / 


ATP1G1/PLM/MAT8 familv 

JT. XX X V-» X/ X X-/1VX/ XV J. XX J. U 1U11111 


14.36 8.53e-09 28-63 


1705 

X / UJ 


TPR000892B 


Ribosomal protein S26E 


13.49 3.50e-27 160-196 


1706 
l / wu 


TPR000906D 

XX UuuV7V/vi/ 


7!T J5 domain 

■* * \_/ +y ViV/iLlCllll 


23.89 2.54e-09 368-422 


1706 


TPR000906D 

XX UUvw wvl/ 


7,1 J5 domain 

/JU J VlV/lllfcllXl 


23.89 5.71e-09 437-491 


1706 


TPR000906D 


>^tJ5 domain 


23.89 8.54e-09 401-455 


1706 


IPB000906E 

XX Du V \J ~r \J \s X- I 


ZU5 domain 

* J ^rX>W^* X LlL^llX 


22.11 1.38e-09 448-488 


1706 


TPB000906G 

xx X.J v/uuy uu vj 


ZtJ5 domain 


25.85 3. 64e-10 382-430 


1706 


IPB000906G 


ZU5 domain 


25.85 5.87e-09 451-499 


1706 
X / uu 


TPR001580F 


Calreticulin familv 


2.93 8.20e-09 317-326 


1 706 
i / ww 


TPR00371 6C 

XJT XJVvJ i X wV^ 


RNA nolvm erase omega subunit 


13.72 9.39e-09 314-334 


1 706 
1 /wu 


PR 000 19B 


T eneine-rich reneat signature II 

IjvUvlllw llWll 1 V|^VHl' ljX^ilt*VVXi V> XX 


11.42 1.55e-09 742-755 


1 706 
i /wu 


PR00019R 


T eucine-rich repeat signature 1 1 


11.42 5.67e-10 773-786 


1706 


PR01415A 

x x^v x r j. xx. 


Ankvrin repeat signature I 


12.73 1.45e-13 415-427 


1706 

1 /WU 


PRO! 41 SA 

IT Ivv xx 


Anlcvrin repeat signature I 

JL VlliV V X XXX X VL/vUL UlgllCiL Ul W JL 


12.73 4.19e-10 451-463 


1706 


PR01415B 

X XV_v/ X *T X XJ 


Anlcvrin repeat signature II 

iklllVV 1 111 1 VL/VUL iJlgllMtLll V XX 


10.23 1.90e- 10 463-475 


1 706 

1 /WU 


PR01415B 

X 1W X J X *s \~* 


Ankvrin reneat signature II 

X XX XXV J X X X X X W 1^ V* v X *• v V* A. w X. X 


10.23 2.93e-12 427-439 


1 705? 
1 / wo 


TPB000822 


Zinc finger C2H2 tvoe 

xjiiiw itiigvi ■} w^dxiM r 


14.67 1.00e-18 235-260 


1708 
1 / wo 


TPB000822 


Zinc finger C2H2 tvoe 

XJlll V llllgvl j VJr J-/ W 


14.67 2.93e- 17 207-232 


1 70S 
1 / wo 


TPB000822 


Zinc finger C2H2 tvoe 


14.67 6.57e-10 263-288 


170S 
1 / WO 


TPR000S22 


Zinc finger C2H2 tvoe 

xjlllvy llll^vl ^ v>^-l X*-v IV L/vv 


14 67 8.71e-10 53-78 


1708 


IPB001275 


DM DNA binding domain 


19.17 7.04e-09 223-262 


1708 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.00e-08 204-217 


1708 


PR00048A 


C2H2-type zinc finger signature I 


9.94 4.38e-09 232-245 


1708 


PR00048A 


C2H2-type zinc finger signature I 


9.94 7.00e-09 260-273 


1708 


PR00048A 


C2H2-type zinc finger signature I 


9.94 9.63e-09 50-63 


1708 


PR00048B 


C2H2-type zinc finger signature II 


5.52 7.43e-10 220-229 


1708 


PR00498A 


Neutrophil cytosol factor 1 signature I 


12.92 6.74e-09 353-361 


1709 


IPB002870A 


Reprolysin family propeptide 


12.22 4.09e-10 108-124 
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SEQ 

ID 

NO: 


Database 
entry ID 


Description 


xtes u lis 


1710 


IPB000222B 


rrotem pnospnatase suuiamuy 


15 80 9 49e-09 1539-1549 


1710 


IPB001389D 


Flocculin repeat 


9.66 5.93e-09 1712-1739 


1710 


PR01391C 


Binary toxin B iamuy signature 111 


Id S7 S 44p-0Q 959-277 


1711 


PR01099B 


Hydroxyetnyltniazole Kinase iamuy 

_ • i . .„„ TT 

signature 11 


1 A AT> S 44e-09 6-27 


1712 


IPB000716C 


Thyroglobuim tj^pe-l repeat 


17 69 ^ 41e-12 287-305 


1712 


IPB000867A 


Insuhn-iiKe giowtli ractor-Dinaing protein 


7 19 7 19e-09 48-55 


1712 


IPB000867B 


Insulm-liKe growtn tactor-Dinamg protein 


11 44 4 1 ?e-l 8 66-82 


1712 


PR01 1691 


Ceratitis capitata alconoi aenyorogenase 
signature IX 


10 9 41e-09 425-445 


l_1715 


IPB003308C 


Integrase zinc-Dinumg aomam 


19 91 3 42e-33 22-75 


1715 


Tnn AAT1 A 0"P\ 

IPB003308D 


Integrase zinc-Dinumg aomdin 


8 41 7 43e-09 94-106 


1715 


T"m~1 AAO OAOt? 

IPB00330ob 


integrase zinc-Dinumg aomdin 


9 08 1 20e-ll 166-179 




IPB001134C 


INetrin, L^-terminus 


17 82 7 82e-10 1317-1331 


1717 ; 


tt~jT-j AA1 CC\C\ A 

IPB0Q1599A 


Aipna-z-macrogioDuim ldimiy 


10 97 1 45e-21 175-193 


1717 


IPB001599.B 


Aipna-z-macrogLODuiiii idiiiiiy 


7 45 3 35e-14 264-276 


1717 


TT4TIAA1 C C\C\ / — ** 

IPB001599C 


Alpna-z-macrogiODUiin iamuy 


14 40 4 60e-15 291-307 


1717 


IPB001599D 


Alpna-z-macrogiODUiin iamuy 


11 61 4 94e-l^ 783-793 


1717 


IPB001599E 


Alpna-z-macrogiODuim iamuy 


11 06 1 60e-10 810-819 


1717 


IPB001599F 


Aipna~z-macrogioDuiin iamuy 


18 95 5 30e-27 840-869 


1717 


IPB001599G 


Aipna-z-macrogioDuim idiiiiiy 


13 87 1 53e-13 988-997 


1717 


TTM1AA1 CAATT 

IPB001599H 


Alpna~z-macrogioDuiin idmny 


18 42 1 00e-23 1020-1047 


1717 


IPB001599I 


Alpha-2-macroglobulin family 


10.83 4.33e-12 1057-1066 


1717 


IPB001599J 


Alpna-Z-macroglODUlin iamuy 


20 99 3 10e-16 1088-1113 


1717 


IPB001599K 


Alpha-2-macroglobulin family 


8.15 7.60e-13 1273-1284 


1717 


IPB001599L 


Alpha-2-rnacroglobulin tamily 


1 8 66 4 O0p-95 1 104-1 33 1 

l o ,UU t.UvC 1 Ju7 J. JJ J- 


1717 


IPB001599M 


Alpha-2-macroglobulin family 


13.29 1.75e-12 1445-1456 


1717 


IPB001599N 


Alpha-2-macroglobulin family 


94 Q 77^-96 140S-1 510 


1717 


IPB001983D 


Translationally controlled tumor protein 


22.98 6.63e-13 1213-1253 


1718 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 l.UUe-lZ zi 1 


1718 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


O 11 O /11a 1 ^ 01 Q Ozl^ 


1718 


IPB000135D 


High mobility group proteins HMOl and 
HMG2 


1 1 1 9 1 7 77fl 7>4A 

2.13 Z.ooe-i/ zzu-z^Kf 


1718 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


1 11 1 /tlo 14 9152 949 
Z.13 3.43e-l i 4- Zlo-Z^tZ 


1718 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


9 19 A <Qo 1 1 914 9^8 
Z. 13 4. Doe- 1 1 Zl*+-Zoo 


1718 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


o i *a c Ola 19 999 94*7 

z.13 j.z3e-iz ZZJ-Z^-/ 


1718 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


9 19 ^ /Cfio 1A919 997 
Z. 1 3 D .DUe- lUZl j-Zj / 


1718 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


9 1 9 A 1 Sp HQ 9 1 7-796 
Z.13 0. l c>Q-vy Z1Z-ZOO 


1718 


IPB000135D 


High mobility group proteins HMGl and 


9 i 9 n < Q 1 *I 997 74^ 
z.13 /.Doe-iD ZZZ-Z^O 


1718 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


2.13 8.06e-l 1224-248 


1718 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


2.13 8.30e-14 216-240 


1718 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


2.13 9.13e-12 215-239 


1718 


IPB000135D 


High mobility group proteins HMGl and 


2.13 9.14e-17 221-245 
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SEQ 

ID 

jNO: 


Database 
entry ID 


Uescription 


xvvSltiiS 






HMG2 




1718 


IPBUUOliDD 


riign mobility group proteins oiviw i diiu 
HMG2 


9 n Q 1 Re-1 1 ?? i S-?49 


1718 


IPB000135D 


riiga mobility group proteins riJ.vi*_ri ana 
HMG2 




inl o 

1718 


IPBOllUoi /B 


tja /f i^l T n+-*A IT A /f i^l \r "TYM A "hi r» /-liner H r»m din 

Jn.iVivj-1 ana rnvivj- i urv/\.-Dinaing uunidin 
(A+T-hook) 


14 91 1 OOp-OR 924-242 


1718 


IPB000637B 


rlJVlij-1 ana riivio- 1 ursA-Dinamg uumam 
y/\rr i -nooK ) 


14 21 1 OOe-08 92S-243 


1 /lo 


IrhsUUUoo /Jd 


iiiVivjr-i ana riiviur- i jL^i>/\~Dinuiiig uuiiiain 


14 21 2 09e~09 223-241 


1 / lo 


ix xjUv/UOj /1j 


uynj onH HMH-Y DTsJA-hindinff domain 

lllVlvJ 1 dllLl 1 livl VJ 1 J-'l > ±V Ui i iviiiig uuiiicuu 

r A 4-' r_hnolr i 


14 21 4 82e-09 221-239 


1 / lo 


ll^tSUUUO-J / -D 


W\Ad-\ anH FTA/in-V DTsT A-hinHino: domain 
TA+T-hoolc^ 

1 1 1 HUUIv j 


14 21 8 36e-09 222-240 


1 71 S 
1/10 


TPR001 49?C 


T^JpiirniTinHTilin rrrAP-4^^ 

1 Nt/Cil VJlllUUUIlll ^VJ^l.1 ~*J J 


16.82 2.07e-10 218-253 


1718 
1/10 






16 82 2.13e-09 222-257 


1718 
1 / 1 o 


TPR001492C 


TsJpiirnmndulin ^GAP-43^1 


16.82 3.70e-09 220-255 


1 71 8 

1 / 1 o 




Neuromodulin fGAP-43^ 


16.82 3. 84e-10 217-252 


1 71 8 

1 / 1 o 


TPR001422C 


Neuromodulin fGAP-43") 


16.82 6.68e-10 223-258 


171 8 

1 / 1 o 


TPR0fl14??C 


Neuromodulin fGAP-43^ 


16.82 6.74e-09 221-256 


1 71 8 
1 / i o 


TPR001422C 


Neuromodulin TGAP-43^ 


16.82 8.93e-10 225-260 


1718 
1 / 1 o 


TPR001 


r^alff^tipnliti fRmilv 


2.93 4.94e-10 224-233 


1718 


11 IjVJVJ 1 JOul 


C^nlrptifMilin "Pmnilv 


2.93 4.94e-10 225-234 


1 71 8 
1 / 1 o 


1± D\J\J 1 J Oui 


f , a1rf a ti f*n1in familv 


2.93 4.94e-l 0 226-235 


1 71 8 

1 / 1 o 


TPR001 

11 DUU 1 JOwl 


C^alrfitiniilin familv 


2.93 4.94e-10 227-236 


1 71 8 

1 / 1 o 


TPR001 S80F 

11 Duul JOUl 


r^alrptipulin familv 


2.93 4.94e-l 0 228-237 


1 71 8 
1 / lo 


TPR001 S8QF 
ix duu ij o \jr 


r^nlrptipnlin familv 


2.93 4.94e-10 229-238 


171 8 
1 / 1 o 


TPR001 


("'alreticiilin "familv 


2.93 4.94e-10 230-239 


1 71 8 

1 / 1 o 


11 1_> vy V/ X 0 171 


Calreticulin familv 

\^dil vllvUUll -Itlllllljr 


2.93 4.94e-10 231-240 


171 8 

1 / 1 o 


TPR001580F 

_LJL J__> V_/ v UOuI 


Calreticulin familv 

VjCXI 1 V> L IV L41111 ILllllHJ 


2.93 4.94e-10 232-241 


171 8 

1 / 1 o 


TPB001580F 


Calreticulin familv 

V>Uil V LAV \-4r J. XI. X. ±L+liJ.ll J 


2.93 4.94e-10 233-242 


171 8 

1 / 1 o 


TPR001 580F 


Calreticulin familv 

V/Ull vUvUllll ICAllllljr 


2.93 4.94e-10 234-243 


1 71 8 
1 / 10 


TPROftl S80F 


Cn1rptir»ii1in farnilv 
y~sa.ll C/H^u.iiii lciiiiii y 


2.93 7.75e-09 235-244 


171 8 

1 / 1 o 


TPR0091 96A 


CnHliP'rin domain 

V_/dVJllV'l 111 UUlllulll 


14.68 7.00e-09 119-135 


1 71 8 
1 / 1 o 


TPR0021 96R 


CarlViprin Hotriain 

Vw^ClV_illwl ill UV^lllCllll 


12.04 5.20e-16 422-439 


171 8 

1/10 


TPR002126R 


CnHherin Hnmain 

V^ClVJllt'l 111 UVJ111C1111 


12.04 7.86e-13 156-173 


1718 
1 / lo 




Cpfllifrin Hrvmain 

Vw^ctUllCl 111 kJ*Jlllcilli 


12 35 8 41e-09 398-408 


1718 
1 / 1 o 


PR 001 94D 

lI\.uU 1 .7'TJU' 


Tronnm vo^iin ^ipnature IV 

1 1 WJJ1J111 y Uoiii oigiici.nan»/ x v 


9.54 1.94e-09 222-245 


1718 
1/10 


PR 001 Q4D 

1 SWJKJ 1 ^*Tl-/ 


TrnnnmvnQiii ^icrnatnre XSf 


9 54 2.05e-09 220-243 


1718 
1 / lo 


PR0070S A 


CaflliPTin cionntnrp 1 

VvClvllld 111 olglltlLLll \s 1 


17 38 3 50e-13 108-127 


1718 
1 / 1 o 


PR00205R 


CnHViprin cjionature 1J 

V-zClVll 1^/1 111 OlgUCllUlv 1.X 


20.09 3.30e-ll 551-580 


171 8 
1 / 10 


PR00205B 


Cadherin signature II 

V-'UllllVl 111 OlgllUlUJ V -LA 


20,09 4. 66e- 14 434-463 


171 8 

1 / 1 o 


PR00205B 


Cadherin <iiPnatureTT 

V— ' Civil 1^1 111 OlgLluLUl w -IX 


20.09 9.50e-21 168-197 


1718 
1 / lo 


PR00205C 


Cadherin ^ionature T ^ 

v_^civjiivi in Oigiiaiuj v in 


13.59 3.25e-ll 397-409 


1718 
1/10 


PR00905D 


Cadhprin «;icmatnre TV 

V/uUllvl 111 jl^UULUi V 1 T 


12.22 4.46e-19 413-432 


1718 


PR00205E 


Cadherin signature V 


10.82 4.32e-ll 432-445 


1718 


PR00205F 


Cadherin signature VI 


19.57 4.77e- 10 380-406 


1718 


PR00205F 


Cadherin signature VI 


19.57 5.20e-13 493-519 


1718 


PR00205F 


Cadherin signature VI 


19.57 7.30e-13 115-141 


1718 


PR00205G 


Cadherin signature VII 


13.05 8.71e-14 416-433 


1720 


IPB003360A 


US22-like viral protein 


16.29 4.27e-09 133-146 


1722 


IPB001128 


none 


11.42 5.15e-13 469-481 


1722 


PR00385A 


P450 superfamily signature I 


13.94 4. 19e-10 341-358 
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1U 

l\jn« 


Database 
entry xu 


Description 


Rp suits* 




PR00385C 


P450 sunerfamilv signature IK 


17.08 6.29e-10 395-406 


1 799 

jl / 


proo3R5d 


P450 Qimprfamilv <sicmatnre TV 

l "t*J\J OUL'L'lXCUllliy Ol£pliaiL41 V/ X V 


1111 9.47e-10 470-479 


1 799 


PR003R^.h 


P450 Qimprfamilv ^ionatnrfi V 

I ^^JKJ Jaiui i_y Qi£i,iiaLLii v 


13 50 5.50e~09 479-490 


1 799 


PR00463H 

X XVV \J*T\JJ X X 


R-r;1a«i<? P450 pronn T signature VIII 


11.44 5.68e-10 469-479 


1722 


PR00465B 


E-class P450 group IV signature II 


17.22 7.92e-10 113-136 


1 999 
1 /ZZ 




Tn_r»1cioa P/l^O mvMin T"\/ cicmntTirP III 
Xj-L-laob it-JW glUUp I V olgilalULC 111 


20 35 8 92e~23 332-358 


1722 


PR00465D 


E-class P450 group IV signature IV 


15.21 8.62e-16 390-406 


I /ZZ 


"DT? OO/I f^T? 


xi-ClaSS xt-^U group 1 Y MglldlUIC V 


1 5 83 4 41e-13 423-437 


1 TOO 

1 /ZZ 


PPfiA/1£<P 

rK.UU40Dr 


H-ClaSS x *-rD\J gTOUp 1 V blglialUlC V i 


1^ 57 1 00e-13 439-457 


1 noo 
1 /ZZ 


t)T3 no/1 


n-ciass r group iv aign<iiurc vxi 


15 73 8 07e-16 463-479 


1 999 
1 /ZZ 


ppnoA^w 
r^ix.uuH-ojxi 


X-/ _ Wl«.Oo r'+JU g,lUUp X V Dlg,ll<HUIC/ V XXX 


19 55 9 25e-21 479-497 


1 99£ 
1 /ZD 




irnniuiiugiuuuiiii ciiiu. niajui 

Vii ctr\r»r»TYiT>atiHi litv pfMnnlpY Hornain 

llloLL)t/\JlllJ-/«.LlL/lJlt3' ^VJllipit/iV VJ.Wllia-111 


17 51 7 65e-12 91-113 


1 79£ 
1 / ZD 


TPR003006R 


Ttnm i in ocrl oVii i1 in anH mm or 

XllllllUlllvJgHJ ULllill dllU lllojVl 

ViictoKnmnatiHilitv comolex domain 


20 23 2 80e-10 130-167 


1 797 


TPR001985A 


S vn ant onTi v«i in / ^ vn ant onori n 


12 19 1 00e-40 26-72 


1 797 
1 /z / 


TPR001 9RSR 


^l\/niintAnTi vein /^vnanfrmrvrin 


21 42 2 00e-35 73-104 


1 797 
1 /z / 


TPR001 9RSR 

1TDUU 1 Z.O JJJ 


Q vn antrtnli vQin /cvn ant onori n 
i_> y i iciiji-vj u'ii y oiii/ o jriic*jji,\_/|jv^i in 


21 42 7.86e-12 72-103 


1 797 
1 /z / 




Svtiflntnnh vQin/«ivn antonorin 

oyilcipLXjpilj&lil/ a^l la L^LwpvJi ill 


15 24 1 00e-40 108-148 


1727 


IPB001285D 


Synaptophysin/synaptoporin 


12.74 1.00e-40 149-183 


1 ^99 
1 /Z / 


TPPinrn 9r^p 

IrDWu IZo jjo 


o YIldpiAjpiiyoill/o yiiapivjpvJi ill 


9 55 1 00e-40 184-230 


1727 


IPB001285F 


Synaptophysin/synaptoporin 


6.39 1.00e-40 231-275 


1 /Z7 


T LVDOO 1 1 Q^T7 

IroUUiZoDr 


o ynaptopnys m/synapiopor m 


fx 3Q 1 S4p-0Q 948-909 


1727 


IPB001285F 


Synaptophysin/synaptoporin 


6.39 3.46e-09 242-286 


1727 


LPB0012o5r 


oynaptopnysm/synaptopoi m 


^ R ^5^-1 1 9^R-^09 
o.O-)C-l 1 /*JO~D\)^ 


1727 


IPB001285F 


Synaptophysin/synaptoporin 


6.39 9.83e-ll 263-307 


1727 


T-i T"» AATOn A 

PR00220A 


iSynaptopnysin/synaptoporin tamiiy 
signature I 


1 'X /^Q 1 41 & 97 9^ 


1727 


TIT> AfllOAD 

PR00220B 


bynaptopnysin/synaptoponn iamny 
signature 11 


19 8 ^1i=i 9R zL9 99 


1 TT7 
I /Z / 


PKUUZZUU 


oynaptopnysin/synapioporin iarnny 
signaiure 111 


1 1 Rr» 7 00p-9R 1 00-1 94 


Y 1 LI 


pp nn990F\ 


oynaptopnysin/ayndpLopuriii id.11111 y 

cicmdtiiT^ WJ 
ol^llclLUiv IV 


10 60 8 13e-28 132-155 


1 797 
1 /Z / 


PR00990P 


Qx/iio-r\'fr\"nIi\/t3tn/c''v/ii nntnnnrin "fatmlv 
O YiiapL\JiJiij'oiii/Q i yiia|-'LVJ^\Ji 111 xcniiii^ 

siQrati lfft V^ 


4 56 5 68e-23 199-217 


1797 


PR00761 A 


Rindin nrecur^or sienatiire 1 


6.20 7.00e-10 287-303 


1798 


TPR002360C 


Tti vol 1 1 cx\ n 

.111 V XJl Ll^XllX 


15.36 1.40e-09 114-155 


1 798 


TPR002360C 

XJl UUVAJUUV 


Involucrin 

-LIL V V' 1 Uvl 11 1 


15.36 7.80e-09 105-146 


1 798 


TPR009^5RF 

1JT 13 VJ \Jj^J *J OX 


T/T WFO domain 


14.64 7. 3 3 e- 09 155-185 


1 790 


TPR003137 

1 1 DUUJ 1 J / 


Prntf a a<!F' a<;^opiatftd ^PA^ domain 

X. IULwUOV ClOoVJvlClLvU \_1 * J VlVJlllO-lll 


22.40 2.86e-15 551-581 


1729 


PR00747C 


Glycosyl hydrolase family 47 signature III 


12.34 5.14e-ll 145-163 


1 990 

i /zy 


PPHH9il9F 
x ixUU /H / J_/ 


x/r»r\c"\/1 Iri/rlivil a cf> "f5inni1"\7 47 ^icmsitii'rp "V 
VJiyvvUbyi 11 yvll vJldoC xaiiiiiy *r / oi^iiciLuit; v 


14 81 2 08e-18 225-242 


1729 


PR00747H 


Glycosyl hydrolase family 47 signature VIII 


12.76 5.50e-17 326-346 


1 / 


pp nmoo/^ 


Alpna/Deia giiaam iamny signature xxi 


3 37 5 39e-09 77-90 


1731 


IPB000276B 


Rhodopsin-like GPCR superfamily 


4.97 9.05e-09 192-203 


1 7^1 


TPRonn97^p 


Plu^rl/vnoiM-lilr^ PrPr^P cnnpffamilv 
xxiiotiopalll lllvC v-117 v_>-rv bupu iciiiiiiy 


8 03 3 50e-l 1 244-255 


1731 


IPB000276D 


Rhodopsin-like GPCR superfamily 


9.40 9.47e-14 285-301 


1731 


PR00237A 


Rhodopsin-like GPCR superfamily 
signature I 


9.81 2.42e-13 35-59 


1731 


PR00237B 


Rhodopsin-like GPCR superfamily 
signature II 


12.45 5.24e-16 68-89 


1731 


PR00237D 


Rhodopsin-like GPCR superfamily 
signature IV 


9.76 1.47e-ll 133-154 
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Table 3B 
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SEQ 
w 

INU: 


Database 
entry JLU 


Description 


Results* 


1 711 
I / D I 


pp on9i7T? 


xvJluaopSlIl-ilivG UrVjlv SUpL/Ilmllliy 
olgridLUIc V 


1 ^ 03 R 07e-l 1 1 S4-907 


1 71 1 


pt? 009i7T? 


xvnoaopsiii-'liKc vjti^v-Jv ouperiaiiniy 

ci cm est i nv* "\/T 
ol^IlctlUlC VX 


14 ?4 9 SQp-1 7 934-9^8 


1 7^ 1 

1 / D 1 


PR009^7G 


PTinrlrincin-ltlrf 3 frPf^P 1 1 mpr Fa m i 1 V 
IVllUU*jpblII~lllVC VJJTOXN. oLljJClXClIllIXjr 

sionfifiirp VTI 

olgllcllLll w V XX 


19 23 8 1 le-1 9 275-301 

X J7 *jL/-J 0 . 1 X v/ 1. ^ jLi 1 *J *J\J X 


1731 


PR 00744 A 


NpnrAkinin rpppntAr siimatnre 1 


10 06 5 00e-15 29-40 


1731 

X / — s X 


PR00244B 


Npnrnkinin rpoentAr sipnature 11 


12 18 1 00e-13 55-64 


1731 


PR00944C 


Npurnki nin rpppntAr sipnature 11 1 


13 48 2 87e-12 108-118 


1731 


PR00944D 

JT XX\j'Uy<i 1 T*TXX 


Npnrnkinin rpppntAr sionntnrp TV^ 

IN CUl VJlVLlll.il 1 V^Vw'V/fJ L\Jl olgildLUi v X V 


7 47 9 36e-14 125-135 


1731 

X / J X 


PR00244F 


TvIpiirnVinin rprpntnr Qicmatijrs V 

1 1W Lil V/1V1 11111 1 vvvplUl OlglldtUl V V 


5 67 1 44e-ll 183-192 


1731 


PR00244F 

X xvw v^~~x 


TxlRiimlrinin rpcRntor sipnatiiTe VT 

11 will \J 1V1 1 1 1 i 1 IwvvUkV/l OlKllUtUl v V X 


12 03 6 29e-16 204-217 


1731 

X / J X 


PR00244G 


TvJpnrnVi nin rprpAtor sipnature VTf 

1> wUl UlVllllll IvvvpkVl OlglltlLUl V V XX 


9 65 1 45e-15 268-279 


1731 

X / w> X 


PR00244H 


Tsfpurnl^inin receiptor sionatLire VII [ 


13 18 5 50e-26 294-312 


1731 


PRO 1025 A 


Neurokinin NTC2 receptor sienature I 


8 63 5 50e-16 2-15 


1731 


PR01025B 

X 1\U X UAJ XJ 


Tvlpiirnkinin MR 7 rpppntor sipnatnre TT 

X> wLil WXVX11X1X lllvfa IWuvpLUl L7XgllH.LL4.lVv XX 


12 20 7 lle-17 15-28 


1731 


PR01 025C 


Npiirnkinin TnJTC7 rpppntor sienature Til 

1 lL/Lll UIV111111 X l XV £ * lLk/wL/LWl LJlglltX L Lil \-/ XXX 


18 53 2 23e-16 154-166 


1731 


PR01025D 


Neurokinin NTC2 recentor signature IV 


12.91 5.60e-18 168-182 


1731 

X / ~> 1 


PR01025E 


Neurokinin NTC7 recentor signature V 


8 25 7 23e-25 212-230 

XX v *W *X / «*/•/ XX sX <«-^ X a-X ^X«X VX 


1731 


PRO 1 025 F 

X IVU X \J4**J X 


Neurokinin NK2 recentor sifmature V7 


14 52 1 00e-17 257-269 

X I • X * V V XX X / Aw«X / XV XX .x 


1731 


PRO 1 025 G 


TSTeiirnlriniTi 7^IK2 recentor signature VTI 

X 1 Vv LXX V/1V111X1X Xii.Vfa 1 wv vptvl iJl gllw^Ul V Y XX 


9 85 6 65e-23 312-331 

_x « w ^-^ XX ■ XX *X XV «X JV. »X w> x. 


1733 


TPB000917A 


Siilfatase 


9 52 5 26e-10 44-55 

_X • w' ^X a>X ■ 4ad XX X^ X. \X ■ 1 MX ^X 


1734 


PR00464A 


Group II E-class P450 signature I 


19.27 7.26e-14 167-187 


1 7^ 

X / JX 


TPR000937R 


f"rP TP HfVIVICllfl 
VJXN.XX UUxlldlli 


30 66 R l4e-09 273-323 


1735 


IPB002097C 


Profilin 


22.96 4.04e-10 986-1040 


1 71^ 
1 / JJ 


xl^xxUUZU;/* / v_^ 


Pr/"\"Fi Tin 

x roiiim 


99 Qfi S OSp-OQ 991-104^ 


1735 


IPB003753G 


Exonuclease VII, large subunit 


31.72 8.85e-09 787-831 


1 71 <C 




x^rotiiin signature ill 


1/1 49 9 OQp AO QC£ 1 OO^I 


1735 


PR00392D 


Profilin signature IV 


10.64 4.35e-09 1008-1022 


1 /DO 


TDDAA1 TJO 

IrrsUUl 1 do 


rungal transcriptional regulatory protein, N- 
terminus 


17 17 Q 7Qo OQ £1 7Q 

la. i / y. /ye-uy oz- /o 


1 aio 




INAJJ aependent epimerase/aenyaratase 
family 


1/1 41 O £^<=. OQ 17Q 400 


1 7/i 1 

1/41 


lrBULoo44J 


Domain round m JNa-L-a exenanger ana 
integrin-beta4 


1 Q ^/1 ^ ^7o OQ ^7 111 

io.o4 o.o /e-uy x /- 1 1 1 


1 HA 1 

1741 


n t> nnn i c / — 1 


Luteo virus group 1 coat protein signature 

Vll 


1^ 7/1 £ ?Qo Ifl Q AQ 

1j./4 0.xc5e-lU 


1 7/1 < 

1 /4j 


IrrsUU IZcoA 


iixtracelluiar proteins &Lr/ 1 px- 1/Agj/rK- 

1 /Qr»7 


1^ 9£ A /^7<=> 10 9Q AC 

lo. zd o.o/e-iu zy-4o 


1 f L rD 


TPPI001 9 SIR 


Pvt-rQf^llnlar nrnfpmc Q f~ 1 p/ r T , rxY 1 /A ff C/pp 

1/S1p7 

1/ / 


1^ OQ 9 ?^e-19 ^7-79 


1745 

X / *TX 


TPR0019S3F 

XX \ J\J\J l/,Oj X_/ 


Fvfrapp11n1ar nrotpins SPP/Tnv-1 /Aa5/PR- 

J_yA.ll dVCllUldi IJUJIVSJIIO Jv^l/ 1 UA l/AgJ/llV* 

1/Sc7 

Xf kJV/ / 


13 34 5 50e-20 127-147 


1745 


IPB003106E 


TTompohoY associated leucine 7inner 

ILV/lllvvL/V/A CluiJWlUtvVi iVUVlllw I VJ VJ vl 


13 73 6 90e-09 25-55 

X *J + i +J V/ « .x* xx W vx -> w *x -x 7 


1745 


PR00837A 

X X W UU J / xV. 


Allprcrpn V5/Tnx-1 farnilv sipnatnrp T 

-TVll'wvl gV^ll V +sl X X 1 **- x x«-i.ixii_y OlgllCH-lXX C X 


14 69 1 56e-14 57-75 


1745 


PR00837C 


Allprppn V5/Tnx-1 familv sipriafurp TTT 


16 31 5 50e-20 126-142 

X XX • *X X «X * W XX XX VX X V X ■ 


1745 


PR00838D 


Venom alleraen 5 signature IV 


8.59 3.57e-09 57-75 


1745 


PR00838G 


Venom allergen 5 signature VII 


15.39 8.60e-18 125-144 


1749 


PR00672D 


Inhibin beta C chain signature IV 


10.52 6.40e-09 96-112 


1749 


PR01035I 


Tetracycline resistance protein signature IX 


13.24 3.86e-09 568-589 


1749 


PR01221G 


Rodent urinary protein signature VII 


12.11 8.68e-10 61-78 


1750 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 8.16e-09 1852-1889 


1751 


IPB001991A 


Sodium :dicarboxylate symporter family 


10.78 2.53e-20 9-40 


1751 


IPB001991B 


Sodium:dicarboxylate symporter family 


14.40 2.29e-33 324-355 
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Table 3B 

443 



SEQ 
ID 


Database 
entry ID 


Description 


ivesuits^ 


1 7^1 


11 D\J\J 17> Iv 


Qr , ir1iiim'r1ir*5i , r'f^rkw/1{i+/ a c\/mYl<TrtP1* ■ffimilv 
OUUlUlll.Uil/CU. UL>A_yiCtLC oyilljJUA Iwl J.o.illilj' 


Q 91 ^ 40e-74 175-^94 


1751 


PR00173B 


Glutamate-aspartate symporter signature II 


7.82 7.87e-12 19-39 


1 "7^ 1 

1 / J 1 


pp aai txc* 

rrsSJ\J I / JU 


ijriutamate-aspanBte symporter kigiidiure 111 


1 Pi £7 Q 44p«9^ 197-TS? 


1753 


IPB003006B 


Immunoglobulin and major 
Histocompatibility complex aomam 


20.23 6.54e-09 286-323 


1755 


IPB002617C 


Vesicular monoamine transporter 


16.23 5.03e-10 91-143 


1755 


PR01035E 


Tetracycline resistance protein signature V 


lu.o/ 2.uce-uy iui-izj 


1755 


PR01035F 


Tetracycline resistance protein signature VI 


12.88 7.52e-09 332-352 


\151 


TTVT4 /"\ r4/\0 />/\T» 

IPB000300B 


Inositol polyphosphate related phosphatase 
family 


15.64 1.93e-lo 4oo-491 


1757 


IPB000300C 


Inositol polyphosphate related phosphatase 
iamiiy 


14. dU o.jUe-lZ j3o-->4S 


1757 


IPB000300D 


Inositol polyphosphate related phosphatase 
iamiiy 


29.87 4.51e-18 589-623 


1757 


IPB000300E 


Inositol polyphosphate related phosphatase 
iamiiy 


15.50 1.43e-14 774-790 


1757 


IPB000492B 


Protamine 2 (PRM2) 


5.26 4.23e-09 83-117 


1758 


tvt>aai zrr\i~> 

PR00169B 


Potassium channel signature H 


1 c Ad o cria oo oio o/^t 
lo.4y 6.bUe-zo jLoy-Zoj 


1758 


PR00169C 


Potassium channel signature III 


1*7 CO AC<3 Q 11 "Jlrt 


1758 


PR01496C 


Shaker voltage-gated K+ channel family 
signature III 


9.37 6.92e-l 2 202-213 


1758 


PKU1496JL) 


onaker voltage-gated Jv+ cnannel tamuy 
signature i v 


y. /4 J.zj>e-zi zoo-ZDz 


1 / JO 


rKU I4yoii 


anaKer voitage-gateu jsjt cnannei iamiiy 
signature v 


1 A S£ 7 11/= 1 <d. 9*CQ 770 

iu.oo z. iie-14 LZ>y~L i u 


1 7^8 
1 / JO 




JVVL.Z VUlldgC-gaLCU IV ~ dlctiliiCl blgllclLUiC 1 


SO ^ S7e-1 7 978-7Q? ! 

L>. /C— 1 / Z, / 0"X.3?Z» 


1 7^8 


ppm snoR 


ivvi.z voixage-gatea jntt cndnnci signdiure 

TT 

n 


S lO 1 76r_1 7 7Q4-^f»6 

0.17 l.ZDC-1 / Z^'JUU 


1759 


IPB000631B 


Uncharacterized protein family UPF003 1 


10.96 5.00e-12 100-112 


1 7^Q 


TPpnnn/^^ i 


uncxiaraCiGrizcu proiein id-iniiy urruuji 


17 C7 7 7^p_19 ^76-^87 

1Z.O / / . /JC LZ* J / \J JO I 


1 *7<Q 

i /jy 


TDT3AAA£1 i t? 
IroUUUoj IJci 


uncnaractenzea protein iamiiy ur ruuj i 


7 87 Q 9*\i=. 1 1 4^8 4^S 

/.oz y.zDe-i i 4jo-40o 


1 7^7 

1 IK) J. 


tdpaai n'XA A 


oooium.soiute symponer iamiiy 




1 7/^7 
1 /OZ 


tppaai 


oOGiurn.soiuie syrnporier iainiiy 




1 7/^7 
1 /OZ 




ooaium.soiute symporter iamiiy 


1871 9 l^p 71 4zLS 477 
lo.Zl Z. lOe-Zi *t*to-H- / / 


1 /Oj 


JLJr J3UUU4o J 


jLeucine ncn repeaT i^-teirninai aomaiii 


1 1 1 R 9 7^p_1 9 ^SR-^79 

1 l.lo Z.Zjc- 1 JJ O - J / z. 


1763 


IPB003006B 


Immunoglobulin and major 
nihiocompdiioiiiLy complex uonidm 


20.23 5.15e-09 462-499 


1 / OJ> 




jLreucine-ricn repedi bigndiure i 


1 1 77 S SOp-T 9 86-00 


1763 


PR00019A 


Leucine-rich repeat signature I 


11.72 8.91e-09 278-291 


1 lOD 


JrlvUUU iy±5 


Leucme-ncn repeat signature n 


1 1 AO 1 flflp A8 1 A7 1 9A 
11. 4Z l.UUe-UO 1U/-1ZU 


1 /DO 


rKUUUiyJts 


i^eucine-ncn repeat signature ii 


1 1 A7 8 77z=> 11 OA 

i i.4z o. / /e-i i oj-yo 


1 /OJ 




Eeucine-ricn repeat signature 11 


1 1 AO Q <*7o 1A 7*7^ 9QQ 

ii. 4Z y.o/e-iu z/d-zoo 


1 /o4 


TOT) Ann i on 
Lrr>\Jy)\)L jU 


JNeutral zinc metaiiopeptiaases, zmc- 
Dinaing region 


j.oo o.o /e-uy zy4-JU4 


1764 


IPB001762A 


Disintegrin 


23.93 5.21e-10 241-281 


1 *7/^/1 
1 /04 


TDT3AA1 A 

lrr>V\J 1 oOZA 


JMemorane attacic complex 

nnTTiTinnPTits/nerfoTiTl/coTTir»lRrnprit C^Q 


1 O </i 1 1 A yl 1 ^ A7 1 
1Z.D4 1. /Oe-lU 410-40 1 


1764 


IPB002861B 


Reeler domain 


10.50 1.41e-09 422-450 


1764 


IPB002870D 


Reprolysin family propeptide 


16.31 6.25e-12 265-280 


1764 


IPB002870E 


Reprolysin family propeptide 


11.90 4.41e-12 296-308 


1764 


IPB002870F 


Reprolysin family propeptide 


18.81 3.73e-13 335-359 


1764 


PR00977A 


Scytalidopepsin B aspartic protease (A4) 
signature I 


7.30 9.1 le-09 233-255 


1765 


PR00824A 


Hepatic lipase signature I 


8.114.44e-22 6-24 
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SEQ 

ID 

NO: 


Database 
entry ID 


Description 


Ke suits 


1768 


IPB000337A 


G-protein coupled receptors lamuy ^ 
(Metabotropic glutamate receptor-iiKej 


q 174 3?e~14 47-58 


1768 


IPB000337B 


G~protein coupled receptors family 3 
(Metabotropic glutamate receptor-iiKe; 


24.51 9.90e-09 150-198 


1768 


IPB001828A 


Receptor family ligand binding region 


13.21 9.05e-09 74-86 | 


1768 


PR00248A 


Metabotropic glutamate CjPCK signature i 


7 4.6 0 1 Op- 16 46-SR ' 


1768 


PR00248B 


Metabotropic glutamate GPCR signature II 


12.69 2.89e-17 75-90 


1768 


PR00248C 


Metabotropic glutamate GPCK signature ill 


11 q< s ?np-24 90-109 1 


1768 


PR00248D 


Metabotropic glutamate GPCK signature IV 


u q OOp-1 5 1 53-179 1 


1768 


PRO 1054 A 


Metabotropic glutamate receptor 4 signature 

I 


7 79 1 ??p-75 Q-26 


1768 


PR01054B 


Metabotropic glutamate receptor 4 signature 
II 


o^A i 47e-26 26-44 1 


1768 


PR01054C 


Metabotropic glutamate receptor *\ signature 

TTT 
111 


10 37 1 15e-22 131-147 | 


1769 


IPB000008B 


C2 domain 


17 91 3. 02e- 10 226-243 ! 


1769 


IPB000008C 


C2 domain 


23.37 2.80e-31 400-439 


1769 


IPB000008C 


C2 domain 


23.37 3.04e-10 265-304 


1769 


IPB000008D 


C2 domain 


14 83 8 59e-14 455-473 ! 


1769 


PR00360A 


C2 domain signature i 


15 18 3 00e-10 398-410 


1769 


PR00360B 


C2 domain signature 11 


11 64 2 33e-12 427-440 ] 


1769 


PR00360B 


C2 domain signature 11 


11 64 8 88e-13 292-305 I 


1769 


PR00399A 


Synaptotagmm signature i 


15 05 5 50e-ll 236-251 [ 


1769 


PR00399B 


Synaptotagmm signature 11 


14 30 5 50e-10 251-264 


1769 


PR00399B 


Synaptotagmm signature ll 


14 30 7 92e-12 385-398 1 


1769 


PR00399C 


Synaptotagmm signature ill 


15 89 1 63e-10 308-323 1 


1769 


PR00399D 


Synaptotagmm signature IV 


12 72 6 33e-10 328-338 


1770 


IPB000534A 


Semialdenyde denydrogenase 


15 79 4 21e-09 287-304 I 


1771 


IPB001195 


Glycopnorm A 


7 52 6 07e-10 717-745 


1771 


IPB001359H 


bynapsin 


22 58 7.99e-09 871-921 


1771 


IPB001514D 


RJN A polymerases ui d\j to 4U iva suounnis 


19 99 9 43e-09 1326-1352^ 


1771 


IPB002126A 


Cadnerin domain 


14 68 4 15e-ll 412-428 | 


1771 


IPB002126B 


Cadnerm domain 


12 04 l,50e-09 232-249 j 


1771 


IPB002126B 


Cadnerin domain 


12 04 3 2^-12 448-465 ! 


1771 


IPB002126B 


Cadnerm domain 


12 04 7.25e-09 123-140 | 


1771 


PR00205A 


Cadnerm signature 1 


17 38 5.66e-10 293-312 1 


1771 


PR00205A 


Cadnerm signature 1 


17 38 7 84e-ll 75-94 ! 


1771 


PR00205B 


Cadnerm signature 11 


20 09 2 73e-09 135-164 1 


1771 


PR00205B 


Cadnerin signature 11 


20 09 4 06e-10 460-489 i 


1771 


PR00205B 


Cadnerm signature ll 


20 09 8 80e-15 244-273 ! 


1771 


PR00205C 


Cadnerm signature in 


13 59 7.75e-09 317-329 


1771 


TXT% f\f\K\CT\ 

PR00205D 


Cadnerm signature i v 


12.22 3.42e- 14 439-458 ] 

Am- m\mW • +mmT mtmW *mW » » 


1771 


PR00205D 


Cadnerm signature iv 


12 22 3 83e-12 223-242 \ 


1771 


PR00205D 


Cadnerm signature IV 


12 22 6 40e-12 544-563 i 

J_ m CmW % m\mm1m*mA) m T W XW ■ » ""^ ■WW 1 


1771 


PR00205D 


Cadnerm signature iv 


12 22 6 71e-ll 331-350 | 


1771 


PR00205D 


Cadnerm signature IV 


12 22 7 51e-10 114-133 

X 1 - » dtmrnl i • +mS X W A. V/ A -L L ^" W W' 


1 ^771 
1 / / 1 




Cadherin sienature V 

Vy(AVAllVl 111 A 1 


10.82 4.86e-09 458-471 


1771 


PR00205F 


Cadherin signature VI 


19.57 1.95e-ll 408-434 


1771 


PR00205F 


Cadherin signature VI 


19.57 2.50e-15 513-539 


1771 


PR00205F 


Cadherin signature VI 


19.57 3.65e-12 300-326 ~1 


1771 


PR00205G 


Cadherin signature VII 


13.05 2.65e-09 334-351 


1771 


PR00205G 


Cadherin signature VII 


13.05 4. 65e- 11 226-243 


1771 


PR00205G 


Cadherin signature VII 


13.05 7.91e-10 442-459 


1771 


PR00205G 


Cadherin signature VII 


13.05 9.57e-14 547-564 



WO 03/054152 
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SEQ 

ID 

NO: 



1771 



1771 



1774 



1774 



1774 



1774 
1774 



1775 



1775 



1775 



1775 



1775 



1775 



1775 



1775 



1775 



1775 



1778 



1778 



1778 



1778 



1778 
1780 



1780 



1780 



1780 



1780 



1780 



1780 



1780 



1782 



1782 



1782 



1782 



1782 



1782 



1782 



1782 



1782 



1782 



1782 



1783 



1783 
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Database 
entry ID 



PR01157D 



PR01383B 



IPB000301B 



IPB000301D 



PR00259B 



PR00259C 



PR00259D 



IPB000276A 



Description 



P2 purinoceptor signature IV 



Claudin-10 signature II 



Transmembrane 4 family 



Transmembrane 4 family 



Transmembrane four family signature II 



Transmembrane four family signature III 



Transmembrane four family signature IV 



Rhodopsin-like GPCR superfamily 



IPB000276A Rhodopsin-like GPCR superfamily 



PR00237E 



PR00245A 



PR00245A 



PR00245B 



PR00245B 



PR00245C 



PR00245D 



PR00534A 



IPB000920C 



IPB001388 



PR00219A 



PR00219B 



PR00219C 



IPB000353B 



IPB000353C 



IPB001039A 



IPB001039C 



IPB001039D 



IPB003006A 



IPB003006B 



IPB003363E 



IPB000084A 



IPB000258G 



IPB000258H 



IPB000765 



IPB000790B 



IPB001140A 



IPB001140B 



IPB001482B 



PR00300A 



PR00449A 



PR00671B 



IPB000175A 



Rhodopsin-like GPCR superfamily 
signature V 



Olfactory receptor signature I 



Olfactory receptor signature I 



Olfactory receptor signature II 



Olfactory receptor signature II 



Olfactory receptor signature III 



Olfactory receptor signature IV 



Melanocortin receptor family signature I 



Myelin P0 protein 



Synaptobrevin 



Synaptobrevin signature I 



Synaptobrevin signature II 



Synaptobrevin signature III 



Class II histocompatibility antigen, beta 
chain, beta-1 domain 



Results* 



16.03 5.42e-09 815-827 



5.36 5.05e-09 724-736 



18.74 4.68e-26 492-527 



26.02 7.30e-23 662-702 



13.09 3.03e-17 486-512 



16.39 5.76e-19 513-541 



14.90 8.54e-18 676-702 



11.56 1.95e-10 111-122 



11.56 5.50e-09 199-210 



13.03 3.90e-10 280-303 



10.98 1.90e-10 85-96 



10.98 2.50e-10 173-184 



13.73 l.OOe- 10 210-222 



13.73 8.71e-09 122-134 



14.65 4.91e-15 257-273 
9.34 8.41e-13 317-326 



12.77 9.56e-l 144-56 



15.78 2.33e-09 160-212 



26.97 7.60e-30 115-169 



10.78 1.35e-ll 118-137 



9.91 5.30e-09 138-157 



6.91 2.20e-13 20-39 



Class II histocompatibility antigen, beta 
chain, beta-1 domain 



Major histocompatibility complex protein, 
Class I 



Major histocompatibility complex protein, 
Class I 



Major histocompatibility complex protein, 
Class I 



Immunoglobulin and major 
histocompatibility complex domain 



Immunoglobulin and major 
histocompatibility complex domain 



Glycoprotein GG/GX 



PE N-terminus 



19.16 4.96e-12 177-226 



20.11 9.18e-10 228-282 



17.17 1.00e-40 15-68 



19.82 8.08e-12 151-204 



16.49 1.00e-40 229-283 



17.51 3.25e-18 198-220 



20.23 5.50e-28 235-272 



13.35 2. 18e- 10 282-314 



Bacterial ice-nucleation proteins octamer 
repeat 



Bacterial ice-nucleation proteins octamer 
repeat 



GTP1/OBG family 



ATP synthase alpha subunit, C-terminal 



ABC transporter transmembrane region 



ABC transporter transmembrane region 



Bacterial type II secretion system protein E 



ATP-dependent Clp protease ATP-binding 
subunit signature I 



Transforming protein P21 ras signature I 



Inhibin beta B chain signature II 



Sodium:neurotransmitter symporter family 



IPB0Q0175B 1 Sodium:neurotransmitter symporter family 



14.90 3.44e-09 644-691 



8.61 4.96e-09 681-734 



10.13 6.03e-09 719-773 



26.91 9. 84e- 10 449-492 



10.45 9.64e-10 424-467 



21.73 2.42e- 18 437-483 



15.62 7.70e-12 555-593 



12.05 3.70e-09 447-469 



7.82 8.27e-09 451-469 



12.48 1.67e-09 449-470 



4.29 9.41e-09 396-415 



16.29 1.00e-40 211-260 



19.12 3.37e-32 298-332 
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Table 3B 
446 



SEQ 
ID 

INO: 


Database 
entry ID 


uescripiion 


Results* 


1783 


IPB000175C 


Sodium:neurotransrnitter symporter family 


15.09 3.75e~13 385-436 


1783 


TtJ*DAAn 1 *7 

lr\b>UUU 1 /jD 


CA^iitrrt^npiimtrdncmit+pi" cvmnnrter tarnilv 
oOQlUm.neurUirdriMlill.iei oyui[J»-M.LGi Lainny 


23 45 1.00e-40 451-503 


1783 


IPB000175E 


Sodiummeurotransmitter symporter family 


21.88 1.67e-30 544-583 


1783 


IPBOOOl/Dr 


oOQium.neurouansimuer ayiiipui u^i Laumy 


25 63 1.00e-30 639-678 


1783 


IPB000175G 


Sodium:neurotransmitter symporter family 


16.18 6.32e-20 700-722 


1783 


PR00176A 


(bodium/cnioriae neuroxrdnsmitier 
symporter signature I 


16 97 5 50e-24 211-232 


1783 


PR00176B 


oOQium/cnioriQe neurotranbiiiiutci 
symporter signature II 


7 07 2 13e-22 240-259 


1783 


PR00176C 


oouium/cnloriae neurorransmiiLer 
symporter signature III 


10 57 3 16e-23 283-309 


1783 i 


PR00176D 


o odium/ cnioriae neurorransmiuer 
symporter signature IV 


8 96 7 92e-21 412-429 


1783 


PR00176E 


boaium/cnloriae neurorransmiuer 
symporter signature V 


11 14 3 S4e-19 494-514 


1783 


PR00176F 


ooaium/cnionae neurorransmiuer 
symporter signature VI 


11 11 9 57e-20 548-567 


1783 


PR00176G 


ooaium/chloride neurotransmitter 
symporter signature VII 


i ^ i?7 67e-16 630-650 


1783 


PR00176H 


oodium/cnioriae neurotransmiuer 
symporter signature VIII 


1^ 94 4 52e-16 670-690 


1783 


PRO 1204 A 


OJycine neurotransmitter transporter type i 
\\jL, I i - 1 ) signature i 


15 11 3 25e-18 187-200 


1783 


PR01204B 


Lilycme neurotransmiuer urdnsponei type 1 
(GLYT-1) signature II 


12 89 1 37e-15 201-211 


1783 


PR01204C 


Glycine neurotransmitter irmiapui lci \>yy& i 

rr^T VT ^'\ Qicmatiirp TFT 


12 34 1 60e-16 335-347 


1 'TOO 

1783 


TIT) A 1 OA/1 T"\ 


oiycme neui oiiaiioiiiiLLci Liaiispuii.vyi Ly\j<> ± 

^ vjrj_- 1 L L J oigiiatuic j. v 


10 51 6.86e-18 349-364 


1 /04 


Ix J3UU 1 OZ / rv 


OClllcl UWillcllll 


13.76 3.02e-09 662-674 


1 /o4 


lri3UUZ lO DU 


Plf^vin T"f ft n< =k s^t 


14.72 1.15e- 10 662-674 


1 7QA 
1 /o4 


PDA1 11 Q A 

x ISA/ 1j1 


ClMa] npll line-derived neurotrophic factor 
recent or alnha 3 signature I 


3.85 7.08e-09 27-39 


1 7B^ 
1 /OD 




PTF/NF-T familv 


11.22 1.00e-40 644-693 


1 70^ 


Ix JjUUUOH- / J3 


PTF/NF-T familv 


10.06 2.14e-30 694-718 


I /CO 


TPRnnn^j.7r i 

irDUUUUT' / v> 


PTF/MF-I familv 


9.27 1 .00e-40 724-765 


I /CO 


x pt3 n ri n ^ A7Ti 


PTF/NF-T familv 


12.61 1.20e-32 766-790 


i/OJ 


TPT5nnn^id.7P 

lx JdUUUOH- / JD 


rTF/MF-T familv 


12.21 9.18e-38 791-821 


1 7Q£ 
1 /oO 




T pupinp rich reneat C-terminal domain 


11.18 4.54e-10 675-689 


1786 


PR00019A 


Leucine-rich repeat signature I 


11.72 4.67e-10 243-256 


1 /oo 




T fanr»inf*-rir , ri reneat cjionatiire T 


11.72 6.45e-09 434-447 


1786 


PR00019B 


Leucine-rich repeat signature II 


11.42 4.27e- 11 431-444 


1786 


nn AAA1 A T"5 

PR00019B 


i_/eucme-ricn repcd.i faigiidiuit; 11 


11 42 5 09e-09 479-492 


1786 


PR00019B 


Leucine-rich repeat signature II 


11.42 7.55e-09 597-610 


1786 


PR00019B 


Leucme-ncn repeal signature 11 


1 1 42 8 64e-09 240-253 


1786 


PR00364D 


Disease resistance protein signature IV 


14.89 4.86e-09 188-204 


1 7SQ 


ppnn7H9F 
x^jvuu /wzr 


Aeriflavin resistance orotein familv 
signature VI 


14.03 1.46e- 17 49-72 


1789 


PR00702G 


Acriflavin resistance protein family 
signature VII 


10.67 6.00e-21 74-97 


1789 


PR00702H 


Acriflavin resistance protein family 
signature VIII 


14.64 3.67e-14 153-170 


1789 


PR00702I 


Acriflavin resistance protein family 
signature IX 


16.75 1.39e-15 216-230 
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Table 3B 
447 



SEQ 
ID 


Database 
entry ID 


Description 


ivesuus 


1 /yu 


PP007Q7P 


Qtr^n+nnQin / /~* 1 A\ r*\/ctf»rrua TYTOtPa<3P; fpmilv 
otlcpiupam \ \~/\.\J) uyaLCiiic jjiui&ao^/ jLcuimy 

ci on dtTiff* "WT 
olgllaLUIC V 1 


11 RQ 8 81e-09 41-62 


1 70 1 

i / y i 


TPP»000R4Qr> 


vJip 1 lalllliy Ul 11 cUlbpUi IClO 


22 39 9 25e-31 55-91 


1791 


IPB000849E 


GlpT family of transporters 


19.05 2.88e-36 108-145 


1 7Q1 

i /y i 


TPTinnnSiiQTr 
ir^jjuuuo'+yi 1 


(~X\ rV 1 ' ■Pom ill/ rt"f +t*Q , MOi^fM*tf>i*c 
Uip 1 lalllliy Ul tlallapUl It-lb 


15 06 8 00e-20 162-179 


1 7G0 

l /yz 


lrx3UUUy 1 /L/ 


ouiiatase 




1 OQO 

i /yz 


TPnfifkl££1 A 
Lr&KJYJDOO 1 A 


riis jvinase a uomain 


0 1 60p-00 944-9 


1 OQ£ 

i /yo 


TPRnnri?n^ a 

lr r>\J\J\JZ\JjJ\ 


f^P 01 Am oiti 


1 8 40 6 63e-12 903-933 


1 7Q£ 
i /yo 


TPPionnooiP; 


uro uomam 


1^ OS 0 19p-19 9SS-1009 


1796 


IPB000483 


Leucine rich repeat C-terminal domain 


11.18 6.04e-ll 371-385 


1 OQ£ 

i /yo 


tdt* n n n ci o f 4 


vj-protem coupieo. rcceptorb laiiniy ^ 
(secretin-like) 


iq <i 7 ^p-OQ 017 


1 7Q£ 

i /yo 


rKAJUUiyA 


jucucine-ncn repeat signdiure 1 


1 1 79 1 OOp-OQ ^1 5-39 S 

11./j6 I.UUC u7 JlJ JaO 


1796 


PR00019B 


Leucine-rich repeat signature II 


11.42 8.00e-10 312-325 


1 TOO 

1 /yo 


IroUUzzUoC 


oec i protein 


iJ.Jl ^r. lJC-1/ J\J-D\J 


1798 


IPB002208D 


SecY protein 


22.30 2.29e-36 82-118 


1798 


PR00303C 


Preprotein translocase SecY subunit 
signature III 


K A< *7 0/1q ID 1/1 <T 

13. /.o4e-iy / 


1798 


PR00303D 


Preprotein translocase SecY subunit 
signature IV 


id.4o i.yDe-zz /z-y/ 


1798 


PR003U3b 


Preprotein translocase isec y suDumt 
signature v 


1/1 /ll O IQo 71 OS 101 

14. 4 j /. iye-Zj yo-izi 


1798 


PR00825B 


Vespid venom allergen phospholipase Al 
signature li 


13.74 9.40e-09 234-254 


1800 


PR00081A 


Glucose/ribitol dehydrogenase family 
signature i 


10.07 9.70e-12 179-196 


1 onn 


nn AA£I£QTJ 

rKUUooyiJ 


inniDin aipna cnam signature 11 


J.OO J.7JC-U7 OU-70 


1 OA 1 
loU 1 




riK peptide transporters (F i kz j 


Q 7 1 1 7zlf» 1119 9A 
o.zi j./'fe-ij iz-z^+ 


1 OA1 

loOl 


tddaaa 1 nnn 
lrDVKJUWyiJ 


rlK peptide transporters i i<z j 


OC AO 1 AAa OQ A&A 1 

ZD.uy l.uue-zy £ *o £ i—d i i 


1802 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
y w i D repeat 


7.05 4.96e-ll 494-504 


loUZ 


TDT3AAA1 ^O 


Aspartic acia ana asparagme nyaroxyiation 
site 


o.oO 1 /e- l^f £ t-oy-»/U*+ 


1 cm 




integrin ueta, L>-iermmus 


77 A^s 7 Qflp- 1 0 070_ 1 0 1 7 
Z /.*+»/ Z,7UC"lu !7/V/-lV7lZ, 


1802 


IPB001774C 


Delta serrate ligand 


18.25 2.19e-09 973-1015 


1 OAO 


TT>T3 A A 1 O01D 


L^aicium-Dinding tiijrr-iiKe uoniain 


IZ.Zo D.Ooc-11 ^oy-Jw 


1 OAO 

loUz 


TOT3AA1 Q^O A 

irr>uuiyoyA 


nuKaiyotic anu viral aspartic protease active 
sue 


1 ^ 17 7 nfV-i n i7S_iqa 


1 ©m 
loUZ 


ppoooi r\r* 


type li rsijrr-UKe signature in 


0 / . 1 UC-1VJ t t7H-JU'T 


1 coo 


risJJKJ /yz,D 


r epsin ^A i ) aspartic protease iamny 
signature 11 


19 fi*; ^ 9^p-1^ ^^1-^44 


1 cno 


po AA7Q9P 
rlvuU / yz\^ 


repsm ^ai ) aspartic pioiease ramiiy 

01 nnotiirp TTT 

oigiiaiuie in 


R 0 ^Ip-13 380-3Q1 

O.OJ 7.JlC"lJ JOU-J7 1 


1 sn^ 


TPR00079QA 


PA/TP - 9 7 /F MP /MP ? 0 farnilv 


19 70 1 95e-09 1-22 


1 CAI 


tpp»ooo77qa 


PA/fP 7 9 /Th A/TP /MP 70 fnmiK/ 
x^lVUr -ZZ/ Jj/lvlr / JVLr Z\J lalllliy 


1 0 70 9 90p-1 0 1 8-39 






P A/TP-9 9 /F MP /MP9 0 famih/ 
Jrivur "£.z.i Jj/iviir /ivj_rz<vj idiiiiiy 


13 56 5 68e-10 49-59 


1803 


IPB000729C 


PMP-22/EMP/MP20 family 


37.83 3.47e-23 87-139 


1803 


IPB000729C 


PMP-22/EMP/MP20 family 


37.83 7.83e-22 80-132 


1803 


IPB000729D 


PMP-22/EMP/MP20 family 


18.96 4.94e-18 160-187 


1803 


PR01077B 


Claudin signature II 


14.12 8.80e-10 49-55 


1803 


PR01077C 


Claudin signature III 


13.60 9.43e-12 63-73 


1804 


IPB001717I 


Anion exchanger family 


7.53 2.07e-ll 18-58 


1804 


IPB001717L 


Anion exchanger family 


14.31 9,28e-14 84-130 


1804 


IPB001717M 


Anion exchanger family 


19.47 6.23e-32 133-185 
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Table 3B 

448 



ID 

IN*-"'. 


Database 
entry id 


uesci ipuon 


1\V3U113 




TPROfll 71 7N 


Aninn pyrhanopr fVimilv 


14.40 1.00e-30 183-208 


1 SOA 


TPR001 71 70 


An'nn f^Yf^liciticjpr "fmnilv 


10 14 4 76e-16 209-236 


1804 


IPB001717Q 


Anion exchanger family 


17.70 5.77e-09 232-273 


1 QAA 
15U4 


pphai 

rtsXfyJ LODr 


./T.1JUU1 I CAUllcillgCl. JLculliiy olguaiuii^ v j. 


14 73 1 50e-ll 42-49 


1804 


PR00165J 


Anion exchanger family signature X 


6.53 4.18e-09 127-135 


1 OA/1 


nn Art i (icrr 


Anion excnanger iamny signdture yvi 


17 74 6 40e-11 156-164 


1804 


PR00165M 


Anion exchanger family signature XIII 


9.29 7.26e-12 255-265 


1804 


T»-r> All OAA 

PR01120D 


riant oi_*c cnioriae cnannei signature i v 


in 46p-0Q 1^4-141 


1804 


PR01231E 


HC03- transporter superfamily signature V 


8.87 4.38e-15 40-52 


1804 


PR01231H 


HC03- transporter superfamily signature 
VIII 


19 1< Q IQo 1^111 1 94 


1805 


IPB001394A 


Ubiquitin carboxyl-termmal nydrolase 
family 2 


19 1^4 1 1 /Cp i/C 70/C OAl 
IZ.OH- 3. lOc-10 /oO-ol/j 


1805 


IPB001394B 


Ubiquitin carboxyl-terminal hydrolase 
iamily 2 


17.87 3.05e-21 1215-1244 


1805 


IPB001394C 


Ubiquitin carboxyl-terminal hydrolase 
family 2 


8.52 3.50e-10 1290-1299 


1805 


IPB001394D 


Ubiquitin carboxyl-terminal hydrolase 
family 2 


O 1 O A &C\cx 1 A 1 1A5 1 11 7 

y. iy 4.oue-iu i3Uo- 13 1 / 


1806 


PR0H30C 


Delayed-early response 
protein/equilibrative nucleoside transporter 

' j TTT 

signature 111 


7.16 6.00e-09 187-210 


1806 


PR01130E 


Delayed-early response 
protein/equiliurative nucieosioe transporter 
signature V 


14 91 ^ n7p m ino i^n 


1806 


t*t> m 1 OAT? 

PR01130r 


Uelayed-eariy lesponse 
protein/equilibrative nucleoside transporter 
signature v i 


^ 14 ^ ^4^-1 1 ^7_^S4 


1 oa^z 
ioU/ 


TPPAfil Q 1 QT3 


v>eiiuiose-uinaing uornd-iiij uaciciidi type 


14 22 2 Q7e-0Q 104-128 


lOIA 

15 1U 


tppaai^^9 a 


IyquwiI pi i 1^4""^ «j "f" "fron en r\rf pre 

vjeneiai bUDSiTdie iraiibpuiLcit> 


18 97 8 67e-14 736-768 


1811 


PR01228C 


Eggshell protein signature III 


5.69 3.90e-09 51-66 


lOII 

1813 


IPBUUU/oUA 


iNAUri-uDiquinone oxiaoreauciase cnam h } 
ammo terminus 


19 ^9 9 SOp-OQ 16-40 


loli 


IrrJUUUZoUA 


iN/\J_-ixi-UDiquinone oxiuoreaui/idoc oridiii 
ammo Lerminus 


1 9 5? 4 66e-1 0 1 9-52 


10 11 

io 13 


If r> U U Kj^h H-ts 


^aiiifiiiic/urdoii pcrrricdacs idiinij/ 


22 44 4 21e-09 51-94 


1 O 1 1 

lo 13 


TPP AAA/1 A ATI 


yVdnuiinc/ uidC/ii pciincdbco idiiiiiji' 


22 44 4 32e-09 43-86 


•toil 
lo 13 


TPPAfiA£0,4T 


nuKaryouc jvin/\ poiymerd^c 11 
heptapeptide repeat 


1 49 7 25e-09 841-883 


1 Q 1 1 
lo 13 


TPP.nnno99 


zLinu liiigCL j v_/^nz^ type 


14 67 1 00e-18 272-797 


1 oil 
lo 13 


TPT*nnns99 


■Zinr* finopr P9M9 tvnp 
Zjiiii/ oznz, type 


14 67 2 71e-10 416-441 


1513 


Tppnnna99 


Zjiiiu iiiigci, vvZ-iiz, type 


14 67 7 19e-14 487-512 1 


1011 
lo 13 






14 67 8 00e-13 515-540 


ion 

1513 


TPPHAAR99 
lrl3UWUOZZ 


/_jH1L' iiiigci, v^ZiiZr type 


14 67 8 lie- 12 300-325 


1017 
1513 


TPPnnnR99 


ZjIiH/ linger, uzn^ type 


14 67 9 05e-12 773-798 


1011 

1513 


TPOAPkl 4^7 


in jtv ri-u Di qu 1 non e/ p i d s to quinone 
oxidoreductase chain 6 


1Q 26 5 81e-09 5-46 


1813 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.25e-09 770-783 


1813 


PR00048A 


C2H2-type zinc finger signature I 


9.94 9.53e-ll 269-282 


1813 


PR00048B 


C2H2-type zinc finger signature II 


5.52 6.79e-10 285-294 


1813 


PR00219C 


Synaptobrevin signature III 


6.91 6.34e-09 33-52 


1815 


EPB000001D 


Kringle 


11.31 2.02e-13 53-69 


1815 


IPB000001H 


Kringle 


12.24 5.29e-ll 318-328 


1815 


EPB000126A 


Serine proteases, V8 family 


11.75 5.07e-ll 53-68 
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Table 3B 
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SEQ 
ID 

NO: 


Database 
entry ID 


Description 


Results* 


i on 

lol5 


1PJ3UUU 1 / /M 


Apple domain 


7.10 O.JJC-IO ZZO"ZOv 


1815 


TDDAAA1 HH~K\ 

1P1300U1 / /N 


Apple domain 


LKJ.l/ J*Z> le- 1 D jUo-jH-Z 


1815 


IPB001254A 


Serine proteases, trypsin family 


9.98 1.00e-15 53-69 


1815 


IPB001254B 


Serine proteases, trypsin iamily 


1 ai /C oAn iq n< no 


1815 


PR00722A 


Chymotrypsin serine protease family (SI) 
signature I 


12.06 9.28e-15 54-69 


1815 


PR00722C 


Chymotrypsin serine protease family (SI) 

• j . _ TTT 

signature III 


10.74 8.88e-13 315-327 


1815 


PR00839B 


V8 serine protease family signature II 


11.20 6.42e-09 53-70 


1817 


IPB003819A 


Taurine catabolism dioxygenase 
TauD/TfdA 


1 A flC O AO ^. AA O A A OA7 

10.95 3.03e-09 2U(J-2U/ 


1817 


IPB003819B 


Taurine catabolism dioxygenase 
TauD/TfdA 


21,71 5.o4e-2o ZIZ-ZjD 


1817 


IPB003819C 


Taurine catabolism dioxygenase 
1 auD/ 1 rclA 


1 1 07 O 7/1 Q n TOO aA/l 

11.5/ Z. /4e-l / ZoSOU4 


1817 


IPB003819D 


Taurine catabolism dioxygenase 
1 auiJ/ 1 IdA 


z/.Dy 4.ooe-oo oiy-3jy 


1 O 1 A 

1819 


1PBUUU387 


1 yrosme specinc protein pnospnatase and 
Quai specmciiy protein pnospnaidse lauuiy 


i a 7^ o oo p 1 o 1 A'XA^ AAA 


i om 

I8iy 


"DTJAAA1 AC* 


riDronectin type ill repeat signature 111 




i q 1 a 

lo iy 


"D"D AAA 1 Af~* 

JrKUUU 14C- 


riDronectin type ill repeat signature 111 


1*+.*+/ ^t.oac-u^ OQO-/UU 


1819 


PR00014C 


Fibronectin type III repeat signature III 


14.47 5.00e-10 1056-1074 


1 O 1 A 

1819 


nn aaa i a r~* 
FKUUU14C 


riDronectin type ill repeat signature 111 


\a ai 7 nn<=» no sszi 
14.*+/ /.uue-uy odd-ooh 1 


1 O 1 A 

18 19 


DDAA1 1 OTJ 


Myelin r u protein signature v 


j.ji i.ooe-U-7 lioo-iziu 


1 O 1 A 

1819 


"DID AA7AA A 


rrotem tyrosine pnospnatase signature i 


^ ^ snp i o 1 ^n^-1 ^1 ^ 


1 O 1 A 

1819 


BT)AA7AAr' 


rrotem tyrosine pnospnatase signatuie m 


1 ^ SQ 7 RRp-1 7 1 A07 


1 O 1 A 

1819 


TIDAA'TAAA 


rrotem tyrosine pnospnatase signature iv 


1 n o o/Zf* oa 1 /IT, 1 _ 1 AAQ 


1 O 1 A 

1819 


r>D AATAAT7 


Protein tyrosine pnospnatase signature v i 


in ^ A 71p 19 1478 14RR 


1819 


PR01371E 


Salmonella^Y ersinia modular tyrosine 
pnospnatase signature v 


11.71 5.98e-12 1432-1443 


1 89H 




Vo/»im1citirifr (^xft/ifnYin 
V tlL/LlUld.Llllg L/yLUHJA.111 


7 27 9 28e-09 100-146 


1821 


IPB000276D 


Rhodopsin-like GPCR superfamily 


9.40 9.65e-09 405-421 


1821 


PR00245D 


Olfactory receptor signature IV 


9.34 1.53e-13 359-368 


1821 


PR00245E 


Olfactory receptor signature V 


8.96 6. 8 8e- 13 406-417 


1822 


IPB003861B 


E4 protein 


9.06 1.00e-08 34-48 


1822 


PR00494E 


Fanconi anaemia group C protein signature 
V 


10.42 8.26e-09 621-640 


1822 


PR01039A 


Tryptophanyl-tRNA synthetase signature I 


14.70 9.59e-09 200-216 



* Results include in order: accession number subtype; raw score; p-value; position of signature in 
amino acid sequence 
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Table 4A 
450 



SEQ 
ID 


Model 


Description 


Mi ' i; 11 Inn 


Score 


Jxepeaxs 


i. osiiion 


912 


MIF 


Macrophage migration 
inhibitory factor (MIF) 


1 At* KA 


i oa n 


l 


MA 997 


912 


GST_C 


Glutathione S-transferase, 
C-terminal domain 


6.2e-25 


92.6 


2 


1-45:410-525 


912 


GST_N 


Glutathione S-transferase, 
N-terminal domain 


1 /la 1 c 

1 .4e- 1 o 


OZ.D 


i 


OOA AHA 


913 


tsp_l 


Thrombospondm type 1 
domain 


2.9e-23 


OA *7 

yu. ^ 


**> 
2 


1 1/1 1 "7/1 • T 2 1 TJ1 
124-1 /4.1ol-Zol 


913 


EGF 


EGF-like domain 


1 t£~. OA 


81. 0 


/ 


AHA ^TAQ.^IA 
4/40UcS.J 14- 

6^^-fiQ9-790-S02- 
uj j . \jy jCi i z*\j .ouz. 

OJU. O " u 


y 14 


JVLlr 


iviacropnage migration 




1 / 27. O 


-j 


9-101 


915 


MIF 


Macrophage migration 
inhibitory factor (MIF) 


6.1e-68 


239.1 




2-129 


916 


trypsin 


Trypsin 


3.8e-99 


314.8 




445-675 


916 


kringle 


Kringle domain 


4.2e~45 


163.3 




286-367 


916 


fh2 


Fibronectin type II domain 


4.5e-35 


94.9 




108-148 


916 


EGF 


EGF-like domain 


6.2e-15 


63.0 


2 


164-197:245-278 


916 


fhl 


Fibronectin type I domain 


5.2e-ll 


45.0 




OAO 0'3'7 
ZUZ-ZJ / 


917 


laminin_EGF 


Laminin EGF-like 

/"r-v * ttt J i r\ 

(Domains III and V) 


2.3e-158 


539.5 


12 


282-346:349- 

4uy:4iz-4oy.4/z- 
520:523-565:786- 
831:834-877:880- 
925:928-984:987- 
1036:1039- 

mO/l.10Q^ 11*20 

iuy4: ltiyD-i i^y 


917 


laminin_Nterm 


Laminm N-termmal 
(Domain VI) 


5.9e-105 


362.1 


1 


A X OCA 


918 


EGF 


EGF-like domain 


3.2e-53 


190.2 


9 


33-68:74-110:116- 
151:161-197:208- 
243:247-282:288- 
323:329-362:368- 


918 


CUB 


CUB domain 


l.le-16 


68.8 


1 


708-817 


919 


Parathyroid 


Parathyroid hormone iamily 


3.1e-az 


Z50.O 


1 


a a 1 o*< 
4o- 1 /D 


920 


EGF 


EGF-hke domain 


z.ze-Zz 


O/.O 


y 


Z-JU.4 /-oZ.oo- 

122:128-163:232- 
zoo.z /o-juo.j i y- 
349:353-394:407- 


921 


IL8 


Small cytokines 
(intecrine/cnemoKine), inter 


2.7e-09 


35.4 


1 


25-65 


922 


Zn carbOpept 


Zinc carboxypeptidase 


O 1 Q 1 1 O 

.3. le-i iy 


Af\Q K 

4uy.D 


i 




922 


Propep_M14 


Carboxypeptidase activation 
peptide 


5.6e-20 


79.8 


1 


7-84 


yz.3 


11 Drinogeii_*w 


jriDrinogcii ljclo. diiu t,a.iniiia 
chains, C-term 


1 . OC" \J I 


9^7 n" 


i 


62-248 


927 


FKBP 


FKBP-type peptidyl-prolyl 
cis-trans isomeras 


1.3e-44 


149.6 


2 


40-111:112-122 


930 


FAD_binding_4 


FAD binding domain 


6.5e-89 


308.8 


2 


11-176:186-381 


930 


FAD~oxidase_C 


FAD linked oxidases, C-- 
terminal domain 


2.6e-63 


223.8 


1 


383-647 


933 


Fringe 


Fringe-like 


1.4e-06 


-12.8 


1 


262-487 
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937 


14-3-3 


14-3-3 protein 


l.le-124 


427.6 


1 


58-282 


938 


S_100 


S-100/ICaBP type calcium 
binding domain 


3.6e-23 


90.4 


1 


65-108 


938 


efhand 


h,b nana 


U.UU1Z 




i 
i 


114-1 49 


940 


ig 


Immunoglobulin domain 


1.2e-33 


113.5 


5 


38-105:139- 
906-941-293*126- 

377-412-487 


941 


LRR 


Leucine Rich Repeat 


3,6e-30 


113.7 


8 


42-63:64-87:88- 
112-113-136-137- 
160-161-184*185- 
208:209-232 


y4i 


T DDfT 


terminal domain 


1 2e-07 


38 9 


1 


242-292 




AiVJUr-DlIlCilng 


/Yivix^-Dmuing, ciizyiiic 


1 2e-16 


62 6 


1 


82-159 


CM /I 


• 

serpm 


QI<a»»t^1'M ^C/iT'i'M^ Vlfrt'f'^CI 

oerpin ^serine piuicd-oc 

HJlIlL/i LKJl J 


7e-1 ^4 

/ C JL J*T 


455 1 


1 


173-532 


y4 1 


ivUIN 


T?TTTsJ Hnmain 

JXLJIN LiVJilldlll 


8e-44 


159.0 


1 


31-163 


y4 / 


ri YD 


J? 1 V JLj ZjIIL\s XLllgt/1 


1 6e-18 


70 7 


1 


529-576 


950 


lectin c 


Lectin C-type domain 


2.4e-08 


41.1 


1 


130-234 


952 


Glyco_transf_8 


Glycosyl transferase family 

o 


0.01 


-47.7 


1 


1255-1526 


953 


TPR 


TPR Domain 


0.0044 


23.7 


1 


171-204 


956 


Branch 


\^ore-z/i-r5ranciiing enzyme 


J.JC 1 V/j£> 


352 2 


1 

X 


74-306 


yj / 




r l Dronecim type in uunidiii 




65 6 


1 

JL 


30-108 


958 


Peptidase M10 


Matrixin 


5.9e-121 


415.2 


1 


58-225 


958 


hemopexin 


Hemopexin 


Zc-jo 


9D7 S 




105-347-349- 

393:398-445:447- 

487 

i w # 


962 


chromo 


'chromo' (CHRromatin 
LJrganiZation ivi^ciiiier^ 


5.5e-09 


35.7 


1 


64-84 


y64 


• 


lininunogiouuiiii uuiijuuii 


1 

l.OC \JJ 


22 6 


I 


34-115 


967 


/tm_l 


/ iransmemDrane receptor 

f f\"\ r\r\ nncin "Pntni 1 \/ 1 
^IIlULlUpalll LtLlli-iLy j 




Sfi 0 

OU, V/ 


9 


41-154-161-237 

i X X +S • i X V X r -wV / 


968 


adh_zinc 


Zinc-binding 
aenyurogcndbca 


7.4e-60 


212.3 


1 


179-495 


y l\j 


Jo, i_aenyarog 


L/enyurogenase ci 

C L>nip UIlcllL 


Z# . 1C JL OJ 


622 7 


1 


129-425 


QH 1 

y / 1 


* 

!g 


Tmmiin i~»cr1 r^V*n1 it*i »"1f*vnn 511*1 
lilllllUllUgll/Ullilil UUlIlalil 


9 ^e-l 1 


39 7 


1 


45-122 


y Id 


ZI-iVllZ< 




3 2e-32 


120 5 


1 

JL 


279-331 


y 15 


CAT) 


o/\r uorn<un 


7 4p-0fi 

/ .tC"ULI 


32 9 


1 


2-36 


975 


WD40 


WD domain, G-beta repeat 


5e-08 


40.1 


3 


234-270:447- 
483:491-528 


977 


An_peroxidase 


Animal haem peroxidase 


5.6e-140 


478.4 


1 


137-701 


978 


Defensin_propep 


Defensin propeptide 


1.9e-17 


71.4 


1 


32-82 


978 


defensins 


Mammalian defensin 


8.6e-05 


23.3 


1 


95-123 


981 


ig 


Immunoglobulin domain 


3.6e-06 


24.9 


1 


49-130 


982 


ig 


Immunoglobulin domain 


1.2e-09 


36.1 


1 


36-112 


yoj 


tsp_i 


TVi rAtnnricnnvi i n ■fvrif^ 1 
JL 111 vmlUUoJJUIlU.111 tjfJJC I 

domain 


0 007*5 


17 1 

i / . i 


1 


151-206 


987 


LRR 


Leucine Rich Repeat 


l.le-58 


208.4 


12 


78-101:102- 

125:126-149:150- 

173:174-197:198- 

221:222-245:246- 

269:270-293:294- 

317:318-341:342- 
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365 


988 


fer4_lNiiH 


4re-4o iron suiiur cluster 
binding proteins 


n nm 5 


1 5 Q 


1 
1 


110-114 


988 


ParA 


Jr arA lamny a. i .rase 


U.UuO 




1 
1 


177-283 


989 


vwd 


von Wilieurana iactor type 
D domain 


1 17 
1 .OcO / 




1 
1 


364-514 


t-\ f> f\ 

989 


vwc 


von Wiilebrand iactor type 
C domain 


5 1*=» 15 


1 90 7 


5 


163:166-224:238- 
289-301-357 


989 


ILL 


irypsin inmuitor iiKe 
cysteine ricn aomdin 




19 ft 


1 

JL 


629-682 






iiniHunogioDuiiii uuiiiciiii 


9 6e-17 i 


59 0 


2 


34-111:150-220 


994 


ig 


Immunoglobulin domain 


1.6e-28 


97.0 


3 


43-110:143- 
197:230-291 


995 


SCP 


S CP-like extracellular 
protein 


4.7e-41 


149.8 


1 


4-173 


996 


ig 


Immunoglobulin domain 


8.9e-06 


23,7 


2 


73-123:153-206 


999 


RNase PH 


3' exoribonuclease family 


1.5e-28 


108.3 


1 


264-469 


1000 


ig 


Immunoglobulin domain 


2.6e-05 


22.2 


1 


38-102 


1002 


zf-C2H2 


Zinc finger, C2H2 type 


7.6e-54 


192.3 


7 


183-205:211- 
233:239-261:267- 
289:295-317:323- 
345:351-373 


1003 


zf-C2H2 


Zinc finger, C2H2 type 


2.1e-114 

■ 


393.5 


16 


260-282:297- 

11 Q.QO^ "5/1 H'^X^'X 

375:381-403:409- 

1 .'fJJ /-'rJi/.HOJ" 

487:493-515:521- 
599:605-627:633- 

UJJ . MvJ 1 UOJ.UU7 

711 


1007 


T TTV If 

LIM 


LIM domain 


i.oe-j / 


1 1R 9 


a 


^00-44^-450- 
507*510-576 


1010 


zf-C2H2 


Zinc finger, C2H2 type 


4.2e-06 


33.7 

< 


3 


2-24:29-52:154- 
177 


1011 


60s_ribosomal 


60s Acidic ribosomal 
protein 


l.le-28 


94.4 


2 


1-22:25-89 


1012 


abhydrolase_2 


Phospholipase/ Carboxyleste 
rase 


1.3e-24 


95.2 


1 


9-213 


1016 


Anti_prohferat 


BTG1 tamily 


3.oe- /5 


95£ 5 


i 
i 




1017 


zf-C2H2 


Zinc finger, C2H2 type 


9.3e-71 


248.5 


11 


147-169:213- 
231:269-291:325- 

J-t/.jJj-j / J.jOl 

403:409-431:437- 
459:465-487:493- 
515:521-543 


1017 


KRAB 


KRAB box 


1.7e-21 


84.8 


1 


42-122 


1 non 
LV2.V 


UM 


j_»iivi cjorri din 




119 6 


4 


78-136 139- 

197:200-256:259- 

318 


1024 


zf-C2H2 


Zinc finger, C2H2 type 


3.3e-53 


190.2 


10 


156-183:184- 

206:212-234:240- 

262:268-290:296- 

318:324-346:352- 

374:380-402:408- 
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410 


1024 


KRAB 


KRAB box 


2.1e-42 


154.3 


1 


15-77 


1023 


KALrZ 


KecomDination activating 
protein 2 


n 
u 


2 


1 
I 


1-S97 


1026 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


3.7e-09 


33.9 


1 


16-54 


1029 


Keratin JB2 


Keratin, high sulfur B2 
protein 


3.8e-06 


16.4 


2 


5-149:150-281 


1031 


pro_isom erase 


Cyclophilin type peptidyl- 
prolyl cis-tr 


1.5e-106 


367.4 


1 


5-165 


1035 


zf-C2H2 


Zinc finger, C2H2 type 


9.9e-114 


391.2 


17 


100-122:132- 

1 . 1 /T A 1 OT.1 OO 

154: loU-loz: loo- 
ZlU:zio-z3o:Z44- 

9^^-999 9QA-100 

434:440-462:468- 
490:496-518:524- 
546-552-574 










67 8 


1 


4-54 


1038 


Sulfate transp 


Sulfate transporter family 


4.3e-103 


355.9 


2 


2-284:441-751 


1 C\1 Q 
l\) jo 


OTA C 
O 1 AO 


o i /\o uomam 








774-987 


1 A.9G 


UDiquitm 


Uuiquiiin iamny 






— j 




1U4Z 


• 


immunoglobulin domain 




1 V/J>.0 




229:264-316:349- 
400:433-501 


1 AX *2 


U rAK JLYo 


U-rAK/Ly-D ooiTiain 




9A8 7 




61-1 on 


1047 


MHC_I 


Class I Histocompatibility 
antigen, domains 


3.2e-147 


502.5 


i 


25-203 


1047 


ig 


Immunoglobulin domain 


0.057 


11.4 


i 


220-285 


1050 


• 


Immunoglobulin domain 




1^ 9 




i in 


1 A-C 1 


r Mr ZZ_d auain 


rJvLr- 

zz/rLNir/ ivir zu/L,iauom 
family 


A AA 1 A 

u.uuiy 


A 9 


— : 


*+- 


11D4 




ocavenger receptor 
cysteine-rich domain 








777-890 




r ZA._receptor 


j\ ir rZA receptor 




9 




11.198 
l->- J oo 


11D / 


zona penuciaa 


Z/Ona peiiuciaa-iiKe aomam 




9S1 






1057 


trefoil 


Trefoil (P-type) domain 


0.02 


9.1 


i 


224-262 


1058 


Aatrans 


.transmembrane ammo acid 
transporter protein 


O A a. An 


4Z.D 




^0 1QQ 


1059 


/tm_i 


/ transmemorane receptor 
(rhodopsin family) 


z.ie-io 


zlzl Q 
44. 0 




^1-980 


1 A/TA 

1060 


in a_suipn_symp 


odium: suliate symporter 
transmem or ane 


1 9/a 1/11 

i.ze-143 


/IOA, 9 




16 


1064 


ERG4JBRG24 


Ergosterol biosynthesis 


l.le-103 


357.8 


i 


7-350 


1066 




Immunoglobulin domain 


4.7e-16 


56.8 




42-95:135- 
192:231-288 


1067 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


2.3e-33 


108.1 




28-266 


1069 


pkinase 


Protein kinase domain 


4.9e-68 


239.4 




298-578 


1069 


Activin_recp 


Activin types I and II 
receptor domain 


1.6e-27 


104.8 




20-107 



WO 03/054152 



PCT/US02/39555 



Table 4A 
454 



SEQ 
ID 


Model 


Description 


E-value 


Score 


Repeats 


Position 


1 AHA 


rMr2z__Claudm 


JrMP- 

22/EMP/MP20/Claudin 
iamiiy 


0,00018 


I loo 
li.2 


1 


3-177 


1071 


MHCJ 


Class I Histocompatibility 
antigen, domains 


4.4e~14 


55.1 


1 


24-196 


1071 


• 


Immunoglobulin domain 


2.8e-07 


28.5 


1 


218-284 


1073 


sugar_tr 


Sugar (and other) 
transporter 


0.028 


126.7 


1 


48-528 


1074 


sugarjr 


Sugar (and other) 
transporter 


0.028 


126.7 


1 


136-616 


1075 


ank 


Ank repeat 


1.7e-45 


164.6 


6 


31-63:64-96:97- 

129:130-162:163- 

195:196-228 


1076 


7tm_l 


7 transmembrane receptor 
(rnodopsin family) 


5.6e-12 


40.2 


1 


178-349 


1077 


sugar_tr 


Sugar (and other) 
transporter 


0.0032 


100.1 


1 


46-470 


1079 


G_glu_transpept 


Gamma- 

glutamyltranspepti dase 


2.8e-05 


144.9 


1 


122-500 


1080 


TPR 


TPR Domain 


2.3e-19 


77.8 


5 


28-61:68-101:108- 

141:148-181:188- 

221 


1081 


ACAT 


Sterol O-acyltransferase 


1.9e-32 


121.3 


1 


300-406 


1082 


WD40 


WD domain, G-beta repeat 


2.3e-16 


67.8 


7 


1015-1050:1059- 

1097:1115- 

1151:1158- 

1194:1203- 

1240:1246- 

lzoi: lzy^-iozy 


i ncj 


pkinase 


rrotein Kinase domain 


/t fin C7 


OAO A 

202.9 




245-492:537-564 


1083 


Activin_recp 


Activin types I and II 

i eceptor domain | 


3.1e-36 


133.8 


1 


26-127 


1084 


pkinase 


Protein kinase domain 


4.9e-57 


202.9 


2 


310-554:599-626 


1 AQ/1 


Activin_recp 


Activin types I and II 
receptor domain 


3.1e-36 


133.8 


1 


26-127 


1 nc^ 




CZ domain 


1.7e-0o 


35.0 


1 


233-316 


1 act 


Cache 


Cache domain 


1.5e-25 


96.2 




557-650:960-985 


1088 


Glyco_hydro_3 1 


Glycosyl hydrolases family 
31 


4.9e-268 


903.8 


2 


1-92:114-636 


iuyu 


rlM(jr_DOX 


HMG (high mobility group) 
box 


3.8e-32 


120.2 


1 


/TO 1 TilA 

681-749 


1 AO 1 


* 

serpm 


Serpm (senne protease 
inhibitor) 


3.1e-195 


662.0 


•t 
1 


315-683 


iuyz 


trypsin 


Trypsin 


A A A /I /i 

0.0044 


12.4 


1 


406-526 


1 aa /i 


Aa_trans 


Transmembrane ammo acid 
transporter protein 


0.0042 


-25 A 


1 


1/11 £- 1 

141-551 


1097 


ATP-synt_C 


ATP synthase subunit C 


3.3e-18 


73.9 


1 


72-140 


1 0QR 


al/UIll LabC 


Aconuase iamiiy ^acomraie 
hydratase) 


1 1 OR 


1 T 
UJl./ 


z 


LOZ-ZH I .OZI- /44 


1098 


Aconitase^C 


Aconitase C-terminal 
domain 


8.9e-72 


251.9 


1 


872-1043 


1103 


PAP2 


PAP2 superfamily 


6.3e-15 


63.0 


1 


89-236 


1104 


ig 


Immunoglobulin domain 


1.4e-20 


71.3 


2 


80-148:183-251 


1106 


TCTP 


Translationally controlled 
tumor protein 


3.5e-93 


323.0 


1 


1-166 
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1109 


efhand 


EF hand 


1.2e-13 


58.8 


3 


33-61:102- 


1110 


ATP1G1 PLM_ 
MATS 


ATP 1 G 1 /PLM/M AT8 
family 


1.8e-13 


58.2 


1 


92-146 


1112 


ldl_recept_a 


Low-density lipoprotein 
receptor domain 


0.00073 


26.3 


1 


115-153 


1112 


CUB 


CUB domain 


0.002 


-3.5 


1 


9-109 


1115 


7tin_l 


7 transmembrane receptor 
(rhodopsin family) 


3.9e-21 


69.3 


1 


139-317 


1116 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1.4e-24 


80.3 


3 


2-28:114-275:348- 
363 


1119 


tsp_l 


Thrombospondin type 1 
domain 


4.5e-38 


139.9 


10 


149-198:306- 
364:571-626:631- 
69o:7U7- /ol:o41- 

ooy:y /u- 

i no 1 • i noo 
iuzi . iuyy- 

1148:1219- 


1120 


HMG14 17 


HMG14 and HMG17 


1.3e-34 


128.4 


1 


2-86 


1125 


ig 


Immunoglobulin domain 


4.7e-20 


69.7 


3 


Q/1 1 <1 « 1 

255:292-347 


1127 


ig 


Immunoglobulin domain 


l.6e-l0 


38.9 


1 


/ii iii 

4Z-1 1/ 


1131 


C2 


C2 domain 


5.7e-19 


76.5 


2 


o^'7./:^'7 o^a 
lo /-Zd /:do /- /du 


1133 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


6.9e-29 


93.9 


1 


104-352 


1134 


MIP 


Major intrinsic protein 


3.9e-40 


125.5 


3 


80-189:197- 
zoz:3Uo-3ZD 


1136 


sugarjr 


Sugar (and other) 
transporter 


0.024 


124.9 


1 


23-504 


1137 


ATP-synt C 


ATP synthase subunit C 


3.3e-35 


130.4 


2 


14-79:90-155 


1138 


DHDPS 


Dihydrodipicolinate 
synthetase family 


4 4 O 

•4.4e-32 


120.0 


1 


1 A IOC 

34-325 


1139 


SSF 


Sodium: solute symporter 
family 


1 _ AO 

1.5e-48 


1 *7 A H 

174. / 


1 




1143 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


2.2e-50 


162.2 


1 




1145 


PID 


Phosphotyrosine interaction 
domain (PTB/PID) 


1.9e-94 


327.2 


2 


hoo-OZ / .00 1-/OZ 


1145 


WW 


WW domain 


2.5e-08 


41.1 


1 


j /J-4U1 


1148 


C2 


C2 domain 


2.1e-30 


114.4 


2 


0 /- 1 00 .Z^U-jZU 


1152 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


9.6e-3l 


99.8 


1 


iri /I 1 A 
101-41U 


1153 


WD40 


WD domain, G-beta repeat 


0.013 


22.1 


4 


zo-oz. / 1-lVjy.ZJU- 

271:430-467 


1155 


aajermeases 


Amino acid permease 


5.6e-06 


179.9 


1 


193-613 


1158 


dUTPase 


dUTPase 


5.le-29 


109.8 


1 


46-167 


1158 


MIP 


Major intrinsic protein 


3.8e-28 


88.8 


1 


2-56 


1158 


rvp 


Retroviral aspartyl protease 


2.le-22 


85.1 


1 


179-280 


1158 


G-patch 


G-patch domain 


0.00095 


25.9 


1 


285-329 


1160 


laminin_EGF 


Laminin EGF-like 
(Domains III and V) 


9.5e-l32 


451.1 


14 


299-353:356- 

423:426-466:490- 

532:535-578:584- 

627:630-680:683- 

727:1265- 
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1308:1311- 

T5<0. 1 1^S. 

13DZ: 13DD- 
1 zini • 1 ADA 

1452:1686- 
1730:1733-1786 


1160 


lamimn_Nterm 


T ' \r / _ • ^ 1 

Lammm N-termmal 
(Domain VI) 


4e-77 


zoy.o 


i 


4/-zy / 


1160 


lammm_G 


Laminm G domain 


o.le-Jl 


1 lo.z 


3 


ZoZj-Z/4o.3U1 J- 
jIjo.j! cSD-j J 1 J 


1160 


laminin B 


Laminin B (Domain IV) 


Lle-09 


45.6 


1 


1517-1651 


1161 


EGF 


bOr-like domain 


Z.ze-zo 


1A1 1 

1U1.1 


3 


Zy- J / . OU-O o . 1 

1 AC\' 1 47 1 78- 1 £^ 
1HU. l*+/-l / Q. 1 oj- 

216 


1 1 

1 loZ 


mterieron 


mterieron aipna/oeia 
domain 


1 ^#=» 1 7 

1 .JC-l / 


71 7 
/ 1 . / 


1 
1 


1^-171 


1 1 /CI 

1 163 


TT 1 

1.L1 


interleuKin- 1 / lo 


o. /e-zi 


89 *\ 


i 
1 


1 Q 1 
i y- ijy 


1 lo4 


rL/Cxr 


r^ateiet- derived growtn 
iacior (rUur ) 


i .oe-D i 


1 CH-.H- 


1 

L 


^9-1 ^0 

j>z- 


1165 


IL1 


Interleukin-1 / 18 


3.7e-23 


90.4 


1 


11-144 


1167 


Palm_thioest 


Falmitoyl protein 
thio esterase 


z.^+e-zio 


7"39 1 


i 
1 


98 989 


1 1 /TO 

1 loo 


* 

serpm 


ijerpin ^serine protease 
inhibitor) 


i .oe-zuz 


OoO.Z 


1 
1 


Al-A 1 -S 

*T / H 1 J 


I 1 'Tfl 

I I /u 


uetensm propep 


lyeiensm propeptide 


1 9^ Ok 


i H9 n 


1 
1 


JOVU 


1170 


defensins 


Mammalian defensin 


7.3e-14 


59.5 


1 


103-131 


1172 


Y_phosphatase 


Protein-tyrosme 
pnospnatase 


1 1 in 

l.oe-1 1U 


i on A 


i 
1 


971 /IOQ 

z / i-4yy 


H79 


Metallophos 


Calcineurin-like 
pnospnoesterase 


l.le-11 


52.3 


1 


70-285 


H84 


MHC_I 


Class I Histocompatibility 
antigen, domains 


6.2e-06 


-5.9 


1 


29-205 


1 187 


PLA2_B 


Lysophospholipase catalytic 
domain 




1 CO Q 


i 
1 


con 


1 1 QH 
115/ 


/~iO 

L,Z 


L^z domain 


i .oe-uo 




1 
1 


A«-\ 90 


I 1 8Q 

I I oy 


iipocann 


.uipocaiin / cytosoiic iatiy- 

a\s\\\ Ulllulllg, 


9 ^_^Q 
Z.OC-Ji/ 




1 
1 


7Q-1 8R 
j y loo 


I 1 8Q 

I I oy 




rvUlllLZ/ OUVII1C pallwICclllW' 

trypsin inhibito 




Q0 0 


9 


231-273-274-319 


1190 


ig 


Immunoglobulin domain 


4e-07 


37.1 


1 


38-116 


1191 


ig 


Immunoglobulin domain 


4e-07 


37.1 


1 


38-116 


1200 


zf-DHHC 


DHHC zinc finger domain 


3.7e-15 


63.8 


1 


87-149 


1201 


KRAB 


KRAB box 


6.6e-22 


86.2 


1 


33-73 


1202 


KRAB 


KRAB box 


6.6e-22 


86.2 


1 


33-73 


1204 


trypsin 


Trypsin 


3e-31 


117.2 


1 


56-239 


1212 


LRR 


Leucine Rich Repeat 


2.5e-54 


193.9 


15 


73-96:97-122:123- 
149:150-175:176- 
lyj: 19 /-ZZU:ZZ1- 

94fv9A7.-97l>9Q?- 

317:318-337:339- 
362:363-388:389- 
408:410-433:434- 
459 


1213 


Zn carbOpept 


Zinc carboxypeptidase 


4.3e-62 


219.7 


1 


50-288 


1221 


Clq 


Clq domain 


3.4e-41 


150.3 


1 


134-258 


1221 


Collagen 


Collagen triple helix repeat 


2.6e-10 


47.7 


1 


41-100 
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TT\ 
111 


Model 


uescnpuon 


ji/-v<uue 


ijLurt 




JT UdlllUll 






^zu copies^ 










1223 


pkinase 


Protein kinase domain 


9e-75 


261.8 




158-417 


1 OTJ 




ori j aornam 




7 4 






1223 


UBA 


UBA/TS-N domain 


0.037 


20.6 




987-1026 


1224 


vwa 


von Willebrand factor type 
A domain 


1.2e-05 


29.0 




51-242 


1225 


abhydrolase 


alpha/beta hydrolase fold 


3.7e-12 


53.8 




111-390 


1227 


lectin c 


Lectin C-type domain 


9.6e-29 


108.9 




54-166 


1237 


fibrinogen_C 


Fibrinogen beta and gamma 
chains, C-term 


L5e-09 


-1.3 




242-385 


1257 


LRR 


Leucine Rich Repeat 


9e-65 


228.6 


14 


no 1 n 1 -f r\<"* 

78-101:102- 

125:126-149:150- 

173:174-197:198- 

221:222-245:246- 

269:270-293 :294- 

31 / :3 lo-o4l.34Z- 

365:366-389:390- 

1 j 


1257 


LRRCT 


Leucine rich repeat C- 
terminal domain 


0.011 


22.3 




423-475 


1263 


Pep_M12B_prop 
ep 


Reprolysin family 
propeptide 


1.8e-28 


108.1 


i 


75-191 


1263 


disintegrin 


Disintegrin 


2.7e-ll 


51.0 


i 


419-494 


1263 


Reprolysm 


Reprolysin (M12B) family 
zinc metallo 


7.3e-07 


-zz.z 




ZUO-4UZ 


1266 


Clusterm 


Clusterm 


o ia one 
z.3e-z9o 


1 AH/1 

1UU4. 
o 




z-3^4 


1267 


* 

serpin 


kerpin (serine protease 
inhibitor) 


3.oe-i4U 




— : 


RA 49^ 


lzoo 


• 

serpm 


oerpin (serine protease 
inhibitor) 


o.oe- loz 


JJ 1 .Z 




70-40 1 


1269 


COX6C 


Cytochrome c oxidase 
subunit Vic 


2.3e-38 


140.9 


> 


1-75 


1273 


Rhodanese 


Rhodanese-like domain 


1.5e-48 


174.7 




16-138:165-266 


1274 


trypsin 


Trypsin 


2.9e-80 


280.1 


— 


lzl-346 


1279 


Calpamlll 


Calpain large subunit, 
domain III 


1.5e-25 


9o.3 




3u-i /y 


1287 


zf-C2H2 


Zmc finger, C2H2 type 


4.4e-66 


233.0 


9 


173-195:201- 

223:229-251:257- 

279:285-307:313- 

335:341-363:369- 

391:397-419 


1287 


KRAB 


KRAB box 


4.9e-23 


9U.0 


1 


1 4 ^/t 


1294 


vwc 


von Willebrand factor type 
C domain 


1.3e-09 


45.4 


1 


oy-iz / 


1295 


zf-C2H2 


Zinc finger, C2H2 type 


5e-50 


179.6 


8 


263-285:319- 

341:347-369:375- 

397:403-425:431- 

4S^-4^Q-4R1 -4R7- 

510 


1295 


KRAB 


KRAB box 


8.9e-26 


99.1 


1 


8-48 


1296 


zf-C2H2 


Zinc finger, C2H2 type 


3.2e-113 


389.6 


13 


190-212:218- 

240:246-268:274- 

296:302-324:330- 

352:358-380:386- 

408:414-436:442- 
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score 


ivepeais 


Pncifinn 














464:470-492:498- 
520-526-548 


1296 


KRAB 


KRAB box 


6.7e-28 


106.1 


1 


14-54 


1299 


hemopexin 


Hemopexin 


7e-25 


o£ 1 




^67-369-415-417- 
461 


1299 


Peptidase _M 10 


Matrixin [ 




J o.o 


i 
i 


1-176 


1305 


DUF25 


Domain of unknown 
function DUF25 


5.3e-76 


265.9 


1 


40-162 


1307 


kazal 


Kazal-type serine protease 
inhibitor domain 


L3e-12 


55.3 


1 


21-68 


1309 


Defensin propep 


Defensm propeptide 


l.ze-zu 


1UZ.U 


i 
i 


74-1 26 


1309 


defensins 


Mammalian defensin 


7.3e-14 


59.5 


1 


139-167 


1312 


ras 


Ras family 


2.7e-59 


1 1 A /I 

ZlU.4 


1 


£A 7^1 


1314 


zf-C2H2 


Zinc finger, C2H2 type 


l.le-51 


185.1 


8 


163-185:191- 

71 7*71 Q-741 ! ?4.7- 

269:275-297:303- 
381 


1315 


KeratinJB2 


Keratin, high sulfur B2 
protein 


0.0035 


-30.0 


1 


13-154 


1316 


Keratin_B2 


Keratin, high sulfur B2 
protein 


3.6e-15 


63.8 


1 


77 1 


1317 


Keratin_B2 


Keratin, high sulfur B2 

* 

protein 


8.2e-20 


79.2 


z 


0f\ 1£7-1£d-9QS 
Z\J- 1 Oj . 1 OHr-^yj 


1319 


zf-C2H2 


Zinc finger, C2H2 type 


2.9e-52 


187.1 


1 


33-55:61-83:89- 

111*11 7-1 7Q-1 

111.11 /-lJ7. I H-.J - 

167:173-195:201- 
223 


1321 


Keratin_B2 


Keratin, high sulfur B2 
protein 


0.00014 


-8.3 


1 


23-193 


1331 


is 


Immunoglobulin domain 


9.7e-06 


3z.!> 


1 


94 78 
Zt- / o 


1332 


Kunitz_BPTI 


Kunitz/B ovine pancreatic 
trypsin mhibito 


2.2e-18 


74.5 


1 


76-126 


1333 


zf-C2H2 


Zinc finger, C2H2 type 


4.6e-55 


196.3 


7 


199-221:227- 

74Q- 7 S ^-777*283- 

305:311-333:339- 
361:367-389 


1333 


KRAB 


KRAB box 


D.oe-24 




i 
i 


\J~H\J 


1336 


vwa 


von Willebrand factor type 
A domain 


0.0038 


-8.1 


1 


32-193 


1337 


vwa 


von Willebrand factor type 
A domain 


0.0038 


O 1 

-8.1 


1 


17 1 Q7 

jz- lyo 


1339 


PMP22_Claudin 


PMP- 

22/EMP/MP20/Claudin 
family 


0.0088 


-16.3 


1 


A 9H7 


1340 


PMP22_Claudin 


PMP- 

99/ThMP /MP? O/Cl audin 
family 


0.0019 


-4.7 


1 


4-194 


1343 


is 


Immunoglobulin domain 


1.5e-13 


58.5 


2 


81-158:204-278 


1347 




Immunoglobulin domain 


1.2e-07 


38.9 


2 


168-226:276-330 


1348 


ig 


Immunoglobulin domain 


1.5e-18 


75.1 


3 


157-217:269- 
325:373-427 


1353 


LRR 


Leucine Rich Repeat 


2e-39 


144.4 


9 


120-143:144- 
167:168-191:192- 
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215:216-239:240- 
263:264-287:311- 
334:335-358 


1353 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.0066 


23.1 


1 


92-119 


1356 


ig 


Immunoglobulin domain 


2.1e-17 


71.3 


3 


37-106:138- 
208:245-301) 


1357 


CD36 


CD36 family 


3.9e-231 


781.3 


1 


5-445 


1358 


DUF139 


Cysteine rich repeat 
(DUF139) 


1.2e-08 


42.2 


4 


195-211:245- 

261:288-304:316- 

332 


1359 


• 

ig 


T 1 1 1 • "1 * 

Immunoglobulin domain 


1.6e~06 


35.1 


2 


41-124:156-230 


1359 


Gag MA 


Matrix protein (MA), pi 5 


0.0044 


-25.2 


1 


291-429 


1360 


ion trans 


Ion transport protein 


0.0031 


24.2 


1 


285-445 


1363 


spectrin 


Spectrin repeat 


9.5e-22 


85.7 


6 


17-121:124- 
226:229-340:372- 
476:678-785:788- 
896 


1366 


Clq 


Clq domain 


2.1e-31 


117.8 


1 


266-390 


1366 


Collagen 


Collagen triple helix repeat 
(20 copies) 


0.00029 


19.8 


l 


182-241 


1367 


rnaseA 


Pancreatic ribonuclease 


l.le-34 


128.7 


I 


27-132 


1368 


3HCDH_N 


3 -hydroxyacyl-CoA 
dehydrogenase, NAD 
binding 


0.0021 


-60.2 


l 


50-141 


1371 


voltage_CLC 


Voltage gated chloride 
channel 


3.1e-199 


675.3 


l 


92-528 


1371 


CBS 


CBS domain 


1.2e-24 


95.4 


2 


559-617:761-814 


1375 


Peptidase_Cl 


Papain family cysteine 
protease 


le-120 


A "1 A C 

414.5 


l 


114-332 


1376 


7tm_l 


7 transmembrane receptor 
(rhodopsm family) 


2.6e-50 


180.6 


l 


48-454 


1378 


7tm_2 


7 transmembrane receptor 
(Secretin family) 


4.9e-09 


32.5 


l 


250-500 


1380 


MAGE 


MAGE family 


l.4e-9l 


317.7 


l 


3-231 


1381 


• 

ion trans 


Ion transport protein 


0.066 


16.8 


l 


158-323 


1382 


ig 


Immunoglobulin domain 


7e-l2 


52.9 


2 


37-128:160-241 


1385 


MHCJ 


Class I Histocompatibility 
antigen, domains 


9.3e-06 


-8.4 


l 


29-204 


1388 


pro_isom erase 


Cyclophilin type peptidyl- 

1 1 4 A 

prolyl cis-tr 


7e-88 


305.4 


l 


45-184 


1390 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


5.3e-06 


21.2 


l 


1-210 


1393 


7tm_l 


7 transmembrane receptor 
(rhodopsm family) 


l.6e-l2 


55.0 


l 


99-329 


1397 


fh3 


Fibronectin type III domain 


4.7e-78 


272.7 


5 


533-619:631- 
717:734-833:854- 


1397 


ig 


Immunoglobulin domain 


3.7e-42 


153.5 


5 


154-227:261- 

318:362-418:453- 

511:1253-1316 


1404 


ig 


Immunoglobulin domain 


2.9e-08 


40.9 


2 


42-99:139-198 


1409 


ig 


Immunoglobulin domain 


8.le-30 


112.5 


3 


142-199:263- 
321:359-438 


1414 


mito carr 


Mitochondrial carrier 


3.5e~64 


226.7 


3 


40-130:137- 
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Model 
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TT— xml hp 


Score 


Ren eats 


Position 

227:238-322 


1416 


is 


protein 

Immunoglobulin domain 




25 6 


1 


57-112 


1417 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING linger) 


6.6e-06 


33.1 


1 


27-67 


1418 


Synaptophysm 


Synaptophysm / 
synaptoporin 


Q ^p_7^ 
7.JC- / O 


255 1 


1 


27-273 


1419 


PMP22_Claudin 


PMP- 

22/EMP/MP20/ Claudin 
family 


5.2e-18 


73.3 


1 


4-165 


1421 


Ammonmm_tran 
sp 


Ammonium Iransporter 
Family 


OJo-io 


72 6 


1 


25-383 


1423 


* 

rnaseA 


Pancreatic noonuc lease 


4 1 P _9Q 


110 1 


1 


32-136 


1427 


LRR 


Leucine Rich Kepeat 


t, JC 


100 0 


6 


3-26:27-50:51- 

74:76-99:100- 

123:124-144 


1427 


fn3 


Fibronectin type III domain 


2.7e-08 


41.0 


1 


334-417 


1427 


is 


Immunoglobulin domain 


2.4e-07 


37.9 


1 


225-283 


1427 


LRRCT 


Leucine rich repeat C- 
terminal domain 


7.2e-06 


32.9 


1 


164-209 


1428 


GRAM 


GRAM domain 




13 1 


1 


177-245 


1435 


Rhomboid 


Rhomboid family 


1.5e-45 

He* 909 1 


164.8 
6R4 1 


1 

1 


169-326 
102-513 


1437 
1437 


Sema 
tsp_l 


Sema domain 
Thrombospondin type 1 
domain 


/e-^o^ 

i i P /L9 


vut. i. 

1 ^5 3 


± 

6 


589-637:644- 
695:702-746:833- 
883:890-940:945- 
985 


1437 


PSI 


Plexin repeat 


2.5e-14 


61.0 


1 


531-578 


1438 


ank 


Ankyrin repeat 


2e-15 


64.7 


3 


336-368:383- 
418:468-498 


1438 


ion trans 


Ion transport protein 


9.4e-07 


35.9 


1 


615-817 


1439 


ank 


Ankyrin repeat 






3 


417-449:464- 
499:549-579 


1439 


ion trans 


Ion transport protein 


Q At* 07 


IS 0 


1 


696-898 


1442 


LRR 


Leucine Rich Repeat 


3.2e-20 


80.6 


5 


57-80:81-104:105- 

128:129-152:153- 

176 


1442 


LRRCT 


Leucine rich repeat C- 
terminal domain 


9 ^ 07 


17 0 


1 


186-240 


1442 


LRRNT 


Leucine rich repeat JN- 
terminal domain 


0 0001 1 


98 R 


1 


27-55 


1443 


7tm_2 


7 transmembrane receptor 
(Secretin iamily) 


7.2e-23 


89.4 


1 


166-476 


1443 


HMG_box 


HMG (high mobility group) 
box 


3.1e-05 


30.8 


1 


3-76 


1443 


GPS 


Latrophilin/CL-l-like GPS 
domain 


0.0066 


19.1 


1 


110-162 


1444 


DUF214 


Predicted permease 


U.UU43 


- u.u 


1 

i 


136-276 


1 A^9 


TYll+rt caw 


Mitochondrial carrier 
protein 


8.4e-58 


205.5 


2 


27-123:124-217 


1454 


mito_carr 


Mitochondrial carrier 
protein 


0.00049 


22.3 


2 


79-179:198-287 


1455 


WD40 


WD domain, G-beta repeat 


2.7e-26 


100.8 


4 


64-99:105- 

141:235-273:279- 

315 


1456 


Furin-like 


Furin-like cysteine rich 


6.1e-89 


308.9 


1 


172-328 
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* 

region 










1456 


Recep .L domain 


iveceptor \-i aorndiri 


^ le-80 

J. lv U V 


280.0 


2 


46-169:345-471 


1456 


pkinase 


Protein kinase domain 


7e-65 


229.0 


1 


986-1258 


1462 


Peptidase M10 


JVLatnxin i 


1 Sp-79 

l.JC" / ~' 


277 7 


1 


62-227 


1462 


hemopexm 


riemopexm i 


1 4p-10 


48 6 


1 


309-350 


1463 


Armadillo_seg 


Armadillo^eta-catenin-like 
repeat 


0.028 


21.0 


4 


549-591:641- 

685:686-737:738- 

783 


1466 


C2 


uz uomam 


6 5e-07 


36.4 


1 


219-304 


1468 


RTC 


KJN J\. j -Terminal pnoopiiaic 
cyclase 


0 0006 


123.2 


1 


3-221 


1476 


CN hydrolase 


Carbon-nitrogen hydrolase 


0.0014 


-83.3 


1 


64-341 


1477 


zf-C2H2 


Z»inc linger, i^zxiz type 




33 7 


3 


267-289:294- 
317:419-442 


1478 


Euk porin 


Eukaryotic porin 


3.3e-06 


-42.1 


1 


1-142 


1484 


BTB 


Id 1 Dir\J/-i aomdin 


1 8e-09 


44 9 


1 


52-194 


1487 


rvt 


iveverse transcriptase 


S 6e-56 


199 4 


1 


241-507 


1489 


Galactosyl 1 2 


oal actosyi transierase 


2, 6e-175 


595 3 


1 


97-404 


1491 


GBPC 


ouanyiaie-Dinamg proiem, 
L^-xermindi uuiiiam 


1 8e-32 


117 0 


1 


23-125 


1493 


XTT TTMV 


TV At i+' 1 1 lil^a /Iatviqih 


0 0018 


23 5 


1 


297-439 


1496 


Big_2 


Jtiacteriai ig-iiKe aomam 
(group 2) 


4 5e~16 


66.8 


1 


1177-1258 


1497 


• 

iff 


T-»-vi wi 1 1 f> /~\ rrl Ani iltft /n /-vino in 

immunogiODUim aumain 


9 9e-13 


46 1 


1 


38-112 


1498 


aldo ket red 


Aldo/keto reductase family 


6.8e-37 


126.5 


2 


207-237:245-359 


1502 


Caveolin 


Caveolin 


1.6e-06 


13.3 


1 


90-262 


1504 


sushi 


Sushi domain (SCR repeat) 


8.2e-16 


66.0 


2 


223-281:286-344 


1511 


zf-C2H2 


Zinc Finger, C2H2 type 


l.le-16 


68.9 


2 


341-363:369-391 


1511 


KRAB 


KRAB box 


6e-05 


22.1 


1 


231-275 


1516 


CBS 


CBS domain 


2.8e-33 


124.0 


3 


189-243:264- 
317:336-389 


1517 


zf~C2H2 


Zinc finger, C2H2 type 


2.1e-128 


440.0 


18 


279-301:307- 
329:335-357:363- 

385:391-413:419- 

407-^0^-525-868- 

890-896-918-924- 

946*952-974:980- 

1002:1008- 

1030:1036- 

1058:1064- 

1086:1092-1114 


1517 


KRAB 


KRAB box 


5.3e-69 


242.7 


2 


74-136:684-735 


1518 


ifi 


ImmunogiODUim aomam 




29 8 


1 


38-102 


1521 


CoA trans 


Coenzyme A transterase 


1 1p 1^1 




7 


42-261:271-472 


1521 


TGF-beta 


1 ransrormmg growtn iactor 
beta like domain 


O.OC-O't 


775 7 


1 

i. 


637-741 




Renrolvsin 


Reprolysin (M12B) family 
zinc metallo 


4.4e-95 


329.3 


1 


223-422 


1523 


Pep_M12B_prop 
ep 


Reprolysin family 
propeptide 


6.7e-30 


112.8 


1 


121-234 


1523 


disintegrin 


Disintegrin 


l.le-18 


59.5 


1 


437-513 


1523 


EGF 


EGF-like domain 


0.0092 


22.6 


1 


661-693 


1524 


DSPc 


Dual specificity 
phosphatase, catalytic doma 


3.6e-71 


249.9 


1 


102-241 

..... 
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1525 


7tm„l 


7 transmembrane receptor 
(rhodopsin family) 


5.6e-14 


46.6 


1 


1-53 


1527 


Spermine_synth 


Spermine/spermidine 
synthase 


6e-13 


56.4 


1 


«"» A A A 1 

254-441 


1528 


efhand 


EF hand 


3.8e-10 


47.1 


4 


65-91:145- 

173:329-357:365- 

393 


1530 


rvt 


Reverse transcriptase 


6e-68 


239.2 


l 


125-395 


1531 


RPH3A_effector 


Rabphilin-3A effector 
domain 


1.9e-09 


-6.1 


I 


1/^1 OHO 

161-343 


1533 


TPR 


TPR Domain 


4.1e-10 


47.0 


3 


220-253:444- 
477:478-511 


1534 


SAM 


SAM domain (Sterile alpha 
motif) 


0.046 


18.0 


1 


ooo- /->i 


1538 


pkinase 


Protein kinase domain 


7.2e-42 


152.5 


2 


42-76:82-242 


1540 


zf-C2H2 


Zinc finger, C2H2 type 


0.00035 


27.3 


1 




1541 


COLFI 


Fibrillar collagen C-terminal 
domain 


8.5e-216 


553.4 


1 


624-841 


1542 


EGF 


EGF-like domain 


2.6e-ll 


51.0 


3 


916-947:1004- 
1044: lloU-luy4 


1543 


iff 


Immunoglobulin domain 


1.3e-12 


45.7 


1 


38-112 


1544 


UCH-1 


Ubiquitin carboxyl-terminal 
hydrolases famil 


9.1e-15 


62.5 


1 


477-508 


1544 


UCH-2 


Ubiquitin carboxyl-terminal 
hydrolase family 


6.4e-13 


56.3 


1 


759-811 


1544 


zf-UBP 


Zn-finger in ubiquitin- 
hydrolases and other 


0.00069 


18.9 


1 


350-420 


1546 


LRR 


Leucine Rich Repeat 


2.4e-30 


114.3 


9 


132-155:156- 

179:180-199:201- 

224:250-269:271- 

294:295-318:319- 

341:342-365 


1546 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.00082 


26.1 


1 


62-84 


1551 


Peptidase_S9 


Prolyl oligopeptidase family 


4.3e-10 


36.8 




5y /-4oy 


1552 


Orn DAP Arg d 
eC 


Pyridoxal-dependent 
decarboxylase, C- 


1.5e-14 


55.5 


i 


150-202 


1552 


Orn_Arg_deC_N 


Pyridoxal-dependent 
decarboxylase, py 


A A t 1 

4.4e-ll 


/in o 

40.3 


1 




1555 


rvt 


Reverse transcriptase 


5e-17 


67.6 


i 


616-718 


1557 


rvt 


Reverse transcriptase 


l.le-68 


nil £" 

241.6 




Z4/-0ZU 


1558 


RhoGEF 


RhoGEF domain 


1.5e-25 


98.4 


i 


971-1155 


1558 


PDZ 


PDZ domain (Also known 
as DHR or GLGF) 


4.4e-12 


53.6 


1 


159-234 


1558 


PH 


PH domain 


0.0017 


17.5 


1 


1199-1312 


1559 


pkinase 


Protein kinase domain 


1.9e-24 


92.1 




151-/51:309-41? 


1560 


FH2 


Formin Homology 2 
Domain 


4.4e-lll 


382.4 


1 




1565 


HMG_CoA_synt 


Hydroxymethylglutaryl- 
co enzyme A synthas 


6.5e-300 


1009. 
7 




50-582 


1568 


GCV H 


Glycine cleavage H-protein 


1.6e-76 


267.6 




65-185 


1577 


Peptidase_Ml 


Peptidase family Ml 


1.3e-114 


332.5 




42-161:225- 
354:365-399 


1579 


rrm 


RNA recognition motif. 


0.0068 


23.0 




823-891 


1580 


serpin 


Serpin (serine protease 


8.9e-135 


458.1 


3 


37-241:352- 
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inhibitor) 








475:504-533 


1581 


CH 


Calponm homology (CH) 
domain 


0.012 


13.5 


l 


Q/l lOI 


1582 


LRR 


Leucine Rich Repeat 


2.2e-37 


1 on c 
137.6 


1 o 

1Z 


/u-yj.y^— 1 1 / . 1 1 o- 

14 L . 14Z- IDJ . 1DU- 

189:190-213:214- 

285:286-310:311- 
335-336-359 

J J J > J U *J J 


1582 


LRRCT 


: : 

Leucine rich repeat C- 
terminal domain 


o.ie-iz 


^9 7 


i 
i 


369-421 


1582 


• 

ig 


Immunoglobulin domain 


1 Of* OR 


%0 9 


1 
i 


438-499 


1582 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.0099 


22.5 


1 


28-68 


1587 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


0.00027 


18.1 


1 


168-213 


1587 


IBR 


IBR domain 


U.UU / 1 


1 0 9 

1VJ.Z 


1 
i 


237-298 


1588 


rvt 


Reverse transcriptase 


1.2e-05 


25.4 


1 


192-250 


1589 


rvt 


Reverse transcriptase 


i Q in 


AA 9 


i 
l 




1591 


TPR 


TPR Domain 




^7 1 




321 -322-355 


1592 


PHD 


PHD-lmger 




9 0 


9 


6^4-725-1 117- 
1163 


1595 


Peptiaase_JVll 


x'eptioase iamny ivii 


?p-1 1 ^ 


329 1 


1 


100-279 


1597 


IPPc 


inositol poiypnospnate 
phosphatase family, c 




57 7 




50-112 

W V/ JL -4- 


1600 


thyroglobulin_ 1 


1 liyrogioDuiin type-i repeat 




34 1 

wJ". X 




38-118 


1604 


ig 


Immunoglobulin domain 


0.014 


13.3 


i 


34-59 


1606 


zf-CXXC 


CXXC zinc ringer 


i . / e-z i 






50-96 


1606 


F-box 


F-box domain 


/ .ze-uo 


^9 Q 




545-602-715-762 


1606 


PHD 


rHD-rmger 


u.uoz 


1 3 


— j 


201 -21 \ 

jt-l \J f JL* i L 


1613 


rrm 


RNA recognition motii. 


9 Q<= 1A 


ou.o 


— j 


151-221 


1614 


SRCR 


Scavenger receptor 
cysteme-ncn uomam 


9 Op 9£ 

z.ze-zo 


101 1 


— j 


52-145 


1617 


B56 


Protein phosphatase 2A 
regulatory d suounu 


6.1e-35 


124.7 




146-218 


1618 


SI 


SI RNA binding domain 


1.2e-10 


44.3 


i 


352-429 


1619 


trypsin 


Trypsin 


<: 1 e* 1Q 
D. 1C-J7 


19^^ 
1ZJ» . J 




30-122 


1620 


STphosphatase 


Ser/Thr protein phosphatase 


1.7&-71 


245.5 


i 


66-236 


1623 


Flgl 


Flagellar P-rmg protein 


D.4e-ol 


9Q9 ^ 




70-931 


1623 


FlgH 


Flagellar L-ring protein 


7.5e-34 


110.4 


i 


21-73 


1625 


Semialdhyde_dh 
C 


Semialdehyde 
dehydrogenase, dimerisat 


o.le-61 


ZlD.J 






1625 


Semialdhyde_dh 


Semialdehyde 
dehydrogenase, NAD bmai 


2.6e-34 


109.5 


1 


20-128 


1628 


HTH_1 


Bacterial regulatory helix- 
turn-helix protei 


2.6e-33 


124.1 


1 


7-146 


1 A'X 1 


X jO/Jl -ui-iiizcro 


PT?P-iitili7in^ enzvme. TIM 

barrel doma 


9.1e-64 


225.3 


x 


53-163 


1632 


ACR tran 


AcrB/AcrD/AcrF family 


3.1e-15 


58.8 




188-254 


1632 


OTCace 


Aspartate/ornithine 
carbamoyltransferase, A 


1.7e-09 


35.8 




459-514 


1632 


OTCace__N 


Aspartate/ornithine 
carbamoyltransferase, c 


0.44 


8.5 




411-426 


1632 


Replicase 


Replicase family 


2.9 






399-576 
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102.2 






1634 


ribonuclease T2 


Ribonuclease T2 family 


4.3e-17 


70.2 


1 


78-138 


1638 


efhand 


EF hand 


5.6e-10 


46.6 


4 


TO /C/C7/t 1 AO. Ill 

3S-oo:74-lu2: 111- 
1 jy: 14 /-I /5 


1638 


RmaAD 


Ribosomal RNA adenine 
dimethylase 


5 


2.6 


1 


133-173 


1 /" A O 

1643 


T* ! 1 IT OO 

Ribosomal L23 


T» "1- 1 j. 1 T OO 

Ribosomal protein L23 


1 O „ 1 A 

1.2e-14 


53.9 


1 


iy /-Z /O 


1645 


tsp_l 


Thrombospondm type 1 
domain 


0.0026 


O 1 o 

21.0 


5 


42U-4 /o:4oU- 
DJ4.D41-dU1.oU /- 

663:669-718 


1647 


Argmosuc synth 


Argmosuccinate synthase 


z.oe-oo 


oio o 


o 

Z 


A O 1 O/T. 1 OO 1 OQ 

4o- IZo. IZo-loy 


1651 


vwa 


von Willebrand factor type 
A domain 


l.le-33 


125.4 


1 


89-326 


1652 


* 

ig 


Immunoglobulin domain 


^ 1 a, 1 fi 

o. le-iu 


o /. 1 


o 

Z 


J4-1 1 / .ZZ\J-Zyj 


1654 


PP2C 


Protein phosphatase 2C 


3.4e-05 


21.8 


1 


86-138 


1658 


tniored 


Tnioredoxm 


l.le-lUj 




o 

z 


Jz4-43z:4Dy-D /U 


1662 


Hist_deacetyl 


Histone deacetylase family 


5.5e-85 


one *7 

295.7 


o 

2 


oD /-924: 124 /- 

1 /I a >1 
1434 


1666 


wap 


W AF-type (Wney Acidic 
rroteinj iour-aisuiii 


1 ,z>e-zl 


1 

o5. 1 


i 
1 


/Z- 1 1 0 


1667 


KKA13 


KKArJ DOX 


A O- /|Q 


13O.0 


i 
1 


1 f\/Z 1 /TO 

iUO-lOo 


1672 




immunoglobulin domain 


1 o 11 
le-l 3 


4y. j 




14D-ZUJ.Z4D- 
Zi/J . j j 1 —H-UZ 


lo /4 


lransposase zz 


i_»i transposaoie element 


i.De-3i 


154. / 


1 


o-zo i 


1676 


Ribosomal S2 


Ribosomal protein S2 


9.5e-09 


34.8 


1 


51-134 


16 /o 


• 


Immunoglobulin domain 




1Z.U 


1 


JJ- /o 


1684 


adh_short 


short chain dehydrogenase 


2.3e-32 


120.9 


1 


2-244 


1686 


pldnase 


Protein kinase domain 


2.5e-50 


180.6 


2 


553-770:875-896 


1687 


HRM 


Hormone receptor domain 


7.8e-05 


29.5 


1 


82-134 


1688 


SH3 


SH3 domain 


L9e-21 


84.7 


1 


150-206 


1688 


SH2 


SH2 domain 


1.2e-08 


28.6 


1 


214-233 


1688 


RGS 


Regulator of G protein 
signaling domain 


0.0003 


19.5 


1 


75-100 


1693 


LRR 


Leucine Rich Repeat 


5.3e-34 


-1 O / A 

126.4 


10 


505-528:529- 
554:555-573:575- 
598:613-632:634- 
65 /:65y-6o4:6o6- 
708:709-732:738- 

/05 


1694 


SH3 


SH3 domain 


0.00093 


18.2 


1 


19-73 


1701 


cystatin 


Cystatin domain 


8.9e-ll 


41.4 


1 


32-78 


1705 


Ribosomal S26e 


Ribosomal protein S26e 


2.1e-16 


/ o o 

68.0 


i 
1 


133-231 


1706 


ank 


Ank repeat 


4.9e-30 


113.2 


3 


381-413:414- 
446:450-482 


1706 


TPR 


TPR Domain 


1.4e-06 


35.3 


4 


3-36:43-76:164- 
197:205-238 


1706 


LRR 


Leucine Rich Repeat 


0.0026 


24.4 


4 


716-743:744- 
/o4: / /D-oUz:o-5D- 
859 


1708 


zf-C2H2 


Zinc finger, C2H2 type 


3.6e-23 


90.4 


4 


51-74:205- 

227:233-255:261- 

284 


1709 


PepJV112Bj>rop 
ep 


Reprolysin family 
propeptide 


0.048 


-17.2 


1 


120-196 


1710 


TIG 


IPT/TIG domain 


1.4e-101 


350.8 


10 


620-705:708- 
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788:793-876:883- 

700. 1 \/»J*t 

1169:1174- 
19.S4-1 9S6- 

1336:1341- 
1423-1424- 
1511*1516-1602 


1711 


HK 


Hydroxyethylthiazole 
kinase family 


c /to no 


is 0 




12-39 


1711 


pp-binding 


Phosphopantetheine 
attachment site 


5.4e-09 


38.4 


i 


220-255 


1712 


IGFBP 


Insulin-like growth factor 
binding pr 


2.1e-26 


97.4 


i 


40-99 


1712 


thyroglobulinl 


Thyroglobulintype-1 repeat 


4.6e-05 


29.5 


i 


264-323 


1715 


rvt 


Reverse transcriptase 


/.ae-zo 






18-147 


1717 


A2M 


Alpha-2-macroglobulin 
family 


0 


1259. 




776-879:880- 
1111*1112- 

X X X X * X X X 

1172*1243-1530 


1717 


A2MJST 


Alpha-2-macrogIobulin 
family IN-terminal regi 


A 
U 


109? 

6 
\j 




75-684 


1717 


TCTP 


Translationally controlled 
tumor protein 


0.0013 


-47.0 




1172-1249 


1718 


-tt — : 

cadherm 


Cadhenn domain 




1112 




68-159:334- 
425:439-543 


1722 


p450 


Cytochrome r4DU 


U.JC"tO 


172 0 




270-506 


1725 


ig 


Immunoglobulin domain 


0.0079 


14.2 


i 


34-116 


1726 


ig 


Immunoglobulin domain 


H-.ze-uo 


91 S 




87-131*145-163 


1727 


Synaptophysin 


Synaptophysin / 
synaptoporin 




/ / J.*T 


- 


12-307 


1729 


PA 


PA domain 


1 /I o 90 

z.^e-zy 


110 0 


r 


521-626 


1729 


Glyco_hydro_47 


Glycosyl hydrolase family 
47 


o.ze-i d 


9R id 




99-364 


1731 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


i . ie-oj 


910 R 


9 


50-117*118-293 


1732 


ig 


Immunoglobulin domain 


1 . /e-l o 


64 6 


9 


49-125:169-239 


1734 


p450 


Cytochrome r4DU 


le- id 


61 8 


3 


50-106:162- 
228:298-425 


1735 


profilm 


Profilin 






1 


966-1071 


1735 


Transposase_22 


LI transposable element 


0.00026 


1 09 7 


1 


731-1016 


1742 


PAS 


PAS domain 


3.2e-07 


31.9 


3 


360-413:474- 
507:581-613 


1744 


Dynein heavy 


Dynein heavy chain 


9.8e-251 


846.4 


1 


34-781 


1745 


SCP 


SCP-like extracellular 
protein 


9.2e-36 


132.2 


1 


4-180 


1749 


hpocalm 


Lipocahn / cytosohc tatty- 
acid binding pr 


o.De-UD 


9i n 

z l.U 


i 

L 


42-76 


1 7^ 1 




S o di um • di carb ox vl at e 
symporter family 


1.8e-47 


171.2 


2 


1-34:315-396 


1753 


ig 


Immunoglobulin domain 


9.4e-19 


65.5 


5 


49-124:162- 
' 219:261-310:345- 
400:432-496 


1753 


fii3 


Fibronectin type III domain 


0.00052 


26.8 


1 


514-606 


1757 


IPPc 


Inositol polyphosphate 
phosphatase family, c 


3e-68 


240.1 


3 


64-350:363- 
621:753-796 
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1762 


SSF 


Sodium: solute symporter 
family 


4.2e-256 


864.2 


1 




1763 


LRR 


Leucine Rich Repeat 


3.4e-37 


137.0 


11 


61-84:85-108:109- 
13z: 1 JJ-1 jo:1 j /- 
180:181-204:205- 

OOQ-O^O 07^-077 

ZZo.Zj J-Z /O.Z / /- 

300:301-324:349- 

O / 1 


1763 


ig 


Immunoglobulin domain 


9.1e-09 


33.3 


1 


428-486 


1763 


LRRCT 


Leucine rich repeat C- 
terminal domain 


J.le-u/ 


on r 
J /.J 




O^ Q A 1 1 

JO 5-41 1 


1 T/""1 

1763 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.00023 


27. y 


1 


OA ^ A 

30-59 


1764 


Reprolysin 


Reprolysin (M12B) family 
zinc metalloprote 


l.le-21 


85.5 


1 


156-359 


1764 


tsp_l 


Thrombospondm type 1 
domain 


O C „ A 

2.5e-0o 






/too Am 


1768 


ANF_receptor 


Receptor family ligand 
binding region 


1 a« o a 






1 '7/1 


1769 


C2 


C2 domain 


2.5e-66 


233.8 


2 


249-337:383-471 


1771 


cadherm 


Cadnenn domain 


O^ OA 

2e-o0 


O O A /C 

ZoO.O 




1 A A OO <-0/10 

o/n»o^G /i^i'/f^^ 
o4j.jjy-4j 1 .400- 

^<<C.£A7 /TAT 


1775 


7tm_l 


7 transmembrane receptor 
^rnoQopsin iamny^ 


3.2e-34 


110.9 


2 


34-130:147-330 


1778 


synaptobrevin 


Synaptobrevin 


2.3e-15 


64.5 


1 


104-192 


17ot) 


TV /TUT/" 1 T 


Class I Histocompatibility 
antigen, domains 


o 1 cy> 

o. ie-y3 


3Z1 ,o 


i 
i 


O^ 1 75 
ZD- l/o 


1 /oU 


ig 


immunogioDuim uomain 


Q Qp ns i 


jU.U 


i 
i 


li?4-Z0y 


1 TOO 

1 /oZ 


ArsL^ tran 


ArJC transporter 


n -la 1Q 


/0. 1 


i 
i 


44O-0Z0 


1 TO o 

1783 


OTVTT7 


b odium, neurotransmitter 
symponer iamny 


A 

u 


I 1 CO 

I I oZ. 

D 


i 
i 


OAO 7/7 
JajD" /4 / 


1784 


TIG 


IPT/TIG domain 


4.6e-33 


123.3 


2 


1032-1119:1121- 
1206 


1784 


Plexin_repeat 


Plexin repeat 


4.1e-08 


40.4 


3 


653-706:806- 
858:953-1009 


1784 


Sema 


Sema domain 


2.8e-06 


26.6 


o 


59-160:576-602 


1785 


CTF NFI 


CTF/NF-I family 


1.6e-167 


570.0 


1 


651-824 


1786 


TIR 


TIR domain 


le-38 


132.4 


1 


791-931 


1786 


LRR 


Leucine Rich Repeat 


4.4e-37 


136.6 


15 


143-166:192- 

215:216-241:242- 

266:267-290:385- 

/inc./inn zioo./ioo 

40o:409-43z:4oi- 

A d /C. /I C7 /10A./1Q1 

0UZ.04D-00y. J /u- 
643:645-668 


1 / OO 




JL>CUI/1I1C 111/11 ICJJCciL V_- 

terminal domain 


/ .JC" LJ 






675-726 


1788 


RPEL 


RPEL repeat 


L2e-17 


72.1 


2 


222-247:266-291 


1789 


ACR tran 


AcrB/AcrD/AcrF family 


1.8e-191 


649.5 


1 


1-452 


1792 


Sulfatase 


Sulfatase 


0.00013 


-44.3 


1 


502-912 


1792 


3 5 exonuclease 


3-5' exonuclease 


0.0041 


-14.7 


1 


764-923 


1794 


DUF81 


Domain of unknown 
function DUF81 


3.2e-49 


176.1 


1 


8-166 
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ocore 


xve peats 


Position 


1 /yo 


t Tjn 


Leucine Rich Repeat 


n e^a. aq 

/.oe-uy 


4Z.O 


o 




1 /yo 


LKRC1 


Leucine rich repeat C- 
terminal uomam 


i .ye-Uo 


,4.1 ^ 
41. D 


1 
1 


J / 1 -4Z 1 


1 OA/£ 


vjrJr'S 


Latropniim/UL-i-riice Lxr.3 
domain 


a nr\A 1 

U.UU41 


ZU.o 


1 
1 




i /yo 


rlKiVl 


xiormone receptor aomam 


a nn7^ 

U.UU /0 


lO.j 


1 
1 


D J i-ouu 


1796 


7tm_2 


7 transmembrane receptor 
(Secretin family) 


0.018 


101.1 


1 


935-1252 


1 TOO 

179o 


secY 


eubacterial secY protein 


y.oe-oz 


Zlo.D 


1 
1 


1 O CO* 


1801 


PTR2 


POT family 


3.5e-38 


130.8 


2 


4-72:364-505 


1802 


asp 


Eukaryotic aspartyl protease 


5.1e-13 


41. o 


1 


133-4/1 


1802 


Dakl 


Dakl domain 


6.7e-06 


1 0.0 o 

13Z.Z 


1 


695-1003 


1803 


PMP22_Claudin 


PMP- 

22/EMP/MP20/Claudm 
family 


4.5e-21 


83.4 


1 


4-181 


1804 


HC03_cotransp 


HC03- transporter family 


1.3e-69 


Z44. / 


3 


"2Q <0 . 1 AO 
1 £C 171 OOQ 

loo. i /1-ZZo 


1805 


UCH-2 


Ubiquitin carboxyl-terminal 
hydrolase family 


5.1e-17 


70.0 


1 


1278-1339 


1805 


UCH-1 


Ubiquitin carboxyl-terminal 
hydrolases famil 


9.3e-08 


39.2 


1 


785-820 


1805 


zf-UBP 


Zn-finger in ubiquitin- 
hydrolases and other 


0.0049 


15.8 


1 


688-727 


1 OA/" 

1806 


Nucleoside tran 


Nucleoside transporter 


5.7e~42 


1 CO o 

152.8 


1 


1 OO 

127-446 


1810 


sugarjr 


Sugar (and other) 
transporter 


0.044 


132.3 


1 


637-1048 


1813 

■ 


zf-C2H2 


Zinc finger, C2H2 type 


le-38 

■ 


142.1 


7 


270-292:298- 

nnn ATA ^T7. ^ ^ A 

320:414-437:440- 
463:485-507:513- 

fir, T0 1 "7AO 

jd$: / / 1- /yJ 


1815 


trypsin 


Trypsin 


4.6e-56 


179.2 


1 


28-341 


1817 


TauD 


Taurine catabolism 

di oxygenase TauD, TfdA fa 


1.7e-196 


666.2 


1 


107-385 


1 Q 1 C 


r$JrJJ_transp 


oinuing-protein- dependent 
transport system 


U.UU4o 


Zd.d 


1 


1 OQ 1 Q/t 

izy-iy4 


1819 


Y_phosphatase 


Protein-tyrosine 
phosphatase 


L3e-103 


357.6 


1 


1277-1494 


1819 


fh3 


Fibronectin type III domain 


1.2e-51 


185.1 


7 


461-541:552- 
634:641-719:730- 
812:819-900:908- 
990:998-1087 
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912 


MIF 


Macrophage migration 
inhibitory factor (MIF) 


2.4e-54 


194.0 


1 


124-227 


912 


GST_C 


Glutathione S-transferase, 
C-terminal domain 


7.1e-21 


82.8 


1 


410-525 


912 


GST_N 


Glutathione S-transferase, 
N-terminal domain 


2e-15 


64.7 


1 


324-400 


913 


tsp_l 


Thrombospondin type 1 
domain 


2.9e-23 


90.7 


2 


124-174:181-231 


913 


EGF 


EGF-like domain 


1.6e-20 


81.6 


7 

- 


474-508:514- 
553:559-591:597- 
633:692-720:802- 
836:842-882 


913 


Keratin_B2 


Keratin, high sulfur B2 
protein 


3 


-75.8 




523-649 


913 


granulin 


Granatin 


3.2 


-15.2 




A CI A r 1 A 

484-510 


913 


TIL 


Trypsin Inhibitor like 
cysteine rich domai 


6.4 


-13.2 


2 


540-597:785-842 


913 


cadherin 


Cadherin domain 


6.5 


-10.4 


1 


903-995 


914 


MIF 


Macrophage migration 
inhibitory factor (MIF) 


4.5e-50 


179.8 


1 


2-101 


915 


MIF 


Macrophage migration 
inhibitory factor (MIF) 


6.1e-68 


239.1 


1 


2-129 


916 


trypsin 


Trypsin 


2.3e-91 


316.9 


1 


445-675 


916 


kringle 


Kringle domain 


4.2e-45 


163.3 




286-367 


916 


fn2 


Fibronectin type II domain 


4.1e-25 


96.9 




108-148 


916 


EGF 


EGF-like domain 


6.2e-15 


63.0 


2 


164-197:245-278 


916 


fhl 


Fibronectin type I domain 


4.8e-10 


46.8 




202-237 


916 


KeratinJ32 


Keratin, high sulfur B2 
protein 


5.3 


-79.6 




104-211 


916 


DUF32 


Domain of unknown 
function DUF32 


9.6 


-154.4 




445-556 


917 


laminin_EGF 


Laminin EGF-like 
(Domains III and V) 


9.8e-159 


540.7 


12 


282-346:349- 

409:412-469:472- 

520:523-565:786- 

831:834-877:880- 

925:928-984:987- 

1036:1039- 

1094:1095-1139 


917 


laminin_Nter 
m 


Laminin N-terminal 
(Domain VI) 


5.9e-105 


362.1 




45-280 


917 


DUF26 


Domain of unknown 
function DUF26 


3.2 


-13.0 


1 


1110-1155 


917 


DUF232 


Putative transcriptional 
regulator 


3.2 


-24.4 


1 


1149-1281 


917 


LIM 


LIM domain 


3.8 


-16.6 




1125-1189 


917 


zf-CXXC 


CXXC zinc finger 


4.5 


-8.4 


-\ 


367-414 


917 


Methyltransf 
D12 


D12 class N6 adenine- 
specific DNA met 


5.3 


-88.8 




1404-1654 


917 


VSP 


Giardia variant-specific 
surface prot 


8.4 


-265.1 




864-1168 


917 


EB 


EB module 


9.5 


-12.4 




1069-1125 


918 


EGF 


EGF-like domain 


3.2e-53 


190.2 


9 


33-68:74-110:116- 
151:161-197:208- 
243:247-282:288- 
323:329-362:368- 
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404 


918 


CUB 


CUB domain 


2.9e-17 


, ~ir\ o 

70. o 




/Uo-o I / 


918 


TIL 


Trypsin Inhibitor like 
cysteine rich domai 


1.3 


-5.0 


i 

— 


219-288 


918 


granulin 


Granulin 


2.4 


1 A A 

-14.4 




Z4Z-Zo4 


918 


wap 


WAP-type (Whey Acidic 
Protein) 'four-disul 


4.7 


-8.7 




6-71 


918 


KeratinJB2 


Keratin, high sulfur B2 
protein 


6.6 


-81.2 




95-247 


918 


VSP 


Giardia variant-specific 
surface protein 


in 


-264.1 


1 


95-403 


919 


Parathyroid 


Parathyroid hormone family 


3.1e-82 


286.6 




46-175 


920 


EGF 


EGF-like domain 


2.2e-22 


87.8 


9 


2-36:47-82:88- 
122:128-163:232- 
263:276-306:319- 
349:353-394:40 /- 
439 


921 


IL8 


Small cytokines 
(intecrine/chemokine), inter 


0.19 


-3.2 


1 


25-65 


922 


Zn_carbOpep 
t 


Zinc carboxypeptidase 


8.7e-120 


J -t *1 A 

411.4 


1 


1 y1f\ /ion 

149-429 


922 


Propep_M14 


Carboxypeptidase activation 
peptide 


5.6e-20 


79.8 


1 


7-84 


923 


fibrinogen_C 


Fibrinogen beta and gamma 
chains, C-term 


9.5e-59 


208.6 


1 


62-263 


927 


FKBP 


FKBP-type peptidyl-prolyl 

• * 

cis-trans isomeras 


2.4e-41 


150.8 


1 


40-122 


930 


FAD- 
oxidase G, 


FAD linked oxidases, C- 
terminal domain 


4.1e-88 


306.2 


1 


395-647 


930 


FAD_binding 
4 


FAD binding domain 


5.2e-78 


272.6 


2 


24-172:219-381 


931 


UPF0136 


Uncharactensed protein 
family (UPF0136) 


0.0003 1 


17. 6 


1 


1Z- /D 


933 


Fringe 


Fringe-like 


l.4e-06 


-12.8 


1 


262-487 


933 


Galactosyl T 


Galactosyltransferase 


2.6 


-84.8 


1 


282-439 


937 


14-3-3 


14-3-3 protein 


9.5e-l24 


424.5 


1 


50-282 


938 


S_100 


S-100/ICaBP type calcium 
binding domain 


3.6e-23 


90.4 


1 


65-108 


938 


efhand 


EF hand 


0.0012 


25.6 


1 


114-142 


940 


ig 


Immunoglobulin domain 


2.9e-35 


130.6 


5 


38-105:139- 

206:241-293:326- 

377:412-487 


941 


LRR 


Leucine Rich Repeat 


2e-30 


114.5 


<-> 

8 


42-63:64-87:88- 
112:113-136:137- 
160:161-lo4: Ioj- 
206:209-232 


941 


LRRCT 


Leucine rich repeat C- 

Iv/l 1 ill Hell UUlilulll 


3.6e-07 


37.2 


1 


242-292 


941 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.46 


15.7 


1 


11-40 


944 


serpin 


Serpin (serine protease 
inhibitor) 


l.6e-l33 


457.0 


1 


173-532 


947 


RUN 


RUN domain 


8e-44 


159.0 


1 


31-163 


947 


FYVE 


FYVE zinc finger 


3.5e-l3 


57.2 


1 


529-598 


947 


Tektin 


Tektin family 


1.9 


-229.2 


1 


154-473 



WO 03/054152 



PCT/US02/39555 



Table 4B 
470 



SEQ 
ID 


Model 


Description 


E-value 


Score 


Repeats 


j^osixion 


947 


filament 


Intermediate filament 
protein 


4.7 


-202.1 


i 


216-501 


947 


PAPS„reduct 


Phosphoadenosine 
phosphosulfate reductase 


4.9 


-69.1 


i 


305-467 


947 


metalthio 


Metallothionein 


7.3 


-11.4 


i 


576-635 


950 


lectin c 


Lectin C-type domain 


2.4e-08 


A 1 1 

41.1 






951 


lectin c 


Lectin C-type domain 


8 


-36.4 


1 


OK 7 Q 1 


951 


FAD- 
oxidase C 


FAD linked oxidases, C- 
terminal domain 


9.7 


-84. 1 


1 

— 


7/^7 


952 


Glyco_transf_ 
8 


Glycosyl transferase family 
8 


0.01 






1 <I9^ 

1ZD3-13ZO 


952 


Glyco hydro 
20 


Glycosyl hydrolase family 
20, catalyti 


9.8 


-224.0 




1Q 1 fHA 


953 


TPR 


TPR Domain 


0.0042 


23.8 


i 


171-204 


954 


DENN 


DENN (AEX-3) domain 


4.2 


-67.5 




1 £ 1/10 

1D-14Z 


956 


Branch 


Core-2/I-Branching enzyme 


2.2e-77 


270.5 


i 


46-310 


957 


fh3 


Fibronectin type III domain | 


l.le-15 


65.6 




OA 1 AO 

30-108 


958 


Peptidase Ml 
0 


Matrixin 


5.9e-121 


415.2 


i 


58-225 


958 


hemopexin 


Hemopexin 


2e-58 


207.5 


4 


305-347:349- 
393:398-445:447- 

/I 0*7 

4o / 


958 


Astacin 


Astacin (Peptidase family 
M12A) 


0.28 


-108. 1 


1 




958 


PG_binding_ 
1 


Putative peptidoglycan 
binding domain 


2.4 


-9.5 


i 


44-112 


960 


Robl LC7 


Roadblock/LC7 domain 


1.8e-30 


114.7 




1/17 
DO- 14/ 


962 


chrorno 


'chrorno* (CHRromatin 
Organization Modifier) 


2.7e-05 


31.1 


i 


47-84 


962 


SET 


SET domain 


1.2 


-48.2 


i 


243-307 


964 


is 


Immunoglobulin domain 


0.0047 


23.6 




■3/1 1 1 < 

J4-1 ID 


967 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


3.5e-13 


57.2 


i 


41-237 


968 


adh zinc 


Zinc-binding dehydrogenase 


2.4e-61 


•O 1 *7 1 

217.2 


— 


\HA AQ^ 


968 


DUF142 


Domain of unknown 
function DUF142 


6.6 


-67.7 




7/1 Q H7 


970 


El_dehydrog 


Dehydrogenase El 
component 


2.1e-183 


622.7 


i 


129-425 


970 


transketolase 


Transketolase, thiamine 
diphosphate bin 


2.6 


-227.4 


i 

_ . 


173-410 


971 


ig 


Immunoglobulin domain 


5.8e-09 


43.2 




45-122 


973 


zf-MIZ 


MIZ zinc finger 


3.2e-32 


120.5 




T7H 111 


973 


SAP 


SAP domain 


2.4e-05 


31.2 


i 


2-36 


973 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


5.7 


-6.3 




283-325 


975 


WD40 


WD domain, G-beta repeat 


4.3e~09 


43.6 


— 


Oil 07/V/l/l£ 
/1C1-/1Q1 ^OQ 


977 


An_peroxidas 
e 


Animal naem peroxiaase 


o.^e- id/ 






137-701 


978 


Defensin_pro 
pep 


Defensin propeptide 


3.8e-19 


77.0 




32-82 


978 


defensins 


Mammalian defensin 


0.0015 


25.3 




95-123 


981 


ig 


Immunoglobulin domain 


0.00022 


28.0 




49-130 


982 


ig 


Immunoglobulin domain 


5.8e-07 


36.6 




36-112 


983 


Acetyltransf 


Acetyitransferase (GNAT) 


0.22 


15.5 




260-334 
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family 










985 


tsp_l 


Thrombospondin type 1 
domain 


0.0075 


17.1 


1 


151-206 


985 


Furin-like 


Furin-like cysteine rich 
region 


5.1 


-94.5 


1 


37-149 


985 


TNFR_c6 


TNFR/NGFR cysteine-rich 
region 


9.6 


-0.7 


1 


114-148 


987 


LRR 


Leucine Rich Repeat 


5.5e~59 


209.4 


12 


78-101:102- 
125:126-149:150- 
173:174-197:198- 
22 1 :222-z45 :Z46- 
zoy:z /u-zyj.zyH- 

jl /.jlO-J^rl.JTZ- 

365 


987 


LRRNT 


— * * 1 t "X T 

Leucine rich repeat N- 
terminal domain 


0.23 


18. U 


l 


04 *\9 


987 


Keratin_B2 


Keratm, high sulfur B2 
protein 


6.1 


on /C 

-80.6 


I 




988 


ParA 


ParA family ATPase 


0.029 


0.0 


1 


177-283 


988 


APS kinase 


Adenylylsulfate kinase 


0.92 


-112.5 


1 


1 AO OO^C 


988 


ArsA_ATPas 
e 


Anion-transporting ATPase 


0.97 


-180.2 


1 


103-336 


989 


vwd 


von Willebrand factor type 
D domain 


l.le-38 


141.9 


1 


364-514 


989 


vwc 


von Willebrand factor type 
C domain 


5.3e-35 


129.7 


5 


rn 1 nf, i no 

5U-1Uj: IUo- 

1 uj : 1 0O-ZZ4. Zj o- 

289:301-357 


989 


TIL 


Trypsin Inhibitor like 
cysteine rich 


8.9e-06 


32.6 


1 


/TOO £TQO 


989 


Keratin_B2 


Keratin, high sulfur B2 
protein 


1 


/TO /I 

-68.4 


i 
1 


O&A ^QR 


989 


TILa 


TILa domain 


1 A 

1.4 


o n 
-Z. i 


i 
l 


j\j i-jjj 


989 


fhl 


Fibronectin type I domain 


2.3 


4.7 


1 


166-198 


989 


Metallothio P 
EC 


Plant PfiC family 
metallothionein 


4.8 


-38.2 


i 
l 


COO" /TCI 

Do /-OD 1 


990 


Cys_knot 


Cystme-knot domain 


3.2 


-j 1.3 


i 
l 


ZZ /-jIj 


992 


ig 


! Immunoglobulin domain 


2.9e-13 


57.5 


2 


34-111:150-220 


994 


ig 


Immunoglobulin domain 


1.2e-26 


1 ao a 

102.0 


o 
J 


/lO 1 1 A. 1 A'l 
^fD-l 1U. i'H-J- 

197:230-291 


995 


SCP 


SCP-hke extracellular 
protein 


5e-41 


149.7 


1 

1 


/i i 9^ 

H- I ID 


995 


granulin 


Granulin 


6.9 


-17.5 


1 


1 7A 1 QQ 


996 


ig 


-v- 111*1 * 

Immunoglobulin domain 


0.014 


OO A 

22. U 


1 
1 




999 


RNase_PH 


3' exoribonuclease family, 
domain 1 


6.1e-23 


89.6 


1 


272-408 


999 


RNase_PH_C 


3' exoribonuclease family, 
domain 2 


1.4e-13 


58.5 


2 


96-157:392-469 


/-\/*\ /-\ 

999 


PIN rase 


jrolyriuonucieotiae 
nucleotidyltransferase 


0. 1 C VJ 


9S 0 


1 

i. 


189-269 


1000 


ig 


Immunoglobulin domain 


0.00054 


26.7 


1 


38-102 


1002 


zf-C2H2 


Zinc finger, C2H2 type 


7.6e-54 


192.3 


7 


183-205:211- 
233:239-261:267- 
289:295-317:323- 
345:351-373 


1002 


zf-BED 


BED zinc finger 


3.9 


-3.7 


1 


224-262 
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1002 


LIM 


LIM domain 


7.8 


-19.2 


1 


185-249 


1003 


zf-C2H2 


Zinc fmger, C2H2 type 


2.3e-114 


393.3 


16 


260-282:297- 
319:325-347:353- 
375 :3 8 1-403 :4(jy- 
431:437-459:465- 
487:493o 1 j ,dz 1- 
D43:j4yo / 1,D / /- 
Dyy :oio-oz / ,ojj" 

O JO .OOl-OOO. DO -7 

711 


1003 


zf-BED 


BED zmc finger 


0.41 


5.0 


i 
i 




1003 


TFIIS 


Transcription factor S-II 
(TFIIS) 


0.75 


2.4 


2 


353-391:468-503 


1003 


GATA 


GATA zinc finger 


1.8 


-6.6 


1 


HOJ-'H-O 1 


1003 


PHD 


PHD-fmger 


5.5 


-16.7 


1 


326-389 


1003 


E6 


Early Protein (E6) 


6.9 


-67.7 


1 


40 /-DD4 


1003 


Bowman- 
Birk_leg 


Bowman-Birk serine 
protease inhibitor 


8 


-18.8 


1 


355-411 


1007 


LIM 


LIM domain 


1.3e-38 


141.7 


3 


390-448:450- 

DO /.D 1UO /O 


1007 


E7 


E7 protein, Early protein 


0.7 


-39.5 


1 


235-422 


1008 


Transposase_ 
8 


Transposase 


2.9 


-12.7 


1 


1/10 

jj- 14o 


1010 


zf-C2H2 


Zinc finger, C2H2 type 


4.8e-06 


33.5 


3 


Z-Z4.zy-JZ. 1D4- 

177 


1010 


zf-C3HC4 


Zinc fmger, C3HC4 type 
(RING fmger) 


7.1 


-7.0 


1 


4-34 


1011 


60s ribosoma 
1 


60s Acidic ribosomal 
protein 


1.2e-20 


82.1 > 


1 


1-89 


1012 


abhydrolase 
2 


Phospholipase/Carboxyleste 
rase 


1.3e-24 


95.2 


1 


9-213 


1016 


Anti_prolifera 
t 


BTG1 family 


1.7e-73 


257.6 


1 


11-253 


1017 


zf-C2H2 


Zinc finger, C2H2 type 


9.8e-71 


248.4 


ll 


147-169:213- 

o4 / JDjo / j.oo 1- 

^fUO.^f 1 .HO /- 

459:465-487:493- 


1017 


KRAB 


KRAB box 


2.6e-26 


100.8 


l 


42-82 


1017 


DM- domain 


DM DNA binding domain 


0.26 


A O 


l 




1017 


zf-BED 


BED zinc finger 


1.8 


-0.6 


2 


338-376:394-432 


1017 


LIM 


LIM domain 


3.4 


-16.2 


1 


495-549 


1017 


zf-TRAF 


TRAF-type zinc finger 


5 


-6.6 


1 


/1T7 /ion 

4Z/-4oU 


1017 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


9.5 


-8.1 


1 


177-218 


1017 


Bowman- 
Birk leg 


Bowman-Birk serine 
protease inniDitor 


9.6 


-19.6 


1 


147-203 


1020 


LIM 


LIM domain 


2.3e-34 


127.6 


4 


78-136:139- 

197:200-256:259- 

315 


1020 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING fmger) 


3.3 


-4.2 


1 


106-142 


1020 


TNFR_c6 


TNFR/NGFR cysteine-rich 
region 


8.2 


-0.1 


1 


89-127 
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1024 


zf-C2H2 


Zinc finger, C2H2 type 


4.3e-53 


189.8 


10 


156-183:184- 

206:212-234:240- 

262:268-290:296- 

318:324-346:352- 

374:380-402:408- 

430 


1024 


KRAB 


KRAB box 


2.7e-26 


100.8 


1 


15-55 


1024 


TFIIS 


Transcription factor S-II 
(TFIIS) 


2.5 


-2.0 


1 


129-166 


1024 


zf-BED 


BED zinc finger 


3.9 


-3.8 


1 


225-263 


1024 


fer4 


4Fe-4S binding domain 


8.3 


-3.2 


1 


176-199 


1025 


RAG2 


Recombination activating 
protein 2 


0 


1380.2 


1 


1-527 


1025 


PHD 


PHD-fmger 


2.9 


-14.2 


1 


419-483 


1026 


zf-C3HC4 


Zinc fmger, C3HC4 type 
(RING finger) 


l.le-06 


35.6 


1 


16-54 


1029 


Keratin_B2 


Keratin, high sulfur B2 
protein 


3.8e-06 


16.4 


2 


5-149:150-281 


1031 


pro_isomeras 
e 


Cyclophilin type peptidyl- 
prolyl cis-tr 


4.1e-107 


369.3 


1 


5-165 


1035 


zf-C2H2 


Zinc fmger, C2H2 type 


1.2e-113 


391.0 


17 


100-122:132- 

154:160-182:188- 

210:216-238:244- 

266:272-294:300- 

322:328-350:356- 

378:384-406:412- 

434:440-462:468- 

490:496-5 1 8 : 524- 

546:552-574 


1035 


KRAB 


KRAB box 


7.1e-24 


92.8 


1 


A A A 

4-44 


1035 


zf-BED 


BED zinc finger 


0.25 


6.9 


2 


78-123:537-575 


1035 


LIM 


LIM domain 


1.5 


-13.2 




330-394 


1035 


TFIIS 


Transcription factor S-II 
(TFIIS) 


2.4 


-1.9 


1 


244-282 


1035 


zf-MIZ 


MIZ zinc finger 


8.5 


-20.9 




457-507 


1038 


Sulfatejrans 
P 


Sulfate transporter family 


4.3e-103 


355.9 


2 


2-284:441-751 


1038 


STAS 


STAS domain 


4.8e-20 


80.0 


1 


774-987 


1038 


BenE 


Benzoate membrane 
transport protein 


1.3 


-317.8 


1 


422-745 


1038 


UPF0118 


Domain of unknown 
function DUF20 


4.9 


-125.1 




454-754 


1038 


Transp_cyt_p 
ur 


Permease for 
cytosine/purines, uracil 


5.3 


-195.7 


i 


403-760 


1038 


voltage_CLC 


Voltage gated chloride 
channel 


5.5 


-238.5 




378-680 


1038 


FecCD 


FecCD transport family 


7.9 


-219.3 


i 


440-686 


1039 


ubiquitm 


Ubiquitm family 


1.9e-08 


41.5 






1042 


ig 


Immunoglobulin domain 


3.1e-30 


113.9 




62-129:163- 

229:264-316:349- 

400:433-501 


1043 


UPAR LY6 


u-PAR/Ly-6 domain 


7.7e-31 


115.9 




88-158 


1044 


7tm 5 


7TM chemoreceptor 


0.17 


-139.3 




29-293 


1044 


sugarjr 


Sugar (and other) 
transporter 


0.27 


-153.3 




1-419 
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1044 


Competence 


Competence protein 


1.8 


-92.8 




10-207 


1044 


oxidored_ql 


NADH- 

Ubiquinone/plastoquinone 
(complex I) 


2.6 


-160.0 


i 


185-410 


1044 


PNTB 


NAD(P) transhydrogenase 
beta subunit 


6.6 


-392.2 


1 


188-508 


1044 


COX1 


Cytochrome C and Quinol 
oxidase polypepti 


7.4 


-311.2 


i 


1-396 


1045 


Lipase_chap 


Proteobacterial lipase 
chaperone prot 


3.1 


-f a A A 

-124.4 


1 


95-336 ! 


1045 


Cys_MetJMe 
ta PP 


Cys/Met metabolism PLP- 
dependent enzy 


8.2 


-317.8 


1 


40-279 


1046 


PAP2 


PAP2 superfamily 


7.7 


<~l 1 A 

-31.4 




no -I <-i e 

oo-l / J 


1047 


MHC_I 


Class I Histocompatibility 
antigen, domains 


7.9e~148 


504.5 


i 


25-203 


1047 


is 


Immunoglobulin domain 


0.79 


16.0 


i 


220-285 


1050 


ig 


Immunoglobulin domain 


8.7e-07 


36.0 


1 


38-110 


1051 


PMP22_Clau 
din 


PMP- 

22/EMP/MP20/Claudin 
family 


0.0019 


-4.7 


1 


4-194 


1052 


GASA 


Gibberellin regulated 
protein 


2.6 


A >™7 A 

-47 .4 


1 


1-69 


1054 


SRCR 


Scavenger receptor 
cysteine-rich domain 


6.2e~25 


96.3 


1 


722-820 


1055 


P2X receptor 


ATP P2X receptor 


8.7e-303 


1019.3 




13-388 


1055 


Metallothio 5 


Metallothionein family 5 


7.1 


-0.8 


"1 


124-161 


1057 


zona_pellucid 

a 


Zona pellucida-like domain 


3.1e-81 


283.3 


1 


268-538 


1057 


trefoil 


Trefoil (P-type) domain 


0.02 


9.1 


1 


224-262 


1058 


Aa_trans 


Transmembrane amino acid 
transporter prot 


2.4e-22 


87.7 


1 


4-388 


1058 


oxidored_ql 


NADH- 

Ubiquinone/plastoquinone 
(complex I) 


2.6 


-159.9 


i 


120-362 


1058 


DUF286 


Caenorhabditis protein of 
unknown functio 


2.7 


-94.1 




83-246 


1058 


PUCC 


PUCC protein 


8.4 


-281.4 


1 


83-365 


1058 


Competence 


Competence protein 


8.4 


-105.8 




C\r\ /** a i 

99-341 


1059 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1.5e-05 


10.6 


1 


33-401 


1060 


Na_sulph_sy 
mp 


Sodium: sulfate symporter 
transmembran 


4.5e-143 


488.7 


1 


-1 CCA 

1-554 


1060 


Na_H_antipor 
ter 


[ Na+/H+ antiporter family 


5.8 


-114.4 


1 


212-546 


1060 


LrgA 


LrgA family 


7.6 


-55.7 


1 


37-141 


1061 


oxidored_ql 


NADH- 

Ubiquinone/plastoquinone 
^complex L) 


9.5 


-171.4 




129-355 


1063 


MHCJ 


Class I Histocompatibility 
antigen, domains 


9.3e-06 


-8.4 




29-204 


1064 


ERG4 ERG2 
4 


Ergosterol biosynthesis 
ERG4/ERG24 family 


l.le-103 


357.8 




7-350 


1065 


Sulfatase 


Sulfatase 


0.28 


-144.2 




66-431 


1065 


CRCB 


CrcB-like protein 


2.6 


-51.2 




728-827 


1065 


Phosphodiest 


Type I phosphodiesterase / 


2.7 


-167.6 




43-368 



WO 03/054152 



PCT/US02/39555 



Table 4B 
475 



SEQ 
ID 


Model 


Description 


E-value 


Score 


Repeats 


Position 






nucleotide p 










1065 


Cad 


Cadmium resistance 
transporter 


7.5 


-136.9 




A 1 0 


1065 


Metalloenzy 
me 


Metalloenzyme superfamily 


9.3 


-43.8 




i qq 'ins 


1066 


ig 


Immunoglobulin domain 


1.2e-15 


65.4 




42-95:135- 

lyZ.ZJ 1-ZOo 


1067 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1.5e-28 


108.3 


1 


ZO-/0O 


1068 


UPF0136 


Uncharacterised protein 
family (UPF0136) 


4e-45 


163.4 


1 


12-108 


1069 


pkinase 


Protein kinase domain 


1.6e-65 


231.1 




ono con 


1069 


Activin_recp 


Activin types I and II 
receptor domain 


6,2e-28 


106.2 


i 


20-107 


1069 


toxin 


Snake toxin H 


2.8 


-19.3 


i 


33-101 


1069 


UPAR LY6 


u-PAR7Ly-6 domain 


4 


-19.7 


1 


4 /- 1U 1 


1070 


PMP22_Clau 
din 


PMP- 

22/EMP/MP20/Claudin 
family 


0.00018 


13.2 




1 1 HI 


1070 


UPF0136 


Uncharacterised protein 
family 


3.6 


-32.2 




6 /-lo4 


1071 


MHCJ 


Class I Histocompatibility 
antigen, domains 


1.2e-12 


55.5 


1 


24-202 


1071 


ig 


Immunoglobulin domain 


3.6e-05 


30.6 




Z 1 o-ZcS4 


1072 


PMP22_Clau 
din 


PMP- 

22/EMP/MP20/Claudin 
family 


3.1 


-60.4 


i 


36-207 


1073 


sugar_tr 


Sugar (and other) 
transporter 


0.032 


-126.5 


1 






1073 


PUCC 


PUCC protein 


A £" 

4.5 


-2 /4.o 






1073 


Competence 


Competence protein 


6.5 


-103.7 


i 


115-316 


1073 


Nuc_H_symp 
ort 


Nucleoside H+ symporter 


9.6 


-276.3 






1074 


sugarjr 


Sugar (and other) 
transporter 


0.032 


-126.5 


i 


136-616 


1074 


PUCC 


PUCC protein 


4.5 


-274.8 


i 


218-520 


1074 


Competence 


Competence protein 


6.5 


-103.7 




2U3-4U4 


1074 


Nuc_H_symp 
ort 


Nucleoside H+ symporter 


9.6 


-276.3 


i 


160-611 


1075 


ank 


Ankyrin repeat 


1.7e-45 


164.6 


6 


31-63:64-96:97- 

Izy.l 3U-10Z. I03- 

195:196-228 


1076 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


5.7e-05 


-2.6 


1 


i in in a 
1 lo-3 /4 


1076 


DUF216 


Domain of unknown 
function DUF 


1.6 


-42.6 


I 




1077 


sugarjx 


Sugar (and other) 
transporter 


0.0039 


-100.2 


l 


40-4 /U 


1077 


OATP_C 


Organic Anion Transporter 
Polypeptide (OATP) 


0.018 


-180.1 


l 


64-361 


1077 


PUCC 


PUCC protein 


6.2 


-278.1 


l 


85-480 


1077 


PAP2 


PAP2 superfamily 


7.9 


-31.5 


l 


353-476 


1078 


UvrD- 
helicase 


UvrD/REP helicase 


7.1 


-248.3 


l 


299-774 


1078 


DUF246 


Plant protein family 


9.1 


-257.7 


l 


584-865 
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1079 


G_glujxansp 
ept 


Gamma- 

glutamyltranspeptidase 


3.2e-05 


-146.3 


1 


122-499 


1079 


DLH 


Dienelactone hydrolase 
family 


2.5 


-92.8 


1 


1 89-376 


1079 


DUF312 


Short repeats of unknown 
function 


3.3 


0.8 


1 


415-453 


1080 


TPR 


TPR Domain 


2e-19 


78.0 


5 


28-61:68-101:108- 

141:148-181:188- 

221 


1081 


ACAT 


Sterol O-acyltransferase 


1.9e-32 


121.3 


1 


300-406 


1082 


WD40 


WD domain, G-beta repeat 


2.5e-20 


81.0 


7 


1015-1050:1059- 
1097:1115- 
H5l:ll58- 
1194:1203- 
124U: 1240- 

lZo i:izyo-iozy 


1082 


V1R 


Vomeronasal organ 
pheromone receptor tamily 


3.2 


-162.0 


1 


421-676 


1083 


pldnase 


Protein kinase domain 


1.4e-54 


194.8 


— 


24o-jOo 


1083 


Activin_recp 


Activin types I and II 
receptor domain 


5.8e-38 


139.6 


1 


zo-lz/ 


1083 


UPAR LY6 


u-PAR/Ly-6 domain 


7.9 


-22.8 


i 


48-121 


1084 


pkinase 


Protein kinase domain 


1 A C A 

1.4e-54 


194.8 




Tin ^QA 


1084 


Activinrecp 


Activin types I and II 
receptor domain 


5.8e-38 


139.6 


i 


26-127 


1084 


UPAR LY6 


u-PAR/Ly-6 domain 


7.9 


-22.8 


i 


48-121 


1086 


C2 


C2 domain 


1.7e-06 


35.0 




233-jJ.o 


1087 


Cache 


Cache domain 


2.7e-24 


94.2 


i 


557-650 


1087 


vwa 


von Willebrand factor type 
A domain 


0.13 


-30.6 




361-537 


1088 


Glyco hydro 
31 


Glycosyl hydrolases family 
31 


1.6e-224 


759.3 




1-636 


1090 


SSrecog 


Structure-specific 
recognition protein 


2.7e-153 


522.7 


1 


^ /to C C C 

343-555 


1090 


HMG_box 


HMG (high mobility group) 
box 


3.6e-32 


120.3 




Do i-/4y 


1090 


DUF304 


Bacterial membrane flanked 
domain 


A C\ 

4.8 


-12.0 




1 /I yl nil 


1091 


serpin 


Serpin (serine protease 
inhibitor) 


8.1e-196 


663.9 




315-003 


1092 


trypsin 


Trypsin 


0.00023 


-4.5 


1 


334-Dzo 


1093 


Competence 


Competence protein 


9.1 


-106.5 




9-260 


1094 


Aa trans 


Transmembrane amino acid 
transporter pro 


1.5e-07 


4.2 


i 


115-561 


1094 


TrpJTyr_per 
m 


Tryptophan/tyrosine 
permease family 


2.7 


-283.0 


i 


118-558 


1094 


oxidored_ql 


NADH- 

T TV\i ni l-i n rvnr"»/"r\l o af~/^\<m 11 n r\~t~\ 
LJ till] UlIlVJIlW pidoLLH^UinOiiC 


J— * M 

8.1 


-170.0 


1 


1 17-338 


1097 


ATP-synt_C 


ATP synthase subunit C 


3.3e-18 


73.9 




72-140 


1098 


aconitase 


Aconitase family (aconitate 
hydratase) 


1.9e-177 


602.9 




162-744 


1098 


Aconitase_C 


Aconitase C-terminal 
domain 


8.9e-72 


251.9 




872-1043 


1103 


PAP2 


PAP2 superfamily 


6.3e-15 


63.0 




89-236 


1104 


ig 


Immunoglobulin domain 


1.3e-18 


75.3 


2 


80-148:183-251 
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1106 


TCTP 


Translationally controlled 
tumor protein 


3.5e-93 


323.0 


1 


1-166 ; 


1109 


efhand 


EF hand 


1.3e-I3 


58.7 


3 


33-61:102- 

13U: loo-loo 


1110 


ATP1G1JPL 
M MAT8 


ATP 1 G 1/PLM/M AT8 
family 


1.8e-13 


58.2 




no 1 a c 


1112 


ldl_recept_a 


Low-density lipoprotein 
receptor domain 


0.00073 


26.3 




115-153 


1112 


CUB 


CUB domain 


0.0032 


-3.1 




9-109 


1115 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1.8e-05 


9.0 


1 


139-349 


1116 


7tm 1 


7 transmembrane receptor 


7.5e-21 


82.7 




114-363 


1116 


oxidored q5_ 
N 


NADH-ubiquinone 
oxidoreductase chain 4 


4.2 


-55.0 




213-282 


1116 


Abi 


CAAX amino terminal 
protease family 


9.4 


-28.2 




171-254 


1116 


PhaG_MnhG 
YufB 


Na+/H+ antiporter subunit 


9.8 


-57.7 




281-376 


1119 


tsp_l 


Thrombospondin type 1 
domain 


4.5e-38 


139.9 


10 


149-198:306- 

364:571-626:631- 

696:707-761:841- 

889:970- 

1021:1099- 

H4o:iziy- 

1269:1342-1398 


1120 


HMG14 17 


HMG14 and HMG17 


1.3e-34 


128.4 


1 


2-86 


1124 


lectin c 


Lectin C-type domain 


5.1 


-34.2 


1 


216-281 


1125 


ig 


Immunoglobulin domain 


2.1&-17 


71.3 


3 


84-153:185- 
255:292-347 


1136 


sugar_tr 


Sugar (and other) 
transporter 


0.029 


-125.4 


1 


23-504 


1136 


OATP^C 


Organic Anion Transporter 
Polypeptide (OATP) 


2.8 


-225.4 


1 


116-381 


1136 


PUCC 


PUCC protein 


5.8 


-277.4 


1 


83-476 


1136 


Frizzled 


Frizzled/Smoothened family 
membrane region 


8.3 


-247.2 


1 


54-264 


1137 


ATP-synt C 


ATP synthase subunit C 


3.3e-35 


130.4 


2 


1 A '7Q.Qn 1 <^ 

14- /y,yu-i j j 


1138 


DHDPS 


Dihydrodipicolinate 
synthetase family 


7.2e-31 


116.0 


1 


j4-j2j 


1139 


SSF 


Sodium: solute symporter 
family 


L5e-48 


174.7 


2 


jU-4oi.joy-yj3 


1139 


PNTB 


NAD(P) transhydrogenase 
beta subunit 


7.6 


-393.3 


1 


62-347 


1139 


Competence 


Competence protein 


8.9 


-106.3 


1 


fyA 787 
Oh— Zo / 


1140 


Cdc37 


Cdc37 family 


5.4 


-122.6 


1 


317-539 


1143 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1.3e-44 


161.6 


1 


to in 

78-332 


1144 


Peptidase A8 


oignai peptidase (or asej 11 


7 7 


-70 3 


1 


117-211 


1145 


PID 


Phosphotyrosine interaction 
domain (PTB/PID) 


1.7e-99 


343.9 


2 


488-627:661-782 


1145 


WW 


WW domain 


1.7e-08 


41.7 


1 


373-401 


1145 


KdpC 


K+-transporting ATPase, c 
chain 


3.6 


-97.4 


1 


60-245 


1146 


PRA1 


Prenylated rab acceptor 
(PRA1) 


0.0071 


-32.6 


1 


1-153 
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1147 


DUF214 


Predicted permease 


9.5 


/CO c 

-00.5 


i 


"\i 1^1 


1148 


C2 


C2 domain 


2.1e-30 


114.4 


2 


87-165:240-320 


1149 


Reticulon 


Reticulon 


0.29 


-68.6 


1 


OD-ZOO 


1150 


Reticulon 


Reticulon 


1.8 


-81.1 


1 


/y-2oo 


1152 


7tm 1 


7 transmembrane receptor 


2.8e-27 


104.1 


1 


1 £L 1 A 1 n 

161-410 


1152 


oxidored_ql 


NADH- 

Ubiquinone/plastoquinone 
(complex I) 


5.3 


-166.1 


1 


90-329 


1153 


WD40 


WD domain, G-beta repeat 


9.8e-06 


32.5 


4 


26-62:73-1 09:2 Jo- 
271:430-467 


1155 


aa permeases 


Amino acid permease 


5.6e-06 


-179.9 


l 


193-613 


1155 


7tm 5 


7TM chemoreceptor 


1.6 


-156.9 


l 


199-448 


1155 


oxidored_ql 

■ 


NADH- 

Ubiquinone/plastoquinone 


3.7 


-162.9 


I 


■o /*\ pro 

306-552 


1155 


Spore_perme 

ase i 


Spore germination protein 


5.6 


-193.9 


1 


1 80-533 


1155 


Aa trans 


Transmembrane amino acid 
transporter p 


6.4 


-205. 1 


1 


1 79-5 89 


1155 


BPD transp_ 
2 


Branched-chain amino acid 
transport sy 


— 

6.5 


-123.5 


1 


4/-3Z/ 


1157 


PRA1 


Prenylated rab acceptor 
(PRA1) 


l.le-46 


168.5 


1 


23-149 


1158 


dUTPase 


dUTPase 


5.1e-29 


109.8 


1 


46-167 


1158 


rvp 


Retroviral aspartyl protease 


4.1e-22 


86.9 


1 


179-280 


1158 


G-patch 


G-patch domain 


0.0011 


25.7 


1 


285-329 


1158 


MIP 


Major intrinsic protein 


0.048 


-83.0 


1 


1-111 


1159 


synaptobrevin 


Synaptobrevin 


0.49 


-3.1 


1 


96-184 


1160 


laminin_EGF 


Laminin EGF-like 

(Domains III and V) 

... 


9.5e~132 


45L1 


14 


299-353:356- 

423:426-466:490- 

532:535-578:584- 

627:630-680:683- 

727:1265- 

1308:1311- 

1352:1355- 

1401:1404- 

1452:1686- 

1730:1 /33-1 /oo 


1160 


laminin_Nter 
m 


Laminin N-terminal 
(Domain VI) 


4e-77 


269.6 


1 


47-297 


1160 


laminin_G 


Laminin G domain 


6e-31 


H6.2 


3 


2625-2748:3015- 
3138:3185-3313 


1160 


laminin B 


Laminin B (Domain IV) 


l.le-09 


45.6 




1517-1651 


1160 


MCPsignal 


Methyl-accepting 
chemotaxis protein 


0.61 


-H9.3 


1 


2011-2237 


1160 


TSPN 


Thrombospondin N- 
terminal -like domain 


1.7 


-47.9 


1 


3143-331 1 


1160 


filament 


Intermediate filament 
protein 


2.2 


-193.4 


i 


1826-2081 


1160 


DSL 


Delta serrate ligand 


3.3 


-17.5 




472-519 


1160 


OEP 


Outer membrane efflux 
protein 


3.7 


-29.3 




2020-2211 


1160 


EB 


EB module 


6.5 


-10.6 




665-713 


1160 


Apidaecin 


Apidaecin 


7 


7.3 




907-934 


1160 


Acetate_kinas 
e 


Acetokinase family 


8.3 


-242.2 




2093-2412 
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1160 


spectrin 


Spectrin repeat 


8.4 


-20.0 


i 
i 


Z 1U3-ZZUD 


1161 


EGF 


EGF-like domain 


2.2e-26 


1 A 1 1 

101.1 


c 
0 




140:147-178:185- 


1161 


EB 


EB module 


1.7 


-4.2 




111 1 7R 


1161 


metalthio 


Metallothionein 


7.7 


-11.7 


i 


64-133 


1162 


interferon 


Interferon alpha/beta 
domain 


1.5e-17 


/1. 7 




i & \i\ 
l o- 1 /i 


1163 


IL1 


Interleukin-1 / 18 


8.7e-21 


82.5 




1 O 1 


1164 


PDGF 


Platelet-derived growth 
factor (PDGF) 


1.8e-51 


184.4 


l 


52-130 


1165 


IL1 


Interleukin-1 / 18 


3.7e-23 


90.4 




11 1 A A 


1167 


Palm_thioest 


Palmitoyl protein 
thioesterase 


2.4e-216 


732.1 


l 


28-282 


1168 


serpin 


Serpin (serine protease 
inhibitor) 


1.6e-202 


686.2 


i 


47-415 


1170 


Defensin_pro 
pep 


Defensin propeptide 


L2e-26 


102.0 


i 


38-90 


1170 


defensins 


Mammalian defensin 


7.3e-14 


59.5 


l 


103-131 


1172 


Y_phosphatas 
e 


Protein-tyrosine 
phosphatase 


1.8e-110 


380.4 


1 


271-499 


1179 


Metallophos 


Calcineurin-like 
phosphoesterase 


l.le-11 


52.3 


i 


70-285 


1184 


MHCJ 


Class I Histocompatibility 
antigen, domains 


6.2e-06 


-5.9 


l 


29-205 


1187 


PLA2_B 


Lysophospholipase catalytic 
domain 


8.3e-53 


188.9 


1 


orn OAfl 

3>> /-oUU 


1187 


C2 


C2 domain 


1.6e-06 


35.1 


i 


46-129 


1189 


lipocalin 


Lipocalin / cytosolic iatty- 
acid binding 


2.6e-39 


144.0 


1 


OQ 1 CO 

3y-ioo 


1189 


Kunitz_BPTI 


Kunitz/Bovine pancreatic 
trypsin mhibito 


4.7e-23 

m 


90.U 


— 


911 971-974-11 Q 


1190 


ig 


— lit*! * 

Immunoglobulin domain 


4e-U7 


n i 
5 /. 1 




18- 1 16 


1191 


ig 


Immunoglobulin domain 


4e-07 


37.1 


1 


38-116 


1196 


UPF0185 


Uncharacterised protein 
family (UPF01 85) 


3.6e~41 


1 CA O 

150.2 




1 ^ i n? 


1200 


zf-DHHC 


DHHC zinc finger domain 


3.7e-15 


63.8 




87-149 


1201 


KRAB 


KRAB box 


6.6e-22 


86.2 




33-73 


1202 


KRAB 


KRAB box 


6.6e-22 


86.2 




33-73 


1204 


trypsin 


Trypsin 


3e-31 


117.2 




56-239 


1208 


lipocalin 


Lipocalin / cytosolic fatty- 
acid binding pr 


0.12 


9.2 


"1 


38-186 


1212 


LRR 


Leucine Rich Repeat 


2.5e-54 


193.9 


15 


73-96:97-122:123- 

149:150-175:176- 

195:197-220:221- 

246 : 247-27 0 : 292- 

317:318-337:339- 

362-363-388:389- 

408:410-433:434- 

459 


1212 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.24 


17.9 


1 


43-71 


1212 


SCAN 


SCAN domain 


7.2 


-38.8 


1 


88-187 


1213 


Zn_carbOpep 
t 


Zinc carboxypeptidase 


1.9e-61 


217.6 


1 


50-285 



* 
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1214 


ig 


Immunoglobulin domain 


5.5 


Q 1 


— 


1 7-84 


1216 


death 


Death domain 


f~\ A 

2.4 


-U. 1 




y d ion 


1217 


death 


Death domain 


1.5 


2.0 


i 


73-160 


1221 


Clq 


Clq domain 


3.4e-41 


150.3 


— 


1 1A O^S 


1221 


Collagen 


Collagen triple helix repeat 
(20 copies) 


2.6e-10 


47 J 




41- LUU 


1223 


pkinase 


Protein kinase domain 


9e-75 


261.8 


— 


ICO /II 1 


1223 


SH3 


SH3 domain 


0.019 


7.4 




4Z j _£ t / O 


1223 


UBA 


UBA/TS-N domain 


0.035 


20.7 




987-1026 


1224 


vwa 


von Willebrand factor type 
A domain 


1.2e-05 


29.0 




c 1 1/10 
D I-Z4Z 


1225 


abhydrolase 


alpha/beta hydrolase fold 


3.7e-12 


53.8 




111-390 


1225 


abhydrolase_ 
2 


Phospholipase/Carboxyleste 
rase 


2.3 


-88.8 




64-270 


1227 


lectin c 


Lectin C-type domain 


9.6e-29 


108.9 


• 


54-166 


1237 


fibrinogen_C 


Fibrinogen beta and gamma 
chains, C-term 


1.5e-09 


-1.3 




242-385 


1257 


LRR 


Leucine Rich Repeat 


9e-65 


228.6 


14 


78-101:102- 

125:126-149:150- 

173:174-197:198- 

221:222-245:246- 

269:270-293:294- 1 

317:318-341:342- 

joj :3oo-o cSy . 

415 


1257 


LRRCT 


Leucine rich repeat C- 
terminal domain 


0.011 


22.3 


1 


423-475 


1257 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.23 


18.0 


1 


24- jz 


1263 


PepJV112B_p 
ropep 


Reprolysin family 
propeptide 


1.8e-28 


108.1 


1 


75-191 


1263 


disintegrin 


Disintegrin 


2.7e-ll 


51.0 


1 


419-494 


1263 


Reprolysin 


Reprolysin (M12B) family 
zinc met alio 


7.3e-07 


-22.2 




2Uo-4Uz 


1263 


EGF 


EGF-like domain 


in 


10.3 




roc 

ODD-ODD 


1266 


Clusterin 


Clusterin 


2.3e-298 


1004.6 


1 


2-394 


1267 


serpin 


Serpin (serine protease 
inhibitor) 


3.8e-140 


479.0 


1 


OA /IOC 

o 0-425 


1268 


serpin 


Serpin (serine protease 
inhibitor) 


6.8e-162 


551.2 


1 


98-461 


1269 


COX6C 


Cytochrome c oxidase 
subunit Vic 


2.3e-38 


140.9 


— 


1 HZ. 
L-/D 


1270 


Cadherin_C_t 
erm 


Cadherm cytoplasmic 
region 


4 


-/y.4 




1^1 Tfi 


1271 


spectrin 


Spectrin repeat 


8.2 


-19.9 




jU / -41 / 


1272 


W2 


eIF4-gamma/eIF5/eIF2- 
epsilon 


7.3 


-26.9 


1 

— 


325-395 


1272 


spectrin 


opecirm repeat 


O.J 


-19 9 




327-437 


1273 


Rhodanese 


Rhodanese-like domain 


1.5e-48 


174.1 




16-138:165-266 


1274 


trypsin 


Trypsin 


2.9e-80 


280.1 




121-346 


1279 


Calpain_III 


Calpain large subunit, 
domain III 


1.5e-25 


98.3 




30-179 


1284 


Ribosomal L 
35p 


Ribosomal protein L35 


2.4 


-18.1 




122-179 


1285 


Ribosomal L 


Ribosomal protein L35 


2.4 


-18.1 




122-179 
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35p 












1287 


zf-C2H2 


Zinc finger, C2H2 type 


A A *~ J* 

4.4e-66 


233.0 

i 


9 


173-195:201- 
22i:zZy-ZD i:zd /- 
279:285-307:313- 
o jj:j41-^o joy- 
391:397-419 


1287 


KRAB 


KRAB box 


4.9e-23 


90.0 


■ 


14-54 


1287 


zf-BED 


BED zinc finger 


2.4 


-1.9 




186-224 


1287 


LIM 


LIM domain 


9.3 


-19.9 




175-239 


1292 


EGF 


EGF-like domain 


0.36 


17.3 


1 


30-65 


1294 


vwc 


von Willebrand factor type 
C domain 


L3e-09 


45.4 


1 


69-127 


1295 


zf-C2H2 


Zinc finger, C2H2 type 


5e-50 


179.6 


8 


263-285:319- 

341:347-369:375- 

397:403-425:431- 

453:459-481:487- 

510 


1295 


KRAB 


KRAB box 


8.9e-26 


99.1 


1 


8-48 


1295 


TFIIS 


Transcription factor S-II 
(TFIIS) 


5.7 


-5.0 


1 


431-469 


1295 


zf-TRAF 


TRAF-type zinc finger 


7.3 


-8.4 


1 


369-419 


1295 


LIM 


LIM domain 


7.7 


-19.2 


1 


321-385 


1296 


zf-C2H2 


Zinc finger, C2H2 type 


3.2e-113 


389.6 


13 


190-212:218- 
240:246-268:274- 
296:302-324:330- 
352:358-380:386- 

A r\n A 1 A A'5£..AA r *> 

408:414-436:442- 

4o4:4 /u-4yz.4yo- 
520:526-548 


1296 


KRAB 


KRAB box 


6.7e-28 


106.1 


1 


14-54 


1296 


zf-BED 


BED zmc finger 


0.51 


4.1 


1 


511-549 


1296 


PHD 


PHD-fmger 


2.4 


-13.4 


1 


275-338 


1296 


TFIIS 


Transcription factor S-II 
(TFIIS) 


7.6 


-6.1 


1 


330-368 


1296 


LIM 


LIM domain 


8.1 


-19.4 


1 


388-456 


1299 


hemopexin 


Hemopexin 


7e-25 


96. 1 


4 


279-322:324- 

367:369-415:417- 

461 


1299 


Peptidase JVU 
0 


Matrixin 


8.4e-14 


58.8 


1 


1-176 


1299 


Astacin 


Astacin (Peptidase family 
M12A) 


1.4 


-U7.0 


1 


91-238 


1301 


ifi 


Immunoglobulin domain 


0.67 


16.4 


l 


33-116 


1305 


DUF25 


Domain of unknown 
function DUF25 


5.3e-76 


265.9 


1 


40-162 


1307 


kazal 


Kazal-type serine protease 
inhibitor domain 


1.3e-12 


55.3 


1 


21-68 


1309 


Delensin pro 
pep 


-Ueiensm propeptiue 


l .ze-zo 


1 AO A 
1UZ.U 


i 


74-1 96 


1309 


defensins 


Mammalian defensin 


7.3e-14 


59.5 


1 


139-167 


1312 


ras 


Ras family 


2.7e-59 


210.4 


1 


64-253 


1312 


GTPJEFTU 


Elongation factor Tu GTP 
binding domain 


0.91 


-79.4 


1 


61-226 


1312 


arf 


ADP-ribosylation factor 
family 


4.3 


-116.7 


1 


46-226 
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rosition 


1314 


£» >^(r»T TO 

zf~C2H2 


Zmc linger, C2H2 type 


l.l eo l 


1<$J. 1 


o 


1 1 1 Q 1 

loj-ioj. iy i- 
213:219-241:247- 

325:331-353:359- 

JOI 


1314 


LIM 


LIM domain 


1,8 


-13.9 


1 


221-285 


1 *J 1 A 

1314 


rHD 


rriJJ-iinger 


9 c 

Z.J 


-i j.j 


1 
J. 


'\c\AJ\fn 

jut-jO / 


1314 


zf-BED 


BED zinc finger 


2.8 


-2.4 


1 


232-270 


111^ 
1315 


Keratm__B2 


Keratin, nigh sulrur B2 
protein 


I/.UUjj 


-jU.U 


1 
1 


1 J- 1 J^4- 


1316 


Keratin_B2 


Keratin, high sulfur B2 
protein 


3.6e-15 


63.8 


1 


23-154 


1317 


Keratm_B2 


Keratm, high sulfur B2 
protein 


O O ~ OA 

8.2e-20 


O 


o 


zu- 1 o d . l ch- -zy j 


1 O 1 o 

1318 


Keratin_B2 


Keratin, high sulrur B2 
protein 


D.J 




i 
1 


1 1 77 
J-l 12. 


1319 


zf-C2H2 


Zinc finger, C2H2 type 


O A,-. CO 

2.9e-52 


lo./. 1 


n 

I 


jj-j j.oi-o j.oy- 

111.11 /"I J7. 1*+J- 

167:173-195:201- 


1319 


PHD 


PHD-fmger 


1.6 


-11.8 


1 


146-209 


1319 


zf-BED 


BED zinc finger 


3.4 


-3.2 


1 


74-112 


1321 


Keratin_B2 


Keratin, high sulfur B2 
protein 


0.00014 


-83 


1 


23-193 


1 1 1 1 
1331 


ig 


Immunoglobulin domain 


n *7» A<C 

9. /e-Ub 


IOC 
JZ.J 


1 


24- /O 


1332 


Kumtz_BPTI 


Tr • . /t-| • J • 

Kunitz/Bovme pancreatic 
trypsin inhibito 


O O^. 1 o 

2.2e-l& 




i 
1 


/o- izo 


1333 


ZI-C2H2 


Zinc finger, C2H2 type 


4.oe-j 5 


lyo.j 


o 


1 QQ OO 1 .00*7 

iyy-zzi.zz /- 

9AQ'9*\*\ 977'7 < ?'^ 
Z*ty.ZJJ-Z/ / .ZoJ- 

305:311-333:339- 


1 JJ J 


JsJ\_A.r> 


IVIv/YlD DOX 


^ 8p.9A 

j.oe-z't 




1 
X 




1333 


zf-BED 


BED zinc finger 


0.31 


6.1 


2 


240-278:352-390 


1336 


vwa 


von Willebrand factor type 
A aomain 


0.0038 


-8.1 


1 


32-193 


1336 


TIG 


IPT/TIG domain 


1.4 


4.5 


1 


199-291 


1337 


vwa 


von Willebrand factor type 
A domain 


0.0038 


-8.1 


1 


32-193 


1337 


TIG 


IPT/TIG domain 


1.4 


4.5 


1 


199-291 


1339 


PMP22_Clau 
din 


PMP- 

22/B Mr/ Mr ZU/ Claudin 
family 


A AAOO 

O.OOoo 


1 c o 

-16.3 


i 
1 


4-zUj 


i o a n 

1340 


PMrzz_Clau 
din 


r JVlr- 

22/EMP/MP20/Claudin 
iamiiy 


A AA 1 Q 


-4. / 


i 


OA 
*t - iyH* 


10/1 1 

1 341 


rsaL. 


i noxosysiem i reaction 
centre subunit XI 




111 o 

-1 1 i.o 


i 
i 


91-1 06 






Tmmnn nplfjhnl'iTi domain 

J.J. 11111 llll*-/^! UU 14.1111 UVslllcllll 


1 5e-13 




2 


81-158-204-278 


1344 


MARCKS 


MARCKS family 


6.7 


-95.4 


1 


124-395 


1347 


ig 


Immunoglobulin domain 


1.2e-07 


38.9 


2 


168-226:276-330 


1348 


ig 


Immunoglobulin domain 


1.5e-18 


75.1 


3 


157-217:269- 
325:373-427 


1348 


rhv 


picornavirus capsid protein 


1.4 


-52.0 


1 


28-258 


1353 


LRR 


Leucine Rich Repeat 


2e-39 


144.4 


9 


120-143:144- 
167:168-191:192- 
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215:216-239:240- 
Zoo :zoh— zo / .oil - 
334:335-358 


1353 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.0066 


23.1 


1 


92-119 


1353 


LRRCT 


Leucine rich repeat C- 
terminal domain 


1.5 


4.9 


1 


368-418 


1353 


Bgal_small_C 


Beta galactosidase small 
chain, C termin 


6.6 


A A A 

-44.4 


1 


105-1 /4 


1354 


oxidored _q4 


NADH- 

ubiquinone/plastoquinone 
oxidoreduct 


5.8 


-35.2 


1 


60-142 


1356 


ig 


Immunoglobulin domain 


2.1e-17 


71.3 


3 


0 /- IUO. loo- 


1357 


CD36 


CD36 family 


3.9e-231 


no 1 1 
/01.3 


1 
I 


O-H-fO 


1358 


DUF139 


Cysteine rich repeat 
(DUF139) 


1.2e-08 


42.2 


4 


195-211:245- 

261:288-304:316- 

332 


1359 


if? 


Immunoglobulin domain 


1.6e-06 


35.1 


2 


^i-lZ^f. lOO-ZOU 


1359 


Gag MA 


Matrix protein (MA), pl5 


0.0044 


-25.2 


1 


oqi /loo 
zy i-4zy 


1360 


ion trans 


Ion transport protein 


0.0031 


24.2 


1 




1360 


DUF214 


Predicted permease 


8.6 


-67.8 


1 


202-378 


1363 


spectrin 


Spectrin repeat 


9.5e-22 


85.7 


6 


n 101. 11/1 
1 /-iZl. 1Z4- 

zzo:zzy-o4u.o / z- 
476:678-785:788- 
896 


1363 


Remorin C 


Remorin, C-terminal region 


3.5 


-39.8 


1 




1366 


Clq 


Clq domain 


2.1e-31 


117.8 


1 


266-390 


1366 


Collagen 


Collagen triple helix repeat 
(20 copies) 


0.00029 


19.8 


1 


182-241 


1367 


rnaseA 


Pancreatic ribonuclease 


Lle-34 


128.7 


1 


27-132 


1368 


3HCDH_N 


3-hydroxyacyl-CoA 
dehydrogenase, NAD 
binding 


0.0021 


-60.2 


1 


ff\ 1/11 
50-141 


1369 


3HCDH_N 


3-hydroxyacyl-CoA 
dehydrogenase, NAD 
binding 


0.25 


-91.5 


1 


50-153 


1371 


voltage JXC 


Voltage gated chloride 
channel 


3.1e-199 


675.3 


1 


92-528 


1371 


CBS 


CBS domain 


4.9e-16 


66.7 


2 


559-617:761-814 


1371 


PH04 


Phosphate transporter 
family 


4.1 


-199.1 


1 


129-495 


1371 


K trans 


K+ potassium transporter 


5.8 


-566.6 


1 


ZO /- / Dy 


1371 


7tm 5 


7TM chemoreceptor 


6.3 


-167.6 


1 


254-486 


1372 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


0.2 


-84.7 


1 


oz-iyy 


1373 


ldh_C 


lactate/malate 

QenyurOgCIlaov/, aipna/ UCltt 

c-t 


0.69 


-61.5 


1 


3o-ZUU 


1374 


disintegrin 


Disintegrin 


7.2 


-30.4 


1 


241-306 


1375 


Peptidase_Cl 


Papain family cysteine 
protease 


le-120 


414.5 


1 


1 14-332 


1376 


7tm 1 


7 transmembrane receptor 


2.6e-50 


180.6 


1 


48-454 


1376 


7tm 5 


7TM chemoreceptor 


1 


-153.3 


1 


26-453 


1376 


Sec62 


Translocation protein Sec62 


1.4 


-124.2 


1 


307-477 
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1376 


DUF280 


Caenorhabditis protein of 
unknown function 


5.6 


-141.0 


i 


93-274 


1376 


Sre 


C. elegans Sre G protein- 
coupled chemorece 


7.2 


-136.9 




1-503 


1376 


Colicin_im 


Colicin immunity protein 


9.6 


-61.3 


i 


365-466 


1378 


7tm_2 


7 transmembrane receptor 
(Secretin family) 


4.9e-09 


32.5 




250-500 


1378 


GPS 


Latrophilin/CL-l-like GPS 
domain 


0.38 


4.4 


i 


197-248 


1378 


Competence 


Competence protein 


5.9 


-102.8 


i 


240-490 


1380 


MAGE 


MAGE family 


1.4e-91 


317.7 


1 


3-231 


1381 


ion trans 


Ion transport protein 


0.065 


16.8 


! 


158-323 


1381 


ABC2_memb 
rane 


ABC-2 type transporter | 


6.1 


-130.1 




150-339 


1381 


oxidored ql 


NADH- 

Ubiquinone/plastoquinone 


6.2 


-167.5 


! 


51-322 


1381 


OATP_C 


Organic Anion Transporter 
Polypeptide 


9.5 


-236.4 


1 


80-321 


1382 




Immunoglobulin domain 


7e-12 


52.9 




37-128:160-241 


1385 


MHCJ 


Class I Histocompatibility 
antigen, domains 


9.3e-06 


-8.4 


1 


29-204 


1388 


pro_jsomeras 
e 


Cyclophilin type peptidyl- 
prolyl cis-tr 


7e-88 


305.4 


1 


45-184 


1390 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


5.3e-06 


21.2 


1 


1-210 


1391 


Reticulon 


Reticulon 


6.4 


-89.9 


1 


52-257 


1392 


Collagen 


Collagen triple helix repeat 


0.92 


-48.2 




27-84 


1392 


MDM2 


p53-associated protein 
(MDM2) 


5.1 


-12.8 


1 


92-155 


1392 


Cytidylyltrans 


Phosphatidate 
cytidylyltransferase 


9.4 


-91.4 


1 


141-242 


1393 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1.6e-12 


55.0 




99-329 


1393 


DUF40 


Domain of unknown 
function DUF40 


5.7 


-131.9 


-J 


97-267 


1395 


trefoil 


Trefoil (P-type) domain 


3.9 


-11.1 


! 


28-53 


1397 


rn3 


Fibronectin type III domain 


4.7e-78 


272.7 


5 


533-619:631- 

717:734-833:854- 

939:951-1039 


1397 


ig 


Immunoglobulin domain 


3.7e~42 


153.5 


5 


154-227:261- 

318:362-418:453- 

511:1253-1316 


1398 


DUF300 


Domain of unknown 
function 


1.4e-152 


520.3 


1 


51-341 


1404 


ig 


Immunoglobulin domain 


2.9e-08 


40.9 


2 


42-99:139-198 


1409 


ig 


Immunoglobulin domain 


8.1e-30 


112.5 


3 


142-199:263- 
321:359-438 


1410 


Tissue fac 


Tissue factor 


0.26 


-135.2 


1 


1-267 


1410 


HRM 


Hormone receptor domain 


8.4 


-13.9 


1 


28-101 


1412 


FecCD 


FecCD transport family 


7.2 


-218.5 


1 


144-342 


1413 


MetallothioJP 
ro 


Prokaryotic metallothionein 


6.2 


-8.0 


1 


464-511 


1414 


mito_carr 


Mitochondrial carrier 
protein 


3.5e-64 


226.7 


3 


40-130:137- 
227:238-322 


1416 


ig 


Immunoglobulin domain 


0.0011 


25.6 


1 


57-112 
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1417 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


6.6e-06 


33.1 


1 


27-67 


1418 


Synaptophysi 
n 


Synaptophysm / 
synaptoporin 


6.9e-7l 


248.9 


i 


Z /-Z / L 


1419 


PMP22_Clau 
din 


PMP- 

22/EMP/MP20/Claudin 
family 


5.2e-18 


73.3 


i 
i 




1421 


AmmomumJ 
ransp 


Ammonium Transporter 
Family 




79 & 
/z.o 


i 
i 




1421 


FecCD 


FecCD transport family 


1.8 


-206.7 


1 


97-311 


1421 


Bac_chlorC 


Bacteriochlorophyll C 
binding protein ! 


5.8 


-12.0 


1 


212-289 


1423 


rnaseA 


Pancreatic ribonuclease 


4. ie-zy 


110 1 


i 
i 


32-136 


1424 


PCI 


PCI domain 




Q 1 
-y. 1 


i 
i 


64-133 


1424 


Nrap 


Nrap protein 


9.7 


-257.5 


1 


162-764 


1425 


CbiM 


CbiM 


3.4 


-oO.o 


i 
i 




1427 


LRR 


Leucine Rich Repeat 


4.:>e-zo 


i nn n 

1UU.U 


o 


74*76-99-100- 
123-124-144 


1427 


fh3 


Fibronectin type 111 domain 


2. /e-uo 


41 .1/ 


i 
i 




1427 


# 

ig 


Immunoglobulin domain 


z.4e-u / 


17 0 




225-283 


1427 


LRRCT 


Leucine rich repeat L- 
terminal domain 


7 Oo A£ 


19 Q 


i 
i 


1 64-209 


1428 


GRAM 


GRAM domain 


0.067 


13.1 


1 


171-245 


1429 


YGGT 


YGGT family 


7.4 


-24.4 


1 


147-217 


1431 


Papilloma E5 


Papillomavirus E5 


6.7 


-34. y 


i 
i 




1435 


Rhomboid 


Rhomboid family 


1.5e-45 


104.0 


l 




1435 


efhand 


EF hand 


3.8 


11.0 


1 


38-66 


1437 


Sema 


Sema domain 


7e-202 


684.1 


l 


1 CY) ^ 1 ^ 


1437 


tsp_l 


Thrombospondin type 1 
domain 


l.le-42 


155.3 


O 


Do*?-Oj / .044- 

695:702-746:833- 

OO 3 .07li"7'rU.7H'J" 


1437 


PSI 


Plexin repeat 


2.5e-l4 


61.0 


1 


531-578 


1438 


ank 


Ankyrin repeat 


2e-l5 


64.7 


3 


336-368:383- 


1438 


ion trans 


Ion transport protein 


9.4e-07 


35.9 


1 


615-817 


1439 


ank 


Ankyrin repeat 


3.3e-12 


54.U 


J 


zl1 7 zlAQ-A^d- 

499:549-579 


1439 


ion trans 


Ion transport protein 


9.4e-07 


3D.y 


1 
1 




1442 


LRR 


Leucine Rich Repeat 


3.2e-20 


80.6 


5 


57-80:81-104:105- 

19R-1 ?Q 1 S9*1 ^7- 

176 


1442 


LRRCT 


Leucine rich repeat C- 
terminal domain 


2.3e-07 


37.9 


1 


1 50-Z4U 


1442 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.00013 


28.8 


1 


77 C^ 


1 A A "2 

1443 


/tm_z 


(Secretin family) 


7 2e-23 


89 4 


1 


166-476 


1443 


HMG_box 


HMG (high mobility group) 
box 


3.1e-05 


30.8 


1 


3-76 


1443 


GPS 


Latrophilin/CL-l-like GPS 
domain 


0.0066 


19.1 


1 


110-162 


1443 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1.9 


-107.5 


1 


255-473 



I 
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Jrosition 


1443 


Frizzled 


Fnzzled/Smoothened family 
membrane region 


3.3 


-24 1.3 


i 




1444 


DUF214 


Predicted permease 


0.0051 


1 £L A 

-16.4 


i 
1 


130-2 /j 


1444 


secY 


eubacterial secY protein 


1.1 


-230.8 


1 


13-270 


1448 


ribonuc_red_s 
m 


Ribonucleotide reductase, 
small chain 


5.6 


-142.1 


1 


529-805 


1452 


mito_carr 


Mitochondrial carrier 
protein 


8.4e-58 


205.5 


2 


27-123:124-217 


1454 


mitocarr 


Mitochondrial carrier 
protein 


0.00049 


22.3 


2 


79-179:198-287 


1455 


WD40 


WD domain, G-beta repeat 


2.7e-26 


100.8 


4 


64-99:105- 

141 :23d-2 /3:z/y- 

O 1 J 


1456 


Furin-like 


Furin-like cysteine rich 

• 

region 


6.1e-89 


308.9 


1 


172-328 


1456 


Recep_L_do 
main 


Receptor L domain 


3.1e-80 


280.0 


2 


46-169:345-471 


1456 


pkinase 


Protein kinase domain 


7e-65 


229.0 


1 


986-1258 


1456 


fh3 


Fibronectin type III domain 


0.2 


11.5 


2 




1456 


fer4 


m -Y— « J ^— < 1*1* 1 * 

4Fe-4S binding domain 


3.7 


-1.0 


1 


236-20/ 


1456 


Keratin_B2 


Keratin, high sulfur B2 
protein 


5.2 


-79.5 


1 


143-285 


1456 


TIL 


Trypsin Inhibitor like 
cysteine rich d 


6.3 


-13.2 


1 


O0 1 T"7A 

221-2 /y 


1462 


Peptidase Ml 
0 


Matrixin 


1.5e-79 


277.7 


1 


62-227 


1462 


hemopexin 


Hemopexin 


1.4e-10 


48.6 


1 


309-350 


1463 


Armadillo_se 
g 


Armadillo/beta-catenm-like 
repeat 


0.028 


21.0 


4 


C A A CA1 . /£. /I 1 

549-591:041- 
685:686-737:738- 

TOO 

783 


1466 


C2 


C2 domain 


6.5e-07 


36.4 


1 


219-304 


1468 


RTC 


RNA 3 '-terminal phosphate 
cyclase 


6.1e-05 


-118.8 


1 


9-196 


1476 


CN_hydrolas 
e 


* • ill 

Carbon-nitrogen hydrolase 


0.0014 


-76.2 


1 


04-341 


1477 


zf-C2H2 


Zinc finger, C2H2 type 


4.8e-0o 


33.5 


3 


zo /-Zoy.zy^— 
317:419-442 


1477 


zf-C3HC4 


Zmc finger, C3HC4 type 
(RING finger) 


7.1 


-7.0 


1 


o^:q oaq 


1478 


-1—1 1 * 

Eukporm 


Eukaryotic porm 


O O. A/T 

i.ie-Uo 


-4Z. 1 


i 
1 


1 1 AO 


1482 


ras 


Ras family 


0.24 


-121.5 


1 


389-513 


1484 


BTB 


BTB/POZ domain 


Z.3e-13 


J /.o 


i 
1 


OO- lyH 


1487 


rvt 


Reverse transcriptase 


o.leo / 


O A*> O 

2U2.2 


i 
1 


Z^f 1 - J U / 


1489 


GalactosylJT 
2 


Galactosyltransferase 


4.7e-177 


601.6 


1 


97-404 


1489 


TT ORF2 


TT viral ORF2 


9 


-98.0 


1 


33^-403 


1490 


Orexin 


Prepro-orexm 


8.3 


o o i 

-32.3 


1 


1 119 
1-lZo 


1 A ftl 




fri t oil t/1 of*^— Kin din cr TYrr\f"f ai iTi 

oudnyiaic-Diiiuiiig piuLciii, 
C-terminal domain 


0 00059 


-84 4 


1 


2-155 


1493 


NUDIX 


NUDIX domain 


0.0011 


21.7 


1 


297-439 


1496 


Big_2 


Bacterial Ig-like domain 
(group 2) 


4.5e-16 


66.8 


1 


1177-1258 


1497 


ig 


Immunoglobulin domain 


1.9e-10 


48.2 


1 


38-112 


1498 


aldo ket red 


Aldo/keto reductase family 


3.7e-08 


-8.7 


1 


108-367 


1502 


Caveolin 


Caveolin 


1.6e-06 


13.3 


1 


90-262 
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jt os iii on 


1504 


sushi 


Sushi domain (SCR repeat) 


8.2e-16 


66.0 


2 


223-281:286-344 


1511 


zf-C2H2 


Zinc finger, C2H2 type 


l.le-16 


68.9 


2 


341-363:369-391 


1511 


KRAB 


KRAB box 


4.9e-09 


43.5 


1 


230-268 


1511 


zf-BED 


BED zinc finger 


4.1 


-3.9 


1 


354-392 


1516 


CBS 


CBS domain 


1.4e-30 


115.0 


3 


189-243:264- 
317:336-389 


1517 


zf-C2H2 


Zinc finger, C2H2 type 


2.4e-128 


439.8 


18 


279-301:307- 
329:335-357:363- 
385:391-413:419- 
441:447 '-469:40 r 5- 
497:503-525:868- 
890 : 896-9 1 8 : y24- 

y4o:y->2-y /4:you- 

i UUZ. 1UU6- 

1058:1064- 

1WOO. 117j7Z- 1 1 It 


1517 


KRAB 


KRAB box 


D.oe-4U 


1 A& O 
140. L 


Z 


7d 1 1 A*fi5U -71 1 


1517 


PHD 


PHD-fmger 


1 o 
1.2 


i a n 
-1U. / 




8Q7 o^n 


1517 


SCAN 


SCAN domain 


1.6 


-32.1 


i 


544-624 


1517 


LIM 


LIM domain 


C A 

5.4 


-1 f.y 




^7 Afil 


1517 


BolA 


BolA-like protein 


6.1 


-25 .1 


i 


283-347 


1517 


TFIIS 


Transcription factor S-II 
(TFIIS) 


9.1 


-6.8 




oco oon 


1518 


ig 


Immunoglobulm domain 


le-Oo 


35.7 


-J 


to i no 


1521 


CoA trans 


Coenzyme A transferase 


1 1 _ 1 £-\ 

l.le-lol 


CCA C 


_± 


/lO l&^'lll All 
q-Z-ZO L.J. / L / Z 


1521 


TGF-beta 


Transforming growth factor 
beta like domain 


o.8e-64 


Zzj. / 






1523 


Reprolysin 


Reprolysin (M12B) family 
zinc metallo 


6.6e-95 


328.7 




ZZJ-422 


1523 


Pep_M12B_p 
ropep 


Reprolysin family 
propeptide 


2e-27 


i n/i c 

104.5 




1 O 1 O/l Q 

lzl-z4o 


1523 


disintegrin 


Disintegrin 


1.9e-14 


61.5 




4,5 /-j 13 


1523 


EGF 


EGF-like domain 


0.0092 


22.6 


i 


661-693 


1523 


VSP 


Giardia variant-specific , 
surface prot 


4.9 


-258.5 




428-750 


1523 


Keratin_B2 


Keratin, high sulfur B2 

* 

protein 


7.2 


-81.7 


i 


427-538 


1524 


DSPc ! 


Dual specificity 
phosphatase, catalytic doma 


3.6e-71 


249.9 


1 


102-241 


1527 


Spermine_syn 
th 


Spermine/spermidine 

-a 

synthase 


8.2e-13 


56.0 


1 


254-440 


1528 


efhand 


EF hand 


3.4e-10 


47.3 


4 


65-91:145- 

173:329-357:365- 

393 


1529 


Epimerase 


NAD dependent 
epimerase/dehydratase 

i<m in y 


0.66 


-192.0 


1 


10-451 


1530 


rvt 


Reverse transcriptase 


7.7e-69 


242.1 


1 


125-395 


1531 


RPH3A_effec 
tor 


Rabphilin-3A effector 
domain 


1.9e-09 


-6.1 


1 


161-339 


1533 


TPR 


TPR Domain 


3.7e-10 


47.2 


3 


220-253:444- 
477:478-511 


1533 


CENP-B 


CENP-B protein 


1 .2e-06 


-7.7 


1 


541-671 


1534 


SAM 


SAM domain (Sterile alpha 


0.045 


18.0 


1 


688-752 
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motif) 










1534 


Atrophin-1 


Atrophin-1 family 


7.9 


-679.9 




731-1402 


1538 


pkinase 


Protein kinase domain 


1.5e-20 


81.7 


i 


42-242 


1540 


zf-C2H2 


Zinc finger, C2H2 type 


0.00035 


27.3 


1 


725-750 


1541 


COLFI 


Fibrillar collagen C-terminal 
domain 


3.9e-163 


555.3 




624-841 


1541 


Collagen 


Collagen triple helix repeat 
(20 copies) 


2.2 


-55.6 


1 


538-597 


1541 


TT ORF2 


TT viral ORF2 


3.9 


-93.3 


i 


508-610 


1542 


EGF 


EGF-like domain 


2.6e-ll 


51.0 




916-947:1004- 
1044:1050-1094 


1542 


granulin 


Granulin 


6.3 


-17.2 




1019-1074 


1542 


TIL 


Trypsin Inhibitor like 
cysteine rich domain 


9.4 


-15.2 


i 


938-1004 


1543 


ig 


Immunoglobulin domain 


1.5e-10 


48.5 




38-112 


1544 


UCH-l 


Ubiquitin carboxyl-terminal 
hydrolases famil 


8.5e-15 


62.6 


1 


477-508 


1544 


UCH-2 


Ubiquitin carboxyl-terminal 
hydrolase family 


7.3e-13 


56.2 




759-811 


1544 


zf-UBP 


Zn-finger in ubiquitin- 
hydrolases and other 


0.011 


20.7 


1 


350-420 


1545 


spectrin 


Spectrin repeat 


2.8 


-14.5 


i 


696-803 


1546 


LRR 


Leucine Rich Repeat 


l.le-30 


115.3 


9 


132-155:156- 

179:180-199:201- 

224:250-269:271- 

294:295-318:319- 

341:342-365 


1546 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.00082 


26.1 


1 


62-84 


1547 


Allantoicase 


Allantoicase repeat 


7e-70 


245.6 


2 


116-271:376-476 


1547 


DCX 


Doublecortin 


3.3e-17 


70.6 


1 


611-671 


1551 


Peptidase S9 


Prolyl oligopeptidase family 


1.7e-06 


35.0 


1 


389-469 


1551 


DPPIV_N_ter 
m 


Dipeptidyl peptidase IV 
(DPP IV) N-termi 


1.4 


-261 A 


1 


1-385 


1551 


DLH 


Dienelactone hydrolase 
family 


A 

4.7 


-96.4 




355-597 


1552 


Om_DAP_Ar 
g deC 


Pyr idoxal-dep endent 
decarboxylase, C- 


1.3e-05 


23 A 


1 


84-202 


1555 


rvt 


Reverse transcriptase 


0.00015 


7.9 


1 


505-718 


1557 


rvt 


Reverse transcriptase 


1.4e-69 


244.6 




247-520 


1557 


RcbX 


RbcX protein 


1.8 


-79.3 




1-108 


1558 


RhoGEF 


RhoGEF domain 


9.1e-26 


99.0 


1 


971-1155 


1558 


PDZ 


PDZ domain (Also known 
as DHR or GLGF) 


4.2e-12 


53.6 


1 


159-234 


1558 


PH 


PH domain 


0.081 


19.5 


1 


1199-1312 


1558 


RGS 


Regulator of G protein 
signaling domain 


2.5 


-34.3 




465-577 


1559 


pkinase 


Protein kinase domain 


9e-ll 


20.4 




1 ^1 AA1 
ID 1 -4HO 


1560 


FH2 


Formin Homology 2 
Domain 


4e-105 


362.7 




595-1038 


1561 


Keratin_B2 


Keratin, high sulfur B2 
protein 


9.6 


-83.6 




136-235 


1565 


HMG_CoA_s 
ynt 


Hydroxymethylglutaryl- 
coenzyme A synthas 


6.5e-300 


1009.7 




50-582 


1568 


GCV H 


1 Glycine cleavage H-protein 


4.3e-77 


269.5 




65-185 
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1570 


DLH 


Dienelactone hydrolase 
family 


0.039 


-69.3 


1 


168-431 


1577 


PeptidaseMl 


Peptidase family Ml 


5.6e-74 


259.2 


1 


42-399 


1577 


DUF59 


Domain of unknown 
function DUF59 


1.8 


a A A 

-24.4 




188-265 


1579 


rrm 


RNA recognition motif. 


0.0082 


22.8 




823-891 


1579 


zf-C2H2 


Zinc finger, C2H2 type 


9.5 


9.5 




584-608 


1580 


serpin 


Serpin (serine protease 
inhibitor) 


2.1e-107 


370.3 


1 


37-541 


1581 


CH 


Calponin homology (CH) 
domain 


0.012 


13.5 


1 


84-181 


1582 


LRR 


Leucine Rich Repeat 


9.6e-38 


138.8 


12 


70-93:94-117:118- 

141:142-165:166- 

189:190-213:214- 

237:238-261:262- 

285:286-310:311- 

335:336-359 


1582 


LRRCT 


Leucine rich repeat C- 
terminal domain 


5.6e-12 


53.2 


1 


369-421 


1582 


ig 


Immunoglobulin domain 


1.9e-05 


31.5 


! 


438-499 


1582 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.0099 


22.5 




28-68 


1583 


Nucleoplasmi 
n 


Nucleoplasmin 


4.4 


-94.1 


i 


131-199 


1587 


IBR 


IBR domain 


1.3e-13 


58.7 


i 


237-298 


1587 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


0.0053 


20.0 


1 


168-213 


1591 


TPR 


TPR Domain 


4.1e-13 


57.0 


3 


254-287:288- 
321:322-355 


1592 


PHD 


PHD-fmger 


0.042 


2.9 


2 


684-725:1117- 
1163 


1592 


DNA ligase 
ZBD 


NAD-dependent DNA 
ligase C4 zinc finge 


5.2 


0.3 


1 


904-927 


1593 


Collagen 


Collagen triple helix repeat 
(20 copies) 


5.9 


-64.0 


1 


302-360 


1593 


LIM 


LIM domain 


5.9 


"I O f\ 

-18.2 


1 


641-870 


1595 


Peptidase Ml 


Peptidase family Ml 


4 A A 

6.4e-44 


159.3 


1 


7-279 


1600 


thyroglobulin 

1 


Thyroglobulin type-1 repeat 


3.1e-06 


34.1 


1 




O O 1 1 o 

38-118 


1604 


ig 


T 1 1 1 * J * 

Immunoglobulin domain 


1 


8.2 


— = 




1605 


PHD 


T**T TT*\ J^* 

PHD-finger 


0.59 


-7.8 




JsQ^ &AA 


1606 


zf-CXXC 


CXXC zinc finger 


1.7e-21 


84.9 


i 


50-96 


1606 


F-box 


TH "11* 

F-box domain 


5e-06 


33.4 




j>4!>-0UZ: / 13- /OZ 


1606 


PHD 


PHD-finger 


0.062 


1.3 


i 


207-271 


1606 


rubredoxin 


Rubredoxin 


3.6 


-9.0 






1611 


L27 


L27 domain 


3.5 


S / 




285-341 


1612 


fh3 


Fibronectin type III domain 


0.68 


5.6 


1 


357-443 


1613 


rrm 


RNA recognition motif. 




0U.4 






1614 


SRCR 


Scavenger receptor 
cysteine-rich domain 


2.2e-26 


101.1 




52-145 


1616 


ldl_recept_a 


Low-density lipoprotein 
receptor domain 


0.12 


8.2 




68-114 


1616 


Tropomyosin 


Tropomyosin 


7.3 


-118.6 




216-441 


1617 


B56 


Protein phosphatase 2A 
regulatory B subunit 


3.3 


-315.0 




146-232 
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1618 


SI 


S 1 RNA binding domain 


6.7e-10 


46.3 


i 


352-429 


1618 


HHH 


Helix-hairpin-helix motif 


3.8 


12.3 




173-202 


1619 


trypsin 


Trypsin 


5.9e-09 


43.2 


- x 


30-190 


1623 


Flgl 


Flagellar P-ring protein 


6.7e-19 


76.2 




46-257 


1623 


FlgH 


Flagellar L-ring protein 


1.5 


-75.1 


i 


1-73 


1624 


Dihydroorota 
se 


Dihydroorotase-like 


9.9 


-195.7 


1 


1-187 


1625 


Semialdhyde 
dhC 


Semialdehyde 
dehydrogenase, dimerisat 


1.3e-54 


194.8 


1 


137-338 


1625 


Semialdhyde 
dh 


Semialdehyde 
dehydrogenase, NAD bindi 


2.3e-23 


91.1 


i 


7-128 


1626 


FDX-ACB 


Ferredoxin-fold anticodon 
binding domain 


le-43 


158.6 


i 


83-176 


1628 


HTHJ 


Bacterial regulatory helix- 
turn-helix 


1.5e-13 


58.4 


-i 


7-66 


1628 


LysR_substra 
te 


LysR substrate binding 
domain 


2.5e-08 


41.1 


1 


90-281 


1628 


Hpt 


Hpt domain 


7.8 


-35.7 


1 


65-156 


1631 


PEP- 

utilizers C 


PEP-utilizing enzyme, TIM 
barrel doma 


1.5e-10 


-43.4 




58-174 


1632 


OTCace 


Aspartate/ornithine 
carbamoyltransferase, A 


0.24 


-67.2 


1 


432-546 


1632 


Replicase 


Replicase family 


2.9 


-102.2 


1 


399-576 


1634 


ribonuclease_ ! 
T2 


Ribonuclease T2 family 


0.00089 


-47.2 




31-165 


1638 


efhand 


EF hand 


5.9e-10 


46.5 




38-66:74-102:111- 
139:147-175 


1639 


PCI 


PCI domain 


0.013 


19.5 




165-246 


1641 


Cysjmot 


Cystine-knot domain 


0.011 


-2.5 


"1 


227-337 


1641 


DAN 


DAN domain 


0.28 


-29.8 




199-327 


1643 


Ribosomal_L 
23 


Ribosomal protein L23 


9.6e-13 


55.8 


"1 


197-276 


1645 


tsp 1 


Thrombospondin type 1 
domain 


0.0026 


21.0 


5 


420-476:480- 

534:541-601:607- 

663:669-718 


1647 


Arginosuc__sy 
nth 


Arginosuccinate synthase 


1.3e-07 


-97.2 


1 


45-196 


1651 


vwa 


von Willebrand factor type 
A domain 


l.le-33 


125.4 


1 


89-326 


1652 


ig 


Immunoglobulin domain 


2.4e-09 


44.5 


2 


34-117:220-296 


1658 


thiored 


Thioredoxin 


6.9e-104 


358.5 


2 


324-432:459-570 


1658 


glutaredoxin 


Glutaredoxin 


7.4 


-4.2 


1 


482-544 


1662 


Hist deacetyl 


Histone deacetylase family 


5.9e-61 


215.9 


1 


857-1434 


1666 


wap 


WAP-type (Whey Acidic 
Protein) 'four-disulfi 


7.4e-15 


62.8 


1 


72-116 


1667 


KRAB 


KRAB box 


le-26 


102.2 


1 


106-146 


1672 


ig 


Immunoglobulin domain 


3.5e-ll 


50.6 


3 


146-203:245- 
295:331-405 


1674 


Transposase_ 
22 


LI transposable element 


1.5e-51 


184.7 


1 


6-281 


1676 


Ribosomal S 
2 


Ribosomal protein S2 


0.19 


-72.3 


1 


48-145 


1677 


Patatin 


Patatin-like phospholipase 


1.4e-36 


135.0 


1 


132-301 


1678 


ig 


Immunoglobulin domain 


7.6 


7.9 


1 


35-126 


1678 


Gemini mov 


Geminivirus putative 


9.4 


-42.6 


1 


51-131 
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Jrosition 






A J * 

movement protein 










1684 


adh short 


short chain dehydrogenase 


1.3e-32 


121.8 




2-244 


1685 


UPAR LY6 


u-PAR/Ly-6 domain 


0.0039 


12.2 




131-213 


1686 


pkinase 


Protein kinase domain 


2e-43 


157.7 


■ 


553-900 


1686 


Huntingtin 


Huntingtin 


7.3 


-163.8 




520-893 


1687 


HRM 


Hormone receptor domain 


7.8e-05 


29.5 




82-134 


1688 


SH3 


SH3 domain 


5.4e-22 


86.5 


1 


150-206 


1690 


ACAT 


Sterol O-acyltransferase j 


6 


-109.5 




181-304 


1693 


LRR 


Leucine Rich Repeat 


4.3e-34 


126.7 


10 


505-528:529- 
554:555-573:575- 
598:613-632:634- 
65 /:ojy-oo4.ooo- 
708:709-732:738- 

/CO 


1694 


SH3 


SH3 domain 


0.00089 


19.6 


1 


19-73 


1697 


Metallothio 1 
1 


Metallothionein family 1 1 


5.8 


-11.1 


1 


3o-ol 


1701 


cystatin 


Cystatin domain 


0.01 


ll. 8 


1 




1704 


TPR 


TPR Domain 


9 


6.5 


1 


122-155 


1705 


Ribosomal_S 
26e 


Ribosomal protein S26e 


2.1e-16 


68.0 


1 


133-231 


1706 


ank 


Ankyrin repeat 


5.5e-30 


H3.l 


3 


381-413:414- 
446:450-482 


1706 


TPR 


TPR Domain 


1.6e-06 


35. 1 


4 


3-36:43-76:164- 
197:205-238 


1706 


LRR 


Leucine Rich Repeat 


0.0014 


25.4 


4 


716-743:744- 

764:775-802:835- 

859 


1706 


Tropomoduli 
n 


Tropomodulin 


2.6 


-158.7 


1 


248-839 


1708 


zf-C2H2 


Zinc finger, C2H2 type 


3.6e-23 


90.4 


4 


51-74:205- 

227:233-255:261- 

284 


1708 


zf-BED 


BED zinc finger 


7.7 


-6.3 


1 


246-285 


1709 


Pep_M12B_p 
ropep 


Reprolysin family 
propeptide 


0.055 


-20. 1 


1 


120-202 


1710 


TIG 


IPT/TIG domain 


1.3e-100 


347.7 


10 


620-705:708- 

788:793-876:883- 

988:1084- 

1169:1174- 

1254:1257- 

1336:1341- 

1423:1424- 

1511:1516-1602 


1711 


pp-binding 


Phosphopantetheine 
attachment site 


0.048 


12.0 


1 


201-255 


1711 


HK 


Hydroxyethylthiazole 
kinase family 


2.9 


-161.9 


1 


12-232 


1711 


ketoacyl-synt 


Beta-ketoacyl synthase, N- 
terminal doma 


4 


-114.1 




265-397 


1712 


IGFBP 


Insulin-like growth factor 
binding pr 


8.8e-26 


99. 1 




40-99 


1712 


thyreoglobulin 
1 


Thyroglobulin type-1 repeat 


4.6e-05 


29.5 




264-323 


1714 


Paralemmin 


Paralemmin 


4.2e-26 


100.1 




101-439 
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1715 


rvt 


Reverse transcriptase 


3.1e-08 


40.8 


i 


2-147 


1717 


A2M_N 


Alpha-2-macroglobulin 
family N-terminal r 


0 


1082.3 


1 


52-684 


1717 


A2M 


Alpha-2-macroglobulin 
family 


0 


1184.5 


1 


776-1530 


1717 


TCTP 


Translationally controlled 
tumor protein 


0.0013 


-47.0 


i 


1172-1249 


1717 


prenyltrans 


Prenyltransferase and 
squalene oxidase re 


7.6 


3.0 


1 


1296-1332 


1717 


TSPN 


Thrombospondin N- 
terminal -like domain 


8.2 


-57.8 


i 


34-202 


1718 


cadherin 


Cadherin domain 


9.7e-29 


108.9 




68-159:341- 
425:439-543 


1721 


MBOAT 


MBOAT family 


0.38 


-121.1 


; 


3-173 


1722 


p450 


Cytochrome P450 


9.4e-48 


172.1 




86-527 


1723 


DUF270 


Protein of unknown 
function, DUF270 


0.12 


-272.7 


i 


9-284 


1725 


ig 


Immunoglobulin domain 


0.75 


16.2 


i 


34-116 


1729 


PA 


PA domain 


1.4e-29 


111.7 


; 


521-626 


1729 


Glyco hydro 
47 


Glycosyl hydrolase family 
47 


6.2e-13 


28.4 




99-364 


1731 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


3.1e-62 


220.2 


i 


50-293 


1731 


7tm 5 


7TM chemoreceptor 


1.7 


-157.3 


! 


31-289 


1731 


7tm 3 


7 transmembrane receptor 


6.1 


-155.5 


1 


39-302 


1732 


ig 


Immunoglobulin domain 


9.3e-16 


65.8 


2 


49-125:169-240 


1732 


Adeno E3 C 
Rl 


Adenovirus E3 region 
protein CR1 


8.5 


-24.2 


1 


171-251 


1734 


p450 


Cytochrome P450 


0.0069 


-119.1 


! 


51-492 


1735 


profilin 


Profilin 


5.1e-06 


33.3 


i 


955-1071 


1735 


Transposase 
22 


LI transposable element 


0.00026 


-102.7 




731-1016 


1735 


KE2 


KE2 family protein 


7.8 


-47.5 


1 


765-838 


1739 


IL3 


Interleukin-3 


8.7 


-38.1 


1 


121-224 


1742 


PAS 


PAS domain 


0.024 


21.2 


J 


348-413 


1743 


transmembran 
e4 


Tetraspanin family 


0.87 


-77.9 




5-194 


1744 


Dynein heav 
y 


Dynein heavy chain 


2e-253 


855.3 


"j 


4-781 


1745 


SCP 


SCP-like extracellular 
protein 


9.8e-36 


132.1 


1 


4-180 


1745 


granulin 


Granulin 


0.32 


-8.1 


! 


173-206 


1749 


sugartr 


Sugar (and other) 
transporter 


0.26 


-152.9 




260-692 


1749 


oxidored ql 

— A. 


NADH- 

Ubiquinone/plastoquinone 


1.6 


-155.3 


-i — 


391-674 


1749 


FecCD 


FecCD transport family 


4.7 


-214.9 


i 


473-686 


1749 


Herpes HEP 
A 


Herpesvirus DNA 
helicase/primase compl 


5.4 


-457.3 




77-605 


1749 


PsbT 


Photosystem II reaction 
centre T prote 


5.9 


5.1 




545-572 


1749 


Bac_rhodopsi 
n 


Bacteriorhodopsin 


6.8 


-137.3 




295-496 


1749 


C4dic_mal_tr 
an 


C4-dicarboxylate 
transporter/malic aci 


9.9 


-183.4 




265-477 
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1750 


ATP-synt A 


ATP synthase A chain 


8.1 


-80.8 


i 


837-958 


1751 


SDF 


Sodium:dicarboxylate 
symporter family 


2.7 


-254.2 




275-426 


1753 


ig 


Immunoglobulin domain 


9.9c-19 


75.7 




49-124:162- 

219:261-310:345- 

400:432-496 


1753 


fh3 


Fibronectin type III domain 


0.00052 


26.8 


I 


514-606 


1755 


sugarjx 


Sugar (and other) 
transporter 


0.13 


-144.2 




1-366 


1755 


PUCC 


PUCC protein 


2.3 


-267.4 


i 


21-314 


1755 


Nuc_H_symp 
ort 


Nucleoside H+ symporter 


4.4 


-267.2 


1 


1-356 


1755 


7tm 5 


7TM chemoreceptor 


4.8 


-165.5 


! 


59-281 


1755 


Competence 


Competence protein 


5.1 


-101.6 




16-209 


1755 


AmmoniuinJ 
ransp 


Ammonium Transporter 
Family 


5.4 


-242.0 


1 


67-380 


1755 


BenE 


Benzoate membrane 
transport protein 


8.1 


-335.2 


1 


69-351 


1757 


Exo_endo_ph 

OS 


Endonuclease/Exonuclease/ 
phosphatase fa 


1 .3e-20 


81.9 


1 


367-788 


1758 


K_tetra 


K4- channel tetramerisation 
domain 


2.9 


-38.6 


i 


114-214 


1759 


HK 


Hydroxyethylthiazole 
kinase family 


3 


-162.1 


i 


303-502 


1760 


Peptidase C2 
0 


Type IV leader peptidase 
family 


6.6 


-187.3 




29-187 


1762 


SSF 


S odium : solute symporter 
family 


4.2e-256 


864.2 


- x — 


39-477 


1762 


MVIN 


Virulence factor MVIN 


7.1 


-250.2 




68-548 


1762 


PNTB 


NAD(P) transhydrogenase 
beta subunit 


8.1 


-394.0 




11-300 


1762 


7tm 5 


7TM chemoreceptor 


9.1 


-170.5 




75-352 


1762 


oxidoredql 


NADII- 

Ubiquinone/plastoquinone 
(complex I) 


9.1 


-170.9 




145-463 


1763 


LRR 


Leucine Rich Repeat 


2e-37 


137.7 


n 


61-84:85-108:109- 

132:133-156:157- 

180:181-204:205- 

228:253-276:277- 

300:301-324:349- 

371 


1763 


LRRCT 


Leucine rich repeat C- 
terminal domain 


1.9e-07 


38.2 


l 


358-411 


1763 


ig 


Immunoglobulin domain 


8.5e-05 


29.4 


l 


428-486 


1763 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.00023 


27.9 


l 


30-59 


1764 


Reprolysin 


Reprolysin (M12B) family 
zinc metalloprote 


le-21 


85.6 


l 


156-359 


1764 


tsp_l 


Thrombospondin type 1 
domain 


2.5e-06 


34.5 


l 


422-491 


1769 


C2 


C2 domain 


2.5e-66 


233.8 


2 


249-337:383-471 


1771 


cadherin 


Cadherin domain 


2.7e-79 


276.8 


5 


140-235:249- 

343:359-451:465- 

556:607-697 


1774 


transmembran 


Tetraspanin family 


6.4e-59 


209.2 


1 


458-702 
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XT' vrn 1 f I A 




rvcpcciLs 


ml 

IT UolllUll 




e4 












1 T~l 

YIId 


7tm_ 1 


/ ti ansmemorane receptor 
^rnoaopsin iamiiyj 


/.ye-u:? 






—J JJJ 


1 / ID 


arg 


ceiegans org iamiiy 
integral membrane prote 


Q 1 




1 


111-306 


1 1 /o 


synapiODrevm 


o ynapio Dre vin 








110-198 


1779 


trefoil 


Trefoil (P-type) domain 


3.9 


-11.1 


i 


229-254 


i ion 

1780 


MHL_1 


Class l riistocompatiDiiity 
antigen, domains 


z.oe-oy 






95-1 Q0 


i ion 

1780 


• 


irnmunogioDuim aomam 








1 Q4-259 


1782 


ABC ten 


ABC transporter 


2.4e-19 


77.7 


i 


448-626 


1783 


SNF 


Sodium:neurotransmitter 
symporter famil 


U 


1 1 so ^ 




747 


1783 


Herpes_glyco 
P 


Herpesvirus glycoprotein M 


1 .L 


-ZOZ.-3 






1783 


UPF0118 


Domain of unknown 
function DUF20 


8 


-131.6 




240-582 


1783 


Cyto_ox_2 


Cytochrome oxidase subunit 
II 


y 






HrO f-0¥0 


1819 


Y_phosphatas 
e 


Protem-tyrosine 
phosphatase 




30 /.o 




1 777 1 AQA 


1 Q1Q 




rioronccim type in uouidui 




lOJ . J. 


7 


461-541*^52- 
634:641-719:730^ 
812:819-900:908- 
990:998-1087 


1821 


7tm__l 


7 transmembrane receptor 
(rhodopsin family) 


4.4 


-116.1 


1 


206-413 


1822 


Metallophos 


Calcineurin-like 
phosphoesterase 


1.5 


2.4 


1 


66-252 



t 
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SEQFOL 
D score 


APOLIPOPROTEIN(A); 
CHAIN: A; 


APOLIPOPROTEIN(A); 
CHAIN: A; 


PLASMINOGEN; CHAIN: 
A, B; M PROTEIN; 
CHAIN: C, D; 


PLASMINOGEN; CHAIN: 
A, B; M PROTEIN; 
CHAIN: C 3 D; 


NKG2-D; CHAIN: A; 


HYDROLASE (SERINE 
PROTEINASE) GAMMA- 
*CHYMOTRYPSIN *A 
(E.C.3.4.21.1) ($P*H7.0) 
1GCT3 


HYDROLASE (SERINE 
PROTEINASE) GAMMA- 
*CHYMOTRYPSIN *A 
(E.C.3.4.21.1) ($P*H7.0) 
1GCT3 


COAGULATION FACTOR 
XA-TRYPSIN CHIMERA; 
CHAIN: A; D-PHE-PRO- 
ARG- 

CHLOROMETHYLKETO 
NE (PPACK) WITH 
CHAIN: I; 


NE (PPACK) WITH 
CHAIN: I; 


Compound 


HYDROLASE APO(A); 
ALIPOPROTEIN(A), KRINGLE, 


HYDROLASE APO(A); 
ALIPOPROTEIN(A), KRINGLE, 
PROTEIN-LIGAND INTERACTION, 
2 LYSINE BINDING, CRYSTAL 
STRUCTURE 


BLOOD CLOTTING HUMAN 
PLASMINOGEN KRINGLE-2, 
KRINGLES, VEK-30 


BLOOD CLOTTING HUMAN 
PLASMINOGEN KRINGLE-2, 
KRINGLES, VEK-30 


APOPTOSIS HOMODIMER, CIS- 
PROLINE 






COMPLEX 

(PROTEASE/INHIBITOR) TRYPSIN, 
COAGULATION FACTOR XA 5 
CHIMERA, PROTEASE, PPACK, 2 
CHLOROMETHYLKETONE, 
COMPLEX 

(PROTEASE/INHIBITOR) 


COMPLEX 

(PROTEASE/INHIBITOR) 
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SEQFOL 
D score 


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO- 
MAI; CHAIN: P; 


COMPLEMENT 
PROTEASE CIR; CHAIN: 
NULL; 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


Compound 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 


COMPLEMENT COMPLEMENT, 
EGF, CALCIUM BINDING, SERINE 
PROTEASE 
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SEQFOL 
D score 


NITROGENASE 
MOLYBDENUM-IRON 
PROTEIN ALPHA CHAIN; 
CHAIN: A, C; 
NITROGENASE 
MOLYBDENUM-IRON 
PROTEIN BETA CHAIN; 
CHAIN: B, D; 


NITROGENASE 
MOLYBDENUM-IRON 
PROTEIN ALPHA CHAIN; 
CHAIN: A, C; 
NITROGENASE 
MOLYBDENUM-IRON 
PROTEIN BETA CHAIN; 
CHAIN: B 5 D; 
NITROGENASE IRON 
PROTEIN; CHAIN: E 5 F, G, 
H; 


FTSY; CHAIN: NULL; 


NITROGENASE IRON 
PROTEIN; CHAIN: A, B, 
QD; 


PROTEIN; CHAIN: A, B, 
QD; 


Compound 


OXIDOREDUCTASE 
NITROGENASE COMPONENT I, 
DINITROGENASE, NIFD; 
NITROGENASE COMPONENT I, 
DINITROGENASE, NIFK; 
NITROGENASE COMPONENT II, 
NITROGENASE REDUCTASE, 
NITROGEN-FIXATION, FE 


OXIDOREDUCTASE 
NITROGENASE COMPONENT I, 
DINITROGENASE, NIFD; 
NITROGENASE COMPONENT I, 
DINITROGENASE, NIFK; 
NITROGENASE COMPONENT II, 
NITROGENASE REDUCTASE, 
NITROGEN-FIXATION, FE 
PROTEIN, MOFE PROTEIN, P- 
CLUSTER AND 2 FEMO 
COFACTOR 


SIGNAL RECOGNITION PARTICLE 
RECEPTOR SIGNAL 
RECOGNITION PARTICLE 
RECEPTOR, GTPASE, PROTEIN 2 
TARGETING 


OXIDOREDUCTASE 
NITROGENASE COMPONENT II, 
NITROGENASE REDUCTASE, 
NITROGENASE, NITROGEN 
FIXATION, NUCLOTIDE, MGADP, 
FE 2 PROTRTN AV? 


NITROGENASE COMPONENT II, 
NITROGENASE REDUCTASE, 
NITROGENASE, NITROGEN 
FIXATION, NUCLOTIDE, MGADP, 
FE 2 PROTRTN A V7 
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QCRP2 (LIMl); CHAIN: 
NULL; 


QCRP2 (LIMl); CHAIN: 
NULL; 


QCRP2 (LIMl); CHAIN: 
NULL; 


QCRP2 (LIMl); CHAIN: 
NULL; 


NEONATAL FC 
RECEPTOR; CHAIN: A, C, 
E; BETA-2- 
MICROGLOBULIN; 
CHAIN: B, D, F; 


NEONATAL FC 

RECEPTOR; CHAIN: A, C, 
, E; BETA-2- 
1 MICROGLOBULIN; 

CHAIN: B, D, F; 


LEUKOCYTE ANTIGEN 
(HLA)-CW4 CHAIN: A; 
BETA-2 

MICROGLOBULIN; 
CHAIN: B; HLA-CW4 
SPECIFIC PEPTIDE; 
CHAIN: C; 


Compound 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 
CONTAINING PROTEINS, METAL- 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 
CONTAINING PROTEINS, METAL- 
BINDING PROTEIN, ZINC 2 
FINGER 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 
CONTAINING PROTEINS, METAL- 
BINDING PROTEIN, ZINC 2 
FINGER 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 
CONTAINING PROTEINS, METAL- 
BINDING PROTEIN, ZINC 2 
FINGER 


FCRN, BRAMBELL RECEPTOR; 
COMPLEX 

(IMMUNOGLOBULIN/BINDING 
PROTEIN) 


FCRN, BRAMBELL RECEPTOR; 
COMPLEX 

(IMMUNOGLOBULIN/BINDING 
PROTEIN) 


IMMUNOGLOBULIN (IG)-LIKE 
DOMAIN, ALPHA HELIX, BETA 
SHEET, 2 IMMUNE SYSTEM 
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AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 
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CRPl; CHAIN: A; 


CRPl; CHAIN: A; 


QCRP2 (LIMl); CHAIN: 
NULL; 


QCRP2 (LIMl); CHAIN: 
NULL; 




Compound 


METAL-BINDING PROTEIN LIM t 
DOMAIN CONTAINING PROTEINS | 


METAL-BINDING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS 
ICTL 15 


METAL-BINDING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS 
ICTL 15 


METAL-BINDING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS 
ICTL 15 


METAL-BINDING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS 
ICTL 15 


CONTRACTILE LIM DOMAIN, 1 
CRP, NMR, MUSCLE 
DIFFERENTIATION, 
CONTRACTILE 


CONTRACTILE LIM DOMAIN, 
CRP, NMR, MUSCLE 
DIFFERENTIATION, 
CONTRACTILE 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 
CONTAINING PROTEINS, METAL- 
BINDING PROTEIN, ZINC 2 
FINGER 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 
CONTAINING PROTEINS, METAL- 
, BINDING PROTEIN, ZINC 2 
FINGER 


BINDING PROTEIN, ZINC 2 
FINGER 
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CRPl; CHAIN: A; 


CRPl; CHAIN: A; 


CRPl; CHAIN: A; 


QCRP2 (LIMl); CHAIN: 
NULL; 


QCRP2 (LIMl); CHAIN: 
NULL; 


QCRP2 (LIMl); CHAIN: 
NULL; 


ZINC FINGER PROTEIN 
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GUI; CHAIN: A; DNA; 
CHAIN: C, D; 
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CONTRACTILE LIM DOMAIN, 
CRP, NMR, MUSCLE 


CONTRACTILE LIM DOMAIN, 
CRP, NMR, MUSCLE 
DIFFERENTIATION, 
CONTRACTILE 


CONTRACTILE LIM DOMAIN, 
CRP, NMR, MUSCLE 
DIFFERENTIATION, 
CONTRACTILE 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 
CONTAINING PROTEINS, METAL- 
BINDING PROTEIN, ZINC 2 
FINGER 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 

CONTAINING PROTEINS, METAL- 
BINDING PROTEIN, ZINC 2 
FINGER 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 
CONTAINING PROTEINS, METAL- 
BINDING PROTEIN, ZINC 2 
FINGER 
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GLI, ZINC FINGER, COMPLEX 
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COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PR OTRTN/HN A i 


(DNA-BINDING PROTEIN/DNA ^ 
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SEQFOL 
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CYSTEINE RICH 
INTESTINAL PROTEIN; 
CHAIN: NULL; 
LAMININ; CHAIN: NULL; 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B, C; 


TUMOR NECROSIS 
FACTOR RECEPTOR; 
CHAIN: A, B; 


CYSTEINE AND 
GLYCINE-RICH PROTEIN 
CRP2; CHAIN: A; 


CYSTEINE AND 
GLYCINE-RICH PROTEIN 
CRP2; CHAIN: A; 


CYSTEINE AND 
GLYCINE-RICH PROTEIN 
CRP2; CHAIN: A; 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


CRP1; CHAIN: A; 




Compound 


METAL-BINDING PROTEIN CRIP; 
METAL-BINDING PROTEIN, LIM 
DOMAIN PROTEIN 
GLYCOPROTEIN GLYCOPROTEIN 


IMMUNE SYSTEM BETA, MODULE 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, 
SIGNALLING PROTRTN 


SIGNALING PROTEIN LIM 

DOMAIN CONTAINING PROTEINS, 
METAL-BINDTNG PR OTPTN 


SIGNALING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS, 

METAL-RTNDTNfr PR OTFTW 


SIGNALING PROTEIN LIM 

DOMAIN CONTAINING PROTEINS, 
METAL-BTNDTNG PROTPTM 


METAL-BINDING PROTEIN LIM 

DOMAIN CONTAINING PROTEINS 
ICTL 15 


METAL-BINDING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS 
ICTL 15 


METAL-BINDING PROTEIN LIM 

DOMAIN CONTAINING PROTEINS 
ICTL 15 


CONTRACTILE LIM DOMAIN, 
CRP, NMR, MUSCLE 

DIFFERENTIATION, 
CONTRACTTLR 


DIFFERENTIATION, 
CONTRACTU "R 
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98.88 


82.37 


SEQFOL 
D score 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C 5 D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


Compound 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI 3 ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


1 COMPLEX (DNA-BINDING 
' PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 
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SEQFOL 
D score 


IMMUNOGLOBULIN 
HEAVY CHAIN; CHAIN: 
H, K; 

IMMUNOGLOBULIN 
LIGHT CHAIN; CHAIN: L 3 
M; 


IMMUNOGLOBULIN 
HEAVY CHAIN; CHAIN: 

H, K.; 

IMMUNOGLOBULIN 
LIGHT CHAIN; CHAIN: L, 
M; 


IMMUNOGLOBULIN 
HEAVY CHAIN; CHAIN: 
H, K; 

IMMUNOGLOBULIN 
LIGHT CHAIN; CHAIN: L, 
M; 


IGG2A KAPPA 
ANTIBODY CB41 (LIGHT 
! CHAIN); CHAIN: A; 
IGG2A KAPPA 
ANTIBODY CB41 
(HEAVY CHAIN); CHAIN: 
B; PEPTIDE 5; CHAIN: C; 


MAJOR POLLEN 
ALLERGEN BET V 1-A; 
CHAIN: A, D, G, J; 
IMMUNOGLOBULIN 
KAPPA LIGHT CHAIN; 
CHAIN: B, E, H, K; 
ANTIBODY HEAVY 
CHAIN FAB; CHAIN: C, F, 
I,L; 




Compound 


IMMUNE SYSTEM IGG; 
IMMUNOGLOBULIN, ANTIBODY, 
B12 

1 


IMMUNE SYSTEM IGG; 
IMMUNOGLOBULIN, ANTIBODY, 
B12 


IMMUNE SYSTEM IGG; 
IMMUNOGLOBULIN, ANTIBODY, 
B12 


COMPLEX (ANTIBODY/PEPTIDE) 
POLYSPECIFICITY, 
CROSSREACTIVITY, FAB- 
FRAGMENT, PEPTIDE, 2 HIV-1 


IMMUNE SYSTEM BET V LA, 
BETVI ALLERGEN; BV16 FAB- 
FRAGMENT, KAPPA MOPC21 
CODING SEQUENCE; HEAVY 
CHAIN OF THE MONOCLONAL 
ANTIBODY MST2; BET V 1, BV16 
FAB FRAGMENT, ANTIBODY 
ALLERGEN COMPLEX 
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SEQFOL 
D score 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
(VERTEBRATE) ICLL 3 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
(VERTEBRATE) ICLL 3 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
(VERTEBRATE) ICLL 3 j 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
(VERTEBRATE) ICLL 3 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
(VERTEBRATE) ICLL 3 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
COMPLEXED WITH 
CALMODULIN-BINDING 
DOMAIN OF ICDM 3 
CALMODULIN- 
DEPENDENT PROTEIN 
KINASE II ICDM 4 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
COMPLEXED WITH 
CALMODULIN-BINDING 
DOMAIN OF ICDM 3 
CALMODULIN- 
DEPENDENT PROTEIN 
KINASE II ICDM 4 


CALMODULIN- 
DEPENDENT PROTEIN 
KINASE II ICDM 4 


Compound 
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D score 


MYOSIN; CHAIN: A, B 5 C; 


MYOSIN; CHAIN: A, B, C; 


MYOSIN; CHAIN: A, B, C; 


MYOSIN; CHAIN: A, B, C; 


SCALLOP MYOSIN; 
CHAIN: A, B, C; 


SCALLOP MYOSIN; 
CHAIN: A, B, C; 


SCALLOP MYOSIN; 
CHAIN: A, B, C; 


SCALLOP MYOSIN; 
CHAIN: A, B, C; 


SCALLOP MYOSIN; 
CHAIN: A, B, C; 




Compound 


MUSCLE PROTEIN MUSCLE 
PROTEIN, MYOSIN 
SUBFRAGMENT-1, MYOSIN HEAD, 
2 MOTOR PROTEIN 


MUSCLE PROTEIN MUSCLE 
PROTEIN, MYOSIN 
SUBFRAGMENT-1, MYOSIN HEAD, 
2 MOTOR PROTEIN 


MUSCLE PROTEIN MUSCLE 
PROTEIN, MYOSIN 
SUBFRAGMENT-1, MYOSIN HEAD, 
2 MOTOR PROTEIN 


MUSCLE PROTEIN MUSCLE 
PROTEIN, MYOSIN 
SUBFRAGMENT-1, MYOSIN HEAD, 
2 MOTOR PROTEIN 


MUSCLE PROTEIN MYOSIN, 
CALCIUM BINDING PROTEIN, 
MUSCLE PROTEIN 


MUSCLE PROTEIN MYOSIN, 
CALCIUM BINDING PROTEIN, 
MUSCLE PROTEIN 


MUSCLE PROTEIN MYOSIN, 
CALCIUM BINDING PROTEIN, 
MUSCLE PROTEIN 


MUSCLE PROTEIN MYOSIN, 
CALCIUM BINDING PROTEIN, 
MUSCLE PROTEIN 


MUSCLE PROTEIN MYOSIN, 
CALCIUM BINDING PROTEIN, 
MUSCLE PROTEIN 


SIGNALLING, 2 

COMPLEX(CALCIUM-BINDING 
PROTEIN/PEPTIDE) 
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D score 


HEMOLIN; CHAIN: A, B; 


IMMUNOGLOBULIN; 
CHAIN: A, B; 


ANTIBODY (LIGHT 
CHAIN); CHAIN: L; 
ANTIBODY (HEAVY 
CHAIN); CHAIN: H; 


IGG4 REA; CHAIN: A; RF- 
AN IGM/LAMBDA; 
CHAIN: H, L; 


IGG4 REA; CHAIN: A; RF- 
AN IGM/LAMBDA; 
CHAIN: H, L; 


2E8 (IGG1=KAPPA=) 
ANTIBODY; CHAIN: L, H, 
M,P; 


2E8 (IGG1=KAPPA=) 
ANTIBODY; CHAIN: L, H, 
M,P; 


RELATED PROTEIN A; 
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Compound 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, KAPPA 
LIGHT-CHAIN DIMER HEADER 


IMMUNE SYSTEM 
IMMUNOGLOBULIN; 
IMMUNOGLOBULIN ANTIBODY 
ENGINEERING, HUMANIZED AND 
CHIMERIC ANTIBODY, FAB, 2 X- 
RAY STRUCTURE, THREE- 
DIMENSIONAL STRYCTURE, 
GAMMA- 3 INTERFERON, 
IMMUNE SYSTEM 


COMPLEX 

(IMMUNOGLOBULIN/AUTOANTIG 
EN) COMPLEX 

(IMMUNOGLOBULIN/AUTOANTIG 
EN), RHEUMATOID FACTOR 2 
AUTO-ANTIBODY COMPLEX 


COMPLEX 

(IMMUNOGLOBULIN/AUTOANTIG 
EN) COMPLEX 

(IMMUNOGLOBULIN/AUTOANTIG 
EN), RHEUMATOID FACTOR 2 
AUTO-ANTIBODY COMPLEX 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN 


IMMUNOGLOBULIN 
j IMMUNOGLOBULIN 


MHC-I, COMPLEX, IMMUNE 
SYSTEM 
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ANGIOGENIN; CHAIN: B, 
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DNA; CHAIN: A, B; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC 
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CHAIN: C, F, G; 
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IMMUNOGLOBULIN FV 


IMMUNOGLOBULIN FV 
FRAGMENT OF 
HUMANIZED ANTIBODY 
4D5,VERSION8 1FVC3 


IMMUNOGLOBULIN FV 
FRAGMENT OF A 
HUMANIZED VERSION 
OF THE ANTI-CD 18 IFGV 
3 ANTIBODY H52» 
(HUH52-AA FV)1FGV4 


IMMUNOGLOBULIN FV 
FRAGMENT OF A 
HUMANIZED VERSION 
OF THE ANTI-CDl 8 IFGV 
3 ANTIBODY 'H52* 
(HUH52-AAFV)1FGV4 


IMMUNOGLOBULIN FV 
FRAGMENT OF A 
HUMANIZED VERSION 
OF THE ANTI-CD18 IFGV 
3 ANTIBODY 'H52' 
(HUH52-AAFV)1FGV4 


IGMMEZ 

IMMUNOGLOBULIN; 
CHAIN: L; IGM MEZ 
IMMUNOGLOBULIN; 
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IGMMEZ 

IMMUNOGLOBULIN; 
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IMMUNOGLOBULIN FOLD, 
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IMMUNE SYSTEM 
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FAB VH3 3 SPECIFICITY 


PDB annotation 



C7* 
>— * 

CD 



WO 03/054152 



PCT/US02/39555 



849 



1497 


1497 


1497 


1497 


1497 


1497 


1497 




^ a a 


lwtl 




i — » 

>— « . 


1— * 
v— » > 


■ 

1 


lfvd 


lfvc 






> 




r» 


f 




> 


> 




CHAI 
NID 


tO 
UJ 




to 


to 




to 


to 




STAR 
TAA 


1— ' 

to 


OJ 


to 


s 


vo 
u> 


1 — I 


i — ^ 

to 






k> 

CD 
1 

o\ 


4.2e-57 


4.2e-64 


4.2e-64 


oo 

CD 
VO 


2.8e-64 


5.6e-64 




Psi Blast 








O 

bo 




1.00 






Verify 
score 








LOO 




• 

O 

o 






PMF 
score 


127.46 


116.77 


123.98 




113.29 




125.91 




SEQFOL 
D score 


IMMUNOGLOBULIN 
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DOMAIN FROM 
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WAT, A VARIABLE 
DOMAIN FROM 
IMMUNOGLOBULIN 
LIGHT-CHAIN IWTL 3 
(BENCE-JONES 
PROTEIN) IWTL 4 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN M 
(IG-M) FV FRAGMENT 
1IGM3 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN M 
(IG-M) FV FRAGMENT 
1IGM3 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN M 
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FRAGMENT OF 
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IMMUNOGLOBULIN FV 
FRAGMENT OF 
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FRAGMENT OF 
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HULYS 11; CHAIN: A, B, 
D, E; LYSOZYME; 
CHAIN: C, F; 


FAB FRAGMENT; 
CHAIN: L, H, J, K; 
VASCULAR 
ENDOTHELIAL 
GROWTH FACTOR; 
CHAIN: V, W; 


IMMUNOGLOBULIN; 
CHAIN: A, B; 


ANTIBODY (LIGHT 
CHAIN); CHAIN: L; 
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BENCE-JONES KAPPA I 
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BLOOD COAGULATION 
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IMMUNE SYSTEM BENCE-JONES; 
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IMMUNE SYSTEM BENCE-JONES; 
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IMMUNE SYSTEM 


BLOOD COAGULATION FACTOR 
STUART FACTOR; BLOOD 
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GROWTH FACTOR LIKE DOMAIN 


COAGULATION FACTOR 


PDB annotation 



8- 



WO 03/054152 



PCT/US02/39555 



886 



4^ 



0>* 
CD 



4^ 



o 

CD 



4^ 



f 



2 H S3 




to 



to 



to 



to 



to 



ft 



-o 



4^ 
^1 



to 



)0 



O 



► 9 



3 



4^ 

CD 
i 

-J 
4^ 



^3 



to 
bo 

CD 
i 

o 



to 

bo 

CD 
i 

-o 



4^ 

CD 
i 



to 
bo 

CD 
I 

VD 



On 

CD 
i 



On 
On 



2 



O « 

3 s; 



o 
o 



8 *« 

o 



to 

"to 
-o 



O 

to 



oo 
to 



« 

o 



O O 






17 H3 

H o H 
S3 Ed K 



§ 




OOO 
> S3 > 







O 
O 

re 



o 

o 
r 



1 

o 
a 
a 




Z 5 



TO CO 
O O 



to 



o 

to 



H 

S3 
E3 



52 

G 
H 





> w o > 
2 O O z 

I r G 




> ^ o 

H O 




S 

O 

e-fr- 

&. 

p 



WO 03/054152 



PCT/US02/39555 



887 



4^ 

OO 



Oh 
Oo 



to 
oo 



vo 



o 



to 

bo 
CD 

oo 



to 

CD 
i 

o> 

VO 



4^ 
OJ 



<5 



to 
oo 



to 
vo 



to 

05 
VO 



oo 
oo 



o 



Oh 
4^ 

L>0 



OO 



VO 

OO 



to 

cd 
■ 

o 



to 

OO 



Oh 



to 

VO 



to 

CD 
I 

as 
oh 



vo 
Oh 



-o 

<? 

on 
oo 



Oh 
4^ 
OO 



ft 



to 
oo 



4ix 
Oh 



to 
bo 

CD 
I 

VO 



o 

oo 



o 



2 Q co 



9 * 



53 




H ^o 




2 

W3 



o « 



H 

p. 

Oh 



oj 



vo 

* 

o 
oo 



oo 

t— ' 

oo 



to 
oo 

vo 

OS 



4^. 
4^ 

vo 







2 

W 
I— * 

D 

3 

OO 



a 





CD O 



n 

o 

o 
s 



w 

S3 

8 

O 



WO 03/054152 



PCT/US02/39555 



888 



4^ 

LO 



L/l 

4^ 
LO 



4 

CD 



Lh 



C3 

I 



L/i 
4^ 
LO 



L/l 
4^ 

LO 



L/l 
4^ 
Lo 



L/l 
4^ 
LO 



5? 



L/l 

4^ 

LO 



L/l 

LO 



55 a 



Cd 




to 

LO 



to 

LO 



to 

LO 



to 

LO 



to 

LO 



to 

LO 



LO 



H co 
H 

5d 



LO 



CD 



4^ 
L/l 



LO 



LO 
-4 



LO 
-O 



LO 



4u 
L/i 



LO 



to 
bo 

CD 
r 



bo 

CD 
i 

Lh 



to 
bo 

CD 
I 



to 

bo 

? 
On 



LO 

CD 
t 

L/l 

oo 



to 

CD 



4^ 

to 

CD 
I 

^4 



to 

bo 

CD 
l 

o\ 

OO 



S3 



-o 
to 



^1 

LO 



O 

o 



o 
o 



> M 



° 3 



I 

O 
Ui 



-o 

In) 
oo 



s 

o 
o 

O 
w 

c 
E 



to 
to 

-J 



LO 

to 

oo 



o ^ *Z 







LO 



to 

« 

o 

— 1 



LO 
4^ 

as 
-0 








4^ 
Lh 

CO 
I — t 

§ 

oo 



O 

LO 



C/5 

o 

3 



CO 

W 
O 

o 



n 

o 

5 

O 
S3 



a 

o 
© 

S3 



WO 03/054152 



PCT/US02/39555 



889 



oh 

4^ 
On 



Oh 

4^ 
Oh 



Oh 
4^ 
Oh 



Oh 

4^ 
Oh 



Oh 
6 



Oh 
4^ 



^ H £g 
° ^ 



45». 



JO 



o 



o 

3 



to 




oo 



ON 



ON 



On 



to 



to 



H 
1* 



U) 
On 
oo 



oo 
o 

OO 



OO 

o 
oo 



oo 
o 
o-> 



4^ 
Oh 



o 




cd 

to 
Oh 



to 

CD 

o 
oo 



to 

4^ 
CD 

O 



OJ 

On 
CD 
I 

ON 



CD 



to 
bo 

CD 
1 

OS 
0\ 



^3 



O 
O 



i 

■ 

4^ 
OO 



o 

oo 



1 

o 

o 
oo 



oo 
o 



o 2 



o 
o 



bo 

4^ 



O 
NO 



o 

Oh 



9 ^ 

CD ~ 



I— ^ 

CD 
Oh 



OJ 

i— * 

On 



OC/3 
CD O 

r 



£ 

O 

a 

o 

s 

> 






w o H 

'PA 






© 

3 

© 
© 



O 
Q 



3 

i — i 

O 

H 
W 



x 

o 
o 

< 



n 

§ 



9 



... . . H 



o 



r 1 



o 



5 

Pd 




w 

a 
o 



o 
© 



WO 03/054152 



PCT/US02/39555 



890 



#5 



4^ 

ON 



4^. 



4^ 



4^ 



{3 



5 « 



o 




to 



S3 



oo 



to 
to 



H W3 



to 

I — 1 



4^ 



to 

on 



to 

ON 



to 

-4 



to 



to 

bo 

CP 

■ 

o 
VP 



GO 
? 



ft) 



oo 



oo 

CD 
i 

to 
o 



ON 

O 
i 



to 

bo 

CD 
I 

o 



O 
-J 



to 
to 



to 



o 



o 



4^ 



oo 



o 



* 

4^ 

OO 



is 



O 



o 



S3. 

s 

Is* 



O £ 

3 Si- 
ce ^ 



3 *a 



cm w 
re O 



H 

I; 



O O 



ci 

to 

s 



o o o 



• • • • 

o ^ S 

£ cj 3* 



to to 
DO > 



3 



O O ?d 

to to 
W > 



a 

to 

I 



n o o 



3 




n o o c 

tc tr! bd to 




> 





2 5 § 

W > /O 
O O £ 

to to 



W > ^ 



p p p s 
i? ^ ^ j> 

J V* [ ' 

to to a 
W > 5 



o o o 
S 1 1 
a n ?o 



to to 



W > S 




ca a 



J?? o 







n 
o 



5 



n 

e 

I 

o 
p- 



a 

g 

O 

o 
0 



WO 03/054152 



PCT/US02/39555 



891 



1546 


1546 


1546 


1546 


1546 


1546 


1546 


1546 


1546 


1546 




^ H SS 

Q w M 

(O 


Idee 


IdOb 


IdOb 


IdOb 


IdOb 


IdOb 


ST 

vo 


VO 
S3 


vo 


la9n 




03 


> 


> 


> 


> 


> 


> 


o 


o 


o 


o 




CHAI 
NID 


oo 


oo 

ON 


-o 
o 


to 

ON 

vo 


to 

LO 


to 
oo 


to 

Lh 
Lh 


to 
o 

On 


oo 


* 

u> 

ON 




STAR 
TAA 


to 

ON 
h— * 


to 

Lh 
ON 


to 
o 

-o 


! 4^ 
oo 


4^ 
ON 


LO 

to 
vo 


LO 
oo 

LO 


LO 

4s* 
lo 


to 
vo 


to 

ON 
Lh 






1.4e-13 


«o 

CD 
i 

to 

4^ 


8.4e-20 


2.8e-19 


OO 

• 

4^ 
? 

to 


to 

• 

4^ 

o 

f 

to 

-o 


6e-17 


8.4e-18 


1.3e-08 


to 

4^ 
CD 
t 

t— * 

vo 




Psi Blast 


0.60 


0.52 


0.39 


0.41 


0.20 


0.44 


0.43 


0.66 


0.09 


0.74 




Verify 
score 


1.00 


1.00 


0.98 


0.99 


0.63 


1.00 


0.87 


0.58 


0.57 


0.83 




PMF 
score 
























SEQFOL 
D score 


RAB 

GERANYLGERANYLTRA 


INTERNALIN B; CHAIN: 

A; 


INTERNALIN B; CHAIN: 
A; 


INTERNALIN B; CHAIN: 
A; 


INTERNALIN B; CHAIN: 
A; 


INTERNALIN B; CHAIN: 
A; 


U2RW 
CHAIN: 
CHAIN: 
CHAIN: 


U2RW 
CHAIN; 
CHAIN: 
CHAIN: 


U2RW 
CHAIN: 
CHAIN: 
CHAIN: 


CHAIN: 
CHAIN: 
CHAIN: 


CHAIN: B, D; 


Compound 


l HAIRPIN IV; 
: Q, R; U2 A'; 
: A, C; U2 B"; 
:B, D; 


i HAIRPIN IV; 
: Q, R; U2 A'; 
: A, C; U2 B"; 
:B,D; 


*» v» 

D O po £ 
ts> to a 

w > 5 


l HAIRPIN IV; 
: Q, R; U2 A'; 
: A, C; U2 B M ; 
:B, D; 


TRANSFERASE CRYSTAL 
STRUCTURE, RAB 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


SNRNP,RIBONUCLEOPROTEIN 
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SEQFOL 
D score 


GTPASE-ACTIVATING 
PROTEIN RNA 1 SCHPO; 
CHAIN: A, B; 


TIP ASSOCIATING 
PROTEIN; CHAIN: A, B, 
C, D, E; 


SKP2; CHAIN: A, C; SKPl; 
CHAIN: B, D; 


NUCLEAR RNA EXPORT 
FACTOR 1; CHAIN: A, B; 


NUCLEAR RNA EXPORT 
FACTOR 1; CHAIN: A, B; 


OUTER ARM DYNEIN; 
CHAIN: A; 


OUTER ARM DYNEIN; 
CHAIN: A; 




Compound 


TRANSCRIPTION RNAIP; 
RANGAP; GTPASE-ACTIVATING 
PROTEIN FOR SPIl, GTPASE- 
ACTIVATING PROTEIN, GAP, 


RNA BINDING PROTEIN TAP; 
RIBONUCLEOPROTEIN (RNP, 
RRM, RBD) AND LEUCINE-RICH- 
REPEAT 2 (LRK) DOMAINS 


LIGASE CYCLIN A/CDK2- 
ASSOCIATED P45; CYCLIN 
A/CDK2-ASS0CIATED P19; SKPl, 
SKP2, F-BOX, LRRS, LEUCINE- 
RICH REPEATS, SCF, 2 UBIQUITIN, 
E3, UBIQUITIN PROTEIN LIGASE 


RNA BINDING PROTEIN TAP 
(NFXl); RIBONUCLEOPROTEIN 
(RNP,RBD OR RRM) AND 
LEUCINE-RICH-REPEAT 2 (LRR) 


RNA BINDING PROTEIN TAP 
(NFXl); RIBONUCLEOPROTEIN 
(RNP,RBD OR RRM) AND 
LEUCINE-RICH-REPEAT 2 (LRR) 


CONTRACTILE PROTEIN 
LEUCINE-RICH REPEAT, BETA- 
BETA-ALPHA CYLINDER, 
DYNEIN, 2 CHLAMYDOMONAS, 
FLAGELLA 


CONTRACTILE PROTEIN 
LEUCINE-RICH REPEAT, BETA- 
BETA-ALPHA CYLINDER, 
DYNEIN, 2 CHLAMYDOMONAS, 
FLAGELLA 


DYNEIN, 2 CHLAMYDOMONAS, 
FLAGELLA 
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SEQFOL 
D score 


INTEGRIN; CHAIN: A; 


INTEGRIN; CHAIN: A; 
COLLAGEN; CHAIN: B, 
C,D; 


INTEGRIN; CHAIN: A; 
COLLAGEN; CHAIN: B, 
C,D; 


INTEGRIN; CHAIN: A; 
COLLAGEN; CHAIN: B, 
C,D; 


INTEGRIN; CHAIN: A; 
COLLAGEN; CHAIN: B, 
C,D; 


INTEGRIN ALPHA-1; 
CHAIN: A, B; 


INTEGRIN ALPHA-1; 
CHAIN: A, B; 


Al DOMAIN OF VON 
WILLEBRAND FACTOR; 
CHAIN: NULL; 


Al DOMAIN OF VON 
WILLEBRAND FACTOR; 
CHAIN: NULL; 


Al DOMAIN OF VON 
WILLEBRAND FACTOR; 
CHAIN: NULL; 


VON WILLEBRAND 
FACTOR; CHAIN: A, B; 


VON WILLEBRAND 
FACTOR; CHAIN: A, B; 


Compound 


INTEGRIN INTEGRIN, COLLAGEN 


INTEGRIN INTEGRIN, COLLAGEN 


INTEGRIN INTEGRIN, COLLAGEN 


INTEGRIN INTEGRIN, COLLAGEN 


INTEGRIN INTEGRIN, COLLAGEN 


STRUCTURAL PROTEIN I- 
DOMAIN, METAL BINDING, 
COLLAGEN, ADHESION 


STRUCTURAL PROTEIN I- 
DOMAIN, METAL BINDING, 
COLLAGEN, ADHESION 


WILLEBRAND WILLEBRAND, 
BLOOD COAGULATION, 
PLATELET, GLYCOPROTEIN 


WILLEBRAND WILLEBRAND, 
BLOOD COAGULATION, 
PLATELET, GLYCOPROTEIN 


WILLEBRAND WILLEBRAND, 
BLOOD COAGULATION, 
PLATELET, GLYCOPROTEIN 


COLLAGEN-BINDING COLLAGEN- 
BINDING, HEMOSTASIS, 
DINUCLEOTIDE BINDING FOLD 


COLLAGEN-BINDING COLLAGEN- 
BINDING, HEMOSTASIS, 
DINUCLEOTIDE BINDING FOLD 
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v. , 1 

f—i 1 

- • OO 
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t~-i 

* * 

X 

1— ( 


NEURAMINIDASE; 
CHAIN: N; SINGLE 
CHAIN ANTIBODY; 
CHAIN: H, L; 


ALPHAl BETAl 
INTEGRIN; CHAIN: A; 
ALPHAl BETAl 
INTEGRIN; CHAIN: B; 


ALPHAl BETAl 
INTEGRIN; CHAIN: A; 
ALPHAl BETAl 
INTEGRIN; CHAIN: B; 


ALPHAl BETAl 
INTEGRIN; CHAIN: A; 
ALPHAl BETAl 
INTEGRIN; CHAIN: B; 


CDllA; 1LFA5 CHAIN: 
A, B; ILFA 6 


INTEGRIN; CHAIN: 
NULL; 


INTEGRIN; CHAIN: 
NULL; 




Compound 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, HAPTEN 


COMPLEX (ANTIBODY/ANTIGEN) 
COMPLEX (ANTIBODY/ANTIGEN), 
SINGLE-CHAIN ANTIBODY, 2 
GLYCOSYLATED PROTEIN 


CELL ADHESION INTEGRIN, CELL 
ADHESION 


CELL ADHESION INTEGRIN, CELL 
ADHESION 


CELL ADHESION INTEGRIN, CELL 
ADHESION 


CELL ADHESION LFA-1, ALPHA- 
LV.BETA-2 INTEGRIN, A-DOMAIN; 
ILFA 8 


CELL ADHESION PROTEIN A- 
DOMAIN INTEGRIN, CELL 
ADHESION PROTEIN, 
GLYCOPROTEIN, 
EXTRACELLULAR 2 MATRIX, 
CYTOSKELETON 


CELL ADHESION PROTEIN A- 
DOMAIN INTEGRIN, CELL 
ADHESION PROTEIN, 
GLYCOPROTEIN, 
EXTRACELLULAR 2 MATRIX, 
CYTOSKELETON 


CYTOSKELETON 
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SEQFOL 
D score 


IGG FAB (IGG3, KAPPA); 


IGGl ANTIBODY 1696 
(LIGHT CHAIN); CHAIN: 
L; IGGl ANTIBODY 1696 
(VARIABLE HEAVY 
CHAIN); CHAIN: H; IGGl 
ANTIBODY 1696 
(CONSTANT HEAVY 
CHAIN); CHAIN: I; 


CATALYTIC ANTIBODY 
19A4 (LIGHT CHAIN); 
CHAIN: L; CATALYTIC 
ANTIBODY 19A4 
(HEAVY CHAIN); CHAIN: 
H; 


CATALYTIC ANTIBODY 
19A4 (LIGHT CHAIN); 
CHAIN: L; CATALYTIC 
ANTIBODY 19A4 
(HEAVY CHAIN); CHAIN: 
H; 


CAMPATH- 1H:LIGHT 
CHAIN; CHAIN: L; 
CAMPATH- 1H:HEAVY 
CHAIN; CHAIN: H; 
PEPTIDE ANTIGEN; 
CHAIN: P; 


CAMPATH- 1H:LIGHT 
CHAIN; CHAIN: L; 
CAMPATH- 1H:HEAVY 
CHAIN; CHAIN: H; 
PEPTIDE ANTIGEN; 
CHAIN: P; 


PEPTIDE ANTIGEN; 
CHAIN: P; 


Compound 


IMMUNOGLOBULIN MBR96 FAB 


IMMUNE SYSTEM 
IMMUNOGLOBULIN, IGGl; 
IMMUNOGLOBULIN, IGGl; 
IMMUNOGLOBULIN, IGGl FAB 
FRAGMENT, CROSS-REACTIVITY, 
HTV1 PROTEASE, ENZYME 2 
INHIBITION, IMMUNOGLOBULIN 

i 


CATALYTIC ANTIBODY 
CATALYTIC ANTIBODY, 
TERPENOID SYNTHASE, 
CARBOCATION, 2 CYCLIZATION 
CASCADE 


CATALYTIC ANTIBODY 
CATALYTIC ANTIBODY, 
TERPENOID SYNTHASE, 
CARBOCATION, 2 CYCLIZATION 
CASCADE 


ANTIBODY THERAPEUTIC, 
ANTIBODY, CD52 

* 


ANTIBODY THERAPEUTIC, 
ANTIBODY, CD52 
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69.28 




SEQFOL 
D score 


FAB HGR-2 F6; CHAIN: L; 
FAB HGR-2 F6; CHAIN: 
H; 


IGM-KAPPA COLD 
AGGLUTININ (LIGHT 
CHAIN); CHAIN: A, C; 
IGM-KAPPA COLD 
AGGLUTININ (HEAVY 
CHAIN); CHAIN: B, D; 


IGM-KAPPA COLD 
AGGLUTININ (LIGHT 
CHAIN); CHAIN: A, C; 
IGM-KAPPA COLD 
AGGLUTININ (HEAVY 
CHAIN); CHAIN: B, D; 


IGM-KAPPA COLD 
AGGLUTININ (LIGHT 
CHAIN); CHAIN: A, C; 
IGM-KAPPA COLD 
AGGLUTININ (HEAVY 
CHAIN); CHAIN: B, D; 


CHLOROPLAST 
THIOREDOXIN M CH2; 
CHAIN: A; 


CHAIN: L, H; 


Compound 


IMMUNE SYSTEM GLUCAGON 
RECEPTOR, MONOCLONAL 
ANTIBODY, FAB, RECEPTOR 2 
ANTAGONIST, TYPICAL 
IMMUNOGLOBULIN FOLD, LIGHT 
CHAIN, HEAVY 3 CHAIN, 
ANTIGEN BINDING SITE, 
COMPLEMENTARITY- 
DETERMINING 4 REGIONS 


IMMUNE SYSTEM FAB, IGM, 
ANTIBODY, COLD AGGLUTININ, 
HUMAN 


IMMUNE SYSTEM FAB, IGM, 
ANTIBODY, COLD AGGLUTININ, 
HUMAN 


IMMUNE SYSTEM FAB, IGM, 
ANTIBODY, COLD AGGLUTININ, 
HUMAN 


OXIDOREDUCTASE 
THIOREDOXIN M 5 THIOREDOXIN 
CH2, CHLOROPLASTIC 
THIOREDOXIN 


(IMMUNOGLOBULIN); 
IMMUNOGLOBULIN C REGION, 
GLYCOPROTEIN, 
TRANSMEMBRANE 
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0.93 
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Verify 
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-0.09 




PMF 
score 


67.56 


55.11 


56.09 












66.72 






SEQFOL 
D score 


THIOREDOXIN M; 
CHAIN: A, B; 


THIOREDOXIN F; 
CHAIN: A; 


THIOREDOXIN F; 
CHAIN: A, B; 


THIOREDOXIN; CHAIN: 
NULL; 


CHLOROPLAST 
THIOREDOXIN M CH2; 
CHAIN: A; 


CHLOROPLAST 
THIOREDOXIN M CH2; 
CHAIN: A; 


CHLOROPLAST 
THIOREDOXIN M CH2; 
CHAIN: A; 


CHLOROPLAST 
THIOREDOXIN M CH2; 
CHAIN: A; 


CHLOROPLAST 
THIOREDOXIN M CH2; 
! CHAIN: A; 

i 


THIOREDOXIN; CHAIN: 
A; REF-1 PEPTIDE; 
CHAIN: B; 


CHAIN: B; 


Compound 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


OXIDOREDUCTASE DIMER, 
THIOREDOXIN, X-RAY 
CRYSTALLOGRAPHY, 
OXIDOREDUCTASE 


OXIDOREDUCTASE 
THIOREDOXIN M, THIOREDOXIN 
CH2, CHLOROPLASTIC 
THIOREDOXIN 


OXIDOREDUCTASE 
THIOREDOXIN M, THIOREDOXIN 
CH2, CHLOROPLASTIC 
THIOREDOXIN 


OXIDOREDUCTASE 
THIOREDOXIN M, THIOREDOXIN 
CH2, CHLOROPLASTIC 
THIOREDOXIN 


OXIDOREDUCTASE 
THIOREDOXIN M, THIOREDOXIN 
CH2, CHLOROPLASTIC 
THIOREDOXIN 


OXIDOREDUCTASE 
THIOREDOXIN M, THIOREDOXIN 
CH2, CHLOROPLASTIC 
THIOREDOXIN 


COMPLEX (ELECTRON 
TRANSPORT/PEPTIDE) COMPLEX, 
ELECTRON TRANSPORT/PEPTIDE 


ELECTRON TRANSPORT/PEPTIDE 
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0.50 


LY0 


0.17 
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Verify 
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* 

o 
o 




1.00 


0.83 


-0.05 
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1.00 


PMF 
score 


93.85 






99.73 
















SEQFOL 
D score 


PROTEIN DISULFIDE 
ISOMERASE; CHAIN: 
NULL; 


PROTEIN DISULFIDE 
ISOMERASE; CHAIN: 
NULL; 


PROTEIN DISULFIDE 
ISOMERASE; CHAIN: 
NULL; 


PROTEIN DISULFIDE 
ISOMERASE; CHAIN: 
NULL; 


PROTEIN DISULFIDE 
ISOMERASE; CHAIN: 
NULL; 


PROTEIN DISULFIDE 
ISOMERASE; CHAIN: 
NULL; 


THIOREDOXIN M; 
CHAIN: A, B; 


THIOREDOXIN M; 
CHAIN: A, B; 


THIOREDOXIN M; 
CHAIN: A, B; 


THIOREDOXIN M; 
CHAIN: A, B; 


THIOREDOXIN M; 
CHAIN: A, B; 


Compound 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT, REDOX- 
ACTIVE CENTER, ISOMERASE, 2 
ENDOPLASMIC RETICULUM 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT, REDOX- 
ACTIVE CENTER, ISOMERASE, 2 
ENDOPLASMIC RETICULUM 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT, REDOX- 
ACTIVE CENTER, ISOMERASE, 2 
ENDOPLASMIC RETICULUM 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT, REDOX- 
ACTIVE CENTER, ISOMERASE, 2 
ENDOPLASMIC RETICULUM 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT, REDOX- 
ACTIVE CENTER, ISOMERASE, 2 
ENDOPLASMIC RETICULUM 


ELECTRON TRANSPORT _ 
ELECTRON TRANSPORT, REDOX- 
ACTIVE CENTER, ISOMERASE, 2 
ENDOPLASMIC RETICULUM 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 
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FRAGMENT OF AN 
IG*G1 IPFC 4 


IG GAMMA-2A CHAIN C 
REGION; CHAIN: A, B; 


CLASS II 

HISTOCOMPATIBILITY 
ANTIGEN, M ALPHA 
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ANTIGEN, M BETA 
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MAJOR 

HISTOCOMPATIBILITY 
COMPLEX ALPHA 
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CHAIN: B, E; MYELIN 
BASIC PROTEIN; CHAIN: 
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(IGG2A,KAPPA) 
FRAGMENT (2640) 


IDIOTYPIC FAB 730.1.4 
(IGGl) OF VIRUS IIAI 5 
CHAIN: L, H; IIAI 7 
ANTI-IDIOTYPIC FAB 
409.5.3 (IGG2A); IIAI 9 
CHAIN: M 5 1 IIAI 10 


IDIOTYPIC FAB 730.1.4 
(IGGl) OF VIRUS IIAI 5 
CHAIN: L, H; IIAI 7 
ANTI-IDIOTYPIC FAB 
409.5.3 (IGG2A); IIAI 9 
CHAIN: M, I IIAI 10 
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L, H, M, I; 
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FRAGMENT OF 
HUMANIZED ANTIBODY 
4D5, VERSION 4 IFVD 3 
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FRAGMENT OF 
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(KAPPA LIGHT CHAIN) 
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Compound 
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COMPLEX (IMMUNOGLOBULIN 
IGG1/IGG2A) 


IMMUNOGLOBULIN PROTEIN 
ENGINEERING, ANTIBODY 
DESIGN, IMMUNOGLOBULIN 2 
STRUCTURE, ANTIGEN-BINDING 
SITE, CANONICAL 
CONFORMATION, 3 
COMPLEMENTARITY- 
DETERMINING REGION 
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SEQFOL 
D score 


CAG-ALPHA; CHAIN: A, 


HIGH-AFFINITY 
BRANCHED-CHAIN 
AMINO ACID CHAIN: A 


HIGH-AFFINITY 
BRANCHED-CHAIN 
AMINO ACID CHAIN: A 


MALTOSE TRANSPORT 
PROTEIN MALK; CHAIN: 
1,2; 


YEAST INITIATION 
FACTOR 4A; CHAIN: A, 
B; 


HYPOTHETICAL ABC 
TRANSPORTER ATP- 
BINDING PROTEIN 
CHAIN: A; 


HYPOTHETICAL ABC 
TRANSPORTER ATP- 
BINDING PROTEIN 
CHAIN: A; 


GUANYLATE KINASE; 
CHAIN: A; 


CHROMOSOME 
SEGREGATION SMC 
PROTEIN; CHAIN: A, B, 
C, D, E, F; 




Compound 


HYDROLASE TRAFFIC ATPASE; 


TRANSPORT PROTEIN BETA- 
CORE DOMAIN, ABC SPECIFIC- 
BETA-STRAND DOMAIN ALPHA- 2 
HELIX DOMAIN 


TRANSPORT PROTEIN BETA- 
CORE DOMAIN, ABC SPECIFIC- 
BETA-STRAND DOMAIN ALPHA- 2 
HELIX DOMAIN 


SUGAR BINDING PROTEIN MALK; 
ATPASE, ACTIVE TRANSPORT, 
MALTOSE UPTAKE AND 
REGULATION 


TRANSLATION EUKARYOTIC 
INITIATION FACTOR 4A; IF4A, 
HELICASE, DEAD-BOX PROTEIN 


STRUCTURAL GENOMICS 
TRANSPORTER 


STRUCTURAL GENOMICS 
TRANSPORTER 


TRANSFERASE GUANYLATE 
KINASE, SUBSTRATE-INDUCED 
FIT, DOMAIN MOVEMENT, 2 GMP, 
ATP, SUBSTRATE SPECIFICITY, X- 
RAY DIFFRACTION 


CHROMOSOME SEGREGATION 
SMC, STRUCTURAL 
MAINTENANCE OF 
CHROMOSOMES, COILED COIL 


2 KINASE, TRANSFERASE 
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SEQ ID 


Position 


Maximum score 


Average score 


916 


1-35 


0.978 


0.879 


918 


1-21 


0.953 


0.900 


921 


1-19 


0.957 


0.919 


923 


1-45 


0.961 


0.594 


924 


1-64 


0.994 


0.484 


925 


1-25 


0.936 


0.635 


926 


1-32 


0.997 


0.899 


927 


1-21 


0.991 


0.946 


928 


1-25 


0.929 


0.613 


929 


1-35 


0.850 


0.485 


930 


1-39 


0.861 


0.465 


931 


1-29 


0.906 


0.675 


932 


1-24 


0.905 


0.496 


933 


1-24 


0.993 


0.956 


934 


1-31 


0.937 


0.811 


935 


1-21 


0.989 


0.939 


936 


1-20 


0.973 


0.941 


937 


1-11 


0.800 


0.691 


938 


1-24 


0.912 


0.763 


939 


1-22 


0.992 


0.851 


940 


1-20 


0.975 


0.934 


941 


1-16 


0.978 


0.797 


942 


1-23 


0.966 


0.816 


943 


1-19 


0.905 


0.660 


944 


1-56 


0.966 


0.498 


945 


1-40 


0.859 


0.533 


946 


1-20 


0.953 


0.751 


947 


1-31 


0.833 


0.502 


948 


1-39 


0.986 


0.641 


949 


1-26 


0.978 


0.932 


950 


1-67 


0.976 


0.450 


951 


1-20 


0.991 


0.708 


952 


1-36 


0.954 


0.637 


953 


1-16 


0.844 


0.480 


954 


1-32 


0.953 


0.521 


955 


1-58 


0.989 


0.500 


956 


1-25 


0.981 


0.896 


957 


1-28 


0.954 


0.836 


958 


1-19 


0.982 


0.943 


959 


1-55 


0.988 


0.465 


960 


1-33 


0.925 


0.630 


961 


1-40 


0.916 


0.477 


962 


1-13 


0.816 


0.645 


963 


1-17 


0.946 


0.734 


964 


1-17 


0.985 


0.930 


965 


1-51 


0.991 


0.500 


966 


1-49 


0.963 


0.508 


967 


1-51 


0.971 


0.493 


968 


1-15 


0.959 


0.731 


969 


1-13 


0.822 


0.537 


970 


1-16 


0.862 


0.496 


971 


1-31 


0.956 


0.674 


972 


1-32 


0.855 


0.601 


973 


1-15 


0.826 


0.532 


974 


1-57 


0.954 


0.648 
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SEQ ID 


Position 


Maximum score 


Average score 


975 


1-20 


0.976 


0.854 


976 


1-22 


0.873 


0.586 


977 


1-26 


0.985 


0.839 


978 


1-49 


0.963 


0.774 


979 


1-26 


0.843 


0.609 


980 


1-57 


0.966 


0.561 


981 


1-16 


0.912 


0.705 


982 


1-22 


0.962 


0.827 


983 


1-13 


0.803 


0.580 


984 


1-51 


0.932 


0.462 


985 


1-20 


0.982 


0.890 


986 


1-20 


0.977 


0.923 


987 


1-21 


0.977 


0.868 


988 


1-23 


0.838 


0.580 


989 


1-39 


0.905 


0.628 


990 


1-23 


0.989 


0.864 


991 


1-20 


0.884 


0.650 


992 


1-19 


0.985 


0.920 


993 


1-28 


0.887 


0.505 


994 


1-27 


0.955 


0.743 


995 


1-27 


0.992 


0.934 


996 


1-53 


0.953 


0.723 


997 


1-30 


0.905 


0.522 


998 


1-20 


0.954 


0.746 


999 


1-19 


0.821 


0.482 


1000 


1-20 


0.978 


0.953 


1001 


1-45 


0.890 


0.608 


1037 


1-31 


0.921 


0.630 


1038 


1-36 


0.972 


0.563 


1039 


1-39 


0.976 


0.551 


1040 


1-26 


0.937 


0.703 


1041 


1-74 


0.991 


0.543 


1042 


1-23 


0.945 


0.797 


1043 


1-16 


0.977 


0.506 


1044 


1-21 


0.967 


0.759 


1045 


1-22 


0.861 


0.539 


1046 


1-27 


0.934 


0.682 


1047 


1-18 


0.983 


0.962 


1048 


1-22 


0.827 


0.517 


1049 


1-34 


0.980 


0.703 


1050 


1-15 


0.987 


0.955 


1051 


1-28 


0.995 


0.945 


1052 


1-18 


0.995 


0.977 


1053 


1-25 


0.935 


0.739 


1054 


1-19 


0.976 


0.950 


1055 


1-49 


0.933 


0.538 


1056 


1-22 


0.918 


0.723 


1057 


1-25 


0.972 


0.902 


1058 


1-52 


0.981 


0.622 


1059 


1-75 


0.969 


0.541 


1060 


1-75 


0.979 


0.817 


1061 


1-22 


0.957 


0.756 


1062 


1-45 


0.978 


0.852 


1063 


1-29 


0.984 


0.954 


1064 


1-36 


0.994 


0.713 
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SEQ ID 


Position 


Maximum score 


Average score 


1065 


1-34 


0.914 


0.608 


1066 


1-25 


0.986 


0.952 


1067 


1-38 


0.990 


0.909 


1068 


1-54 


0.908 


0.565 


1069 


1-20 


0.870 


0.708 


1070 


1-27 


0.985 


0.856 


1071 


1-23 


0.966 


0.812 


1072 


1-54 


0.967 


0.524 


1160 


1-36 


0.948 


0.776 


1161 


1-26 


0.962 


0.783 


1162 


1-31 


0.919 


0.543 


1164 


1-26 


0.988 


0.886 


1166 


1-53 


0.998 


0.751 


1167 


1-20 


0.993 


0.968 


1168 


1-24 


0.987 


0.924 


1169 


1-21 


0.973 


0.760 


1170 


1-56 


0.962 


0.585 


1171 


1-39 


0.978 


0.768 


1172 


1-25 


0.969 


0.937 


1173 


1-79 


0.967 


0.612 


1174 


1-23 


0.990 


0.934 


1175 


1-23 


0.990 


0.932 


1176 


1-42 


0.977 


0.677 


1177 


1-22 


0.969 


0.897 


1178 


1-17 


0.869 


0.641 


1179 


1-45 


0.987 


0.656 


1180 


1-31 


0.956 


0.853 


1181 


1-64 


0.996 


0.558 


1182 


1-50 


0.986 


0.566 


1183 


1-38 


0.987 


0.864 


1184 


1-28 


0.987 


0.924 


1185 


1-48 


0.911 


0.557 


1186 


1-58 


0.914 


0.462 


1187 


1-20 


0.992 


0.794 


1188 


1-24 


0.894 


0.608 


1189 


1-19 


0.986 


0.967 


1190 


1-23 


0.965 


0.817 


1191 


1-23 


0.965 


0.817 


1192 


1-53 


0.884 


0.504 


1193 


1-31 


0.982 


0.857 


1194 


1-18 


0.805 


0.583 


1195 


1-20 


0.950 


0.843 


1196 


1-25 


0.803 


0.575 


1197 


1-26 


0.869 


0.476 


1198 


1-23 


0.982 


0.866 


1199 


1-21 


0.959 


0.915 


1200 


1-30 


0.983 


0.652 


1201 


1-21 


0.932 


0.654 


1202 


1-21 


0.932 


0.654 


1203 


1-20 


0.932 


0.697 


1204 


1-26 


0.946 


0.775 


1205 


1-20 


0.986 


0.948 


1206 


1-64 


0.865 


0.475 


1207 


1-44 


0.881 


0.488 


1208 


1-19 


0.991 


0.973 
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SEQ ID 


Position 


Maximum score 


Average score 


1209 


1-38 


0.938 


0.459 


1210 


1-17 


0.954 


0.741 


1211 


1-20 


0.916 


0.802 


1212 


1-24 


0.961 


0.879 


1213 


1-20 


0.938 


0.716 


1214 


1-19 


0.959 


0.752 


1215 


1-55 


0.946 


0.682 


1216 


1-23 


0.965 


0.883 


1217 


1-23 


0.965 


0.883 


1218 


1-21 


0.909 


0.693 


1219 


1-17 


0.971 


0.886 


1220 


1-22 


0.833 


0.607 


1221 


1-16 


0.948 


0.836 


1222 


1-27 


0.968 


0.788 


1223 


1-23 


0.853 


0.523 


1224 


1-23 


0.842 


0.544 


1225 


1-19 


0.937 


0.713 


1226 


1-47 


0.947 


0.637 


1227 


1-21 


0.984 


0.933 


1228 


1-34 


0.969 


0.882 


1229 


1-15 


0.894 


0.745 


1230 


1-26 


0.966 


0.760 


1231 


1-28 


0.883 


0.474 


1232 


1-32 


0.939 


0.693 


1233 


1-2S 


0.915 


0.655 


1234 


1-28 


0.990 


0.959 


1235 


1-39 


0.926 


0.696 


1236 


1-17 


0.882 


0.558 


1237 


1-16 


0.967 


0.933 


1238 


1-19 


0.942 


0.799 


1239 


1-19 


0.910 


0.773 


1240 


1-19 


0.947 


0.805 


1241 


1-20 


0.976 


0.875 


1242 


1-20 


0.976 


0.875 


1243 


1-15 


0.973 


0.942 


1244 


1-16 


0.965 


0.911 


1245 


1-17 


0.985 


0.956 


1246 


1-21 


0.883 


0.620 


1247 


1-13 


0.874 


0.527 


1248 


1-21 


0.945 


0.852 


1249 


1-24 


0.903 


0.540 


1250 


1-26 


0.959 


0.786 


1251 


1-30 


0.934 


0.585 


1252 


1-27 


0.962 


0.740 


1253 


1-25 


0.842 


0.507 


1254 


1-19 


0.965 


0.797 


1255 


1-20 


0.970 


0.804 


1256 


1-19 


0.956 


0.791 


1257 


1-21 


0.977 


0.868 


1258 


1-23 


0.932 


0.770 


1259 


1-31 


0.960 


0.635 


1260 


1-24 


0.954 


0.803 


1261 


1-43 


0.949 


0.467 


1262 


1-28 


0.951 


0,667 


1263 


1-28 


0.963 


0.844 
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SEQ ID 


Position 


Maximum score 


Average score 


1264 


1-60 


0.971 


0.472 


1265 


1-17 


0.950 


0.878 


1266 


1-22 


0.984 


0.852 


1267 


1-32 


0.939 


0.646 


1268 


1-50 


0.939 


0.491 


1269 


1-42 


0.841 


0.500 


1270 


1-23 


0.923 


0.602 


1271 


1-20 


0.881 


0.514 


1272 


1-20 


0.881 


0.514 


1273 


1-21 


0.839 


0.518 


1274 


1-33 


0.955 


0.547 


1275 


1-26 


0.884 


0.488 


1276 


1-27 


0.867 


0.555 


1277 


1-18 


0.822 


0.521 


1278 


1-20 


0.814 


0.562 


1279 


1-24 


0.921 


0.756 


1280 


1-50 


0.845 


0.490 


1332 


1-27 


0.968 


0.912 


1335 


1-38 / 


0.937 


0.565 


1338 


1-23 


0.993 


0.924 


1339 


1-28 


0.995 


0.945 


1340 


1-28 


0.995 


0.945 


1345 


1-73 


0.997 


0.744 


1346 


1-24 


0.985 


0.690 


1347 


1-16 


0.983 


0.952 


1348 


1-16 


0.949 


0.869 


1349 


1-16 


0.996 


0.959 


1354 


1-49 


0.989 


0.784 


1355 


1-77 


0.963 


0.578 


1356 


1-24 


0.974 


0.882 


1357 


1-38 


0.979 


0.810 


1359 


1-24 


0.981 


0.938 


1361 


1-24 


0.998 


0.972 


1362 


1-22 


0.963 


0.884 


1364 


1-33 


0.997 


0.864 


1366 


1-20 


0.890 


0.648 


1367 


1-23 


0.996 


0.932 


1370 


1-38 


0.983 


0.894 


1371 


1-19 


0.960 


0.521 


1372 


1-18 


0.986 


0.589 


1373 


1-20 


0.929 


0.768 


1375 


1-17 


0.977 


0.921 


1377 


1-19 


0.985 


0.935 


1379 


1-40 


0.961 


0.521 


1382 


1-21 


0.985 


0.922 


1384 


1-51 


0.976 


0.781 


1385 


1-29 


0.984 


0.954 


1386 


1-19 


0.981 


0.939 


1387 


1-17 


0.979 


0.938 


1388 


1-33 


0.953 


0.828 


1390 


1-42 


0.974 


0.616 


1394 


1-28 


0.989 


0.912 


1396 


1-33 


0.979 


0.698 


1397 


1-24 


0.927 


0.726 


1399 


1-46 


0.991 


0.727 
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SEQ ID 


Position 


Maximum score 


Average score 


1400 


1-44 


0.900 


0.657 


1401 


1-17 


0.996 


0.872 


1402 


1-39 


0.989 


0.941 


1403 


1-21 


0.975 


0.604 


1404 


1-16 


0.957 


0.870 


1405 


1-20 


0.990 


0.875 


1407 


1-71 


0.981 


0.566 


1408 


1-18 


0.985 


0.928 


1410 


1-14 


0.941 


0.861 


1412 


1-32 


0.935 


0.549 


1419 


1-20 


0.989 


0.960 


1420 


1-50 


0.987 


0.583 


1421 


1-27 


0.977 


0.849 


1422 


1-45 


0.981 


0.817 


1423 


1-28 


0.988 


0.923 


1430 


1-73 


0.997 


0.731 


1432 


1-23 


0.992 


0.956 


1433 


1-23 


0.992 


0.956 


1437 


1-60 


0.976 


0.737 


1442 


1-27 


0.975 


0.937 


1467 


1-57 


0.931 


0.514 


1468 


1-15 


0.810 


0.544 


1469 


1-41 


0.926 


0.451 


1470 


1-31 


0.914 


0.618 


1471 


1-33 


0.889 


0.542 


1472 


1-20 


0.941 


0.754 


1473 


1-55 


0.929 


0.541 


1474 


1-36 


0.951 


0.625 


1475 


1-18 


0.897 


0.608 


1476 


1-22 


0.980 


0.917 


1477 


1-73 


0.959 


0.450 


1478 


1-18 


0.946 


0.791 


1479 


1-22 


0.951 


0.739 


1480 


1-73 


0.968 


0.456 


1481 


1-25 


0.879 


0.674 


1482 


1-20 


0.946 


0.806 


1483 


1-24 


0.992 


0.825 


1484 


1-17 


0.836 


0.620 


1485 


1-53 


0.917 


0.508 


1486 


1-19 


0.872 


0.724 


1487 


1-58 


0.954 


0.452 


1488 


1-19 


0.966 


0.927 


1489 


1-36 


0.979 


0.862 


1490 


1-25 


0.976 


0.803 


1491 


1-26 


0.898 


0.658 


1492 


1-22 


0.948 


0.844 


1493 


1-22 


0.894 


0.673 


1494 


1-52 


0.937 


0.660 


1495 


1-37 


0.930 


0.614 


1496 


1-25 


0.979 


0.914 


1497 


1-22 


0.972 


0.929 


1498 


1-21 


0.978 


0.816 


1499 


1-24 


0.839 


0.692 


1500 


1-15 


0.823 


0.560 


1501 


1-16 


0.960 


0.845 
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SEQ ID 


Position 


Maximum score 


Average score 


1502 


1-29 


0.940 


0.683 


1503 


1-12 


0.832 


0.523 


1504 


1-36 


0.945 


0.749 


1505 


1-33 


0.843 


0.488 


1506 


1-23 


0.901 


0.626 


1507 


1-20 


0.834 


0.505 


1508 


1-25 


0.919 


0.690 


1509 


1-44 


0.885 


0.513 


1510 


1-23 


0.974 


0.825 


1511 


1-13 


0.941 


0.847 


1512 


1-42 


0.885 


0.513 


1513 


1-18 


0.903 


0.592 


1514 


1-30 


0.948 


0.816 


1515 


1-16 


0.944 


0.818 


1516 


1-23 


0.849 


0.631 


1517 


1-13 


0.813 


0.610 


1518 


1-20 


0.962 


0.921 


1519 


1-23 


0.823 


0.495 


1520 


1-2 


0.893 


0.458 


1521 


1-17 


0.902 


0.762 


1522 


1-20 


0.952 


0.772 


1523 


1-23 


0.991 


0.925 


1524 


1-15 


0.858 


0.647 


1525 


1-15 


0.979 


0.961 


1526 


1-23 


0.926 


0.669 


1527 


1-23 


0.942 


0.686 


1528 


1-22 


0.967 


0.923 


1529 


1-22 


0.913 


0.519 


1530 


1-27 


0.867 


0.508 


1531 


1-15 


0.807 


0.631 


1532 


1-39 


0.907 


0.475 


1533 


1-24 


0.823 


0.503 


1534 


1-16 


0.892 


0.668 


1535 


1-16 


0.860 


0.607 


1536 


1-28 


0.920 


0.686 


1537 


1-20 


0.801 


0.618 


1538 


1-28 


0.887 


0.643 


1539 


1-21 


0.927 


0.568 


1540 


1-22 


0.997 


0.951 


1541 


1-22 


0.985 


0.827 


1542 


1-24 


0.962 


0.871 


1543 


1-22 


0.972 


0.932 


1544 


1-23 


0.908 


0.678 


1545 


1-27 


0.899 


0.579 


1546 


1-16 


0.988 


0.955 


1547 


1-16 


0.873 


0.622 


1548 


1-28 


0.840 


0.493 


1549 


1-15 


0.889 


0.689 


1550 


1-21 


0.889 


0.633 


1551 


1-44 


0.981 


0.617 


1552 


1-18 


0.881 


0.649 


1553 


1-16 


0.805 


0.631 


1554 


1-22 


0.965 


0.807 


1555 


1-26 


0.928 


0.744 


1556 


1-37 


0.934 


0.475 
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SEQ ID 


Position 


Maximum score 


Average score 


1557 


1-31 


0.935 


0.631 


1558 


1-20 


0.990 


0.868 


1559 


1-13 


0.853 


0.530 


1560 


1-21 


0.833 


0.604 


1561 


1-25 


0.922 


0.642 


1562 


1-24 


0.960 


0.683 


1563 


1-31 


0.908 


0.677 


1564 


1-22 


0.980 


0.937 


1565 


1-17 


0.857 


0.523 


1566 


1-15 


0.878 


0.521 


1567 


1-19 


0.861 


0.660 


1568 


1-18 


0.963 


0.812 


1569 


1-32 


0.965 


0.659 


1570 


1-21 


0.919 


0.457 


1571 


1-33 


0.931 


0.799 


1572 


1-11 


0.962 


0.656 


1573 


1-22 


0.969 


0.862 


1574 


1-18 


0.907 


0.508 


1575 


1-19 


0.952 


0.751 


1576 


1-16 


0.949 


0.807 


1577 


1-20 


0.984 


0.912 


1578 


1-47 


0.956 


0.583 


1579 


1-30 


0.965 


0.639 


1580 


1-33 


0.921 


0.650 


1581 


1-21 


0.996 


0.915 


1582 


1-18 


0.992 


0.969 


1583 


1-19 


0.820 


0.492 


1584 


1-33 


0.895 


0.469 


1585 


1-37 


0.987 


0.701 


1586 


1-19 


0.882 


0.606 


1587 


1-19 


0.896 


0.652 


1588 


1-11 


0.896 


0.803 


1589 


1-22 


0.977 


0.769 


1590 


1-21 


0.916 


0.742 


1591 


1-15 


0.909 


0.760 


1592 


1-13 


0.925 


0.649 


1593 


1-26 


0.963 


0.782 


1594 


1-13 


0.840 


0.530 


1595 


1-35 


0.984 


0.835 


1596 


1-17 


0.813 


0.642 


1597 


1-20 


0.822 


0.535 


1598 


1-22 


0.996 


0.965 


1599 


1-17 


0.866 


0.603 


1600 


1-28 


0.983 


0.873 


1601 


1-36 


0.963 


0.813 


1602 


1-16 


0.942 


0.750 


1603 


1-16 


0.827 


0.545 


1604 


1-19 


0.990 


0.935 


1605 


1-28 


0.910 


0.588 


1606 


1-28 


0.990 


0.772 


1607 


1-30 


0.944 


0.835 


1608 


1-36 


0.963 


0.842 


1609 


1-18 


0.937 


0.839 


1610 


1-25 


0.966 


0.910 


1611 


1-33 


0.963 


0.577 
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SEQ ID 


Position 


Maximum score 


Average score 


1612 


1-27 


0.985 


/*\ ^-f mm mm 

0.755 


1613 


1-20 


0.970 


0.737 


1614 


1-11 


0.872 


0.706 


1615 


1-19 


0.932 


0.541 


1616 


1-14 


0.987 


0.968 


1617 


1-52 


0.948 


0.570 


1618 


1-31 


0.842 


0.473 


1619 


1-27 


0.985 


0.927 


1620 


1-17 


0.822 


0.503 


1621 


1-61 


0.982 


0.770 


1622 


1-20 


0.876 


0.549 


1623 


1-19 


0.872 


0.552 


1624 


1-16 


0.928 


0.878 


1625 


1-20 


0.914 


0.495 


1626 


1-17 


0.874 


0.653 


1627 


1-16 


0.969 


0.917 


1628 


1-21 


0.886 


0.568 


1629 


1-45 


0.931 


0.487 


1630 


1-25 


0.937 


0.771 


1631 


1-23 


0.811 


0.514 


1632 


1-21 


0.923 


0.724 


1633 


1-17 


0.833 


0.543 


1634 


1-24 


0.979 


0.927 


1635 


1-20 


0.972 


0.802 


1636 


1-22 


0.932 


0.577 


1637 


1-18 


0.894 


0.643 


1638 


1-29 


0.952 


0.789 


1639 


1-38 


0.918 


0.617 


1640 


1-26 


0.848 


0.707 


1641 


1-23 


0.989 


0.864 


1642 


1-67 


0.995 


0.642 


1643 


1-18 


0.970 


0.681 


1644 


1-16 


0.982 


0.941 


1645 


1-24 


0.971 


0.908 


1646 


1-25 


0.969 


0.794 


1647 


1-40 


0.863 


0.548 


1648 


1-20 


0.984 


0.822 


1649 


1-15 


0.803 


0.568 


1650 


1-57 


0.986 


0.558 


1651 


1-26 


0.975 


0.763 


1652 


1-19 


0.996 


0.953 


1653 


1-34 


0.994 


0.961 


1654 


1-37 


0.973 


0.869 


1655 


1-21 


0.901 


0.716 


1656 


1-21 


0.990 


0.932 


1657 


1-42 


0.959 


0.693 


1658 


1-18 


0.928 


0.723 


1659 


1-16 


0.914 


0.756 


1660 


1-16 


0.841 


0.615 


1661 


1-70 


0.932 


0.496 


1662 


1-25 


0.835 


0.567 


1663 


1-39 


0.958 


0.854 


1664 


1-19 


0.977 


0.790 


1665 


1-19 


0.962 


0.794 


1666 


1-22 


0.991 


0.954 
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SEQ ID 


Position 


Maximum score 


Average score 


1667 


1-39 


0.842 


0.525 


1668 


1-24 


r-\ jr-\ #-\ ay 

0,907 


0.722 


1669 


1-52 


XV rf— V /-• XV 

0.950 


0.667 


1670 


1-17 


0.940 


0.583 


1671 


1-26 


0.926 


0.642 


1672 


1-20 


0.964 


0.902 


1673 


1-22 


0.947 


0.788 


1674 


1-27 


xv ^v «x *»x 

0.889 


xv x** ^ ^ 

0.642 


1675 


1-16 


x% x*v rv j" 

0.985 


0.900 


1676 


1-42 


XX X^ XX 

0.869 


0.503 


1677 


1-73 


0.976 


0.512 


1678 


1-21 


/— V XX ^* •< 

0.951 


/-\ 0 1 X*" 

0.816 


1679 


1-25 


0.946 


0.771 


1680 


1-18 


0.975 


XV x^v /4 

0.954 


1681 


1-19 


Xv xx xx ^* 

0.926 


/X X- T,-X /TV 

0.729 


1682 


1-16 


XV XX 

0.975 


0.891 


1683 


1-25 


xv xx x"^* 

0.967 


0.881 


1684 


1-30 


0.877 


XV y| XX /"» 

0.492 


1685 


1-23 


0.990 


XX XX /x /— 

0.905 


1686 


1-16 


0.985 


XV XX X" 

0.964 


1687 


1-20 


/X ^x XX 

0.969 


xv n 

0.803 


1688 


1-23 


Xv Xv ^ 

0.965 


/X f-i x^ 

0.829 


1689 


1-20 


X\ : — 1 XV ^ 

0.894 


XV 

0.502 


1690 


1-23 


0.884 


xv xv r\ 

0.582 


1691 


1-19 


0.972 


XV XV ^ 

0.705 


1692 


1-16 


yX #X X* >— T 

0.887 


/x f\ 

0.670 


1693 


1-21 


/X XX x" ■* 

0.961 


0.74 1 


1694 


1-16 


/X /\ -4 y-X 

0.913 


0.452 


1695 


1-25 


/X v-x ^| f 

0.946 


x-v 0 ^> ^ 

0.829 


1696 


1-44 


XV /-v /X 

0.985 


0.510 


1697 


1-32 


XV X*\/X. /x 

0.920 


0.468 


1698 


1-38 


/X XV /X X^ 

0.886 


0.5 19 


1699 


1-15 


r\ xx ^7 /x 

0.978 


r\f\ A 

0.904 


1700 


1-20 


/X X\ /X /X 

0.930 


0.666 


1701 


«4 X^ ^ 

1-25 


/X /X /X /X 

0.988 


0.903 


1702 


1-26 


XX /\ /X -f 

0.901 


0.507 


1703 


1-38 


0.971 


0.793 


1704 


1-20 


0.977 


0.888 


1705 


1-30 


xv xv /x xv 

0.822 


0.550 


1706 


1-22 


fX XV jl J 

0.844 


/\ A TX 

0.472 


1707 


1-16 


XX XX XX /X 

0.923 


0.64 1 


1708 


1-21 


/X XX yX 

0.906 


XX >^/x >t 

0.624 


1709 


1-22 


XX XX ^9 

0.977 


0.880 


1710 


1-19 


0.946 


XV x-/xv O 

0.798 


1711 


1-24 


0.928 


0.557 


1712 


1-27 


0.969 


xv x%. r\ *l 

0.901 


1713 


1-30 


0.926 


0.755 


1714 


1-20 


0.975 


0.913 


1715 


1-23 


0.927 


0.606 


1716 


1-18 


0.924 


0.715 


1717 


1-31 


0.826 


0.618 


1718 


1-33 


0.977 


0.811 


1719 


1-26 


0.934 


0.636 


1720 


1-38 


0.960 


0.483 


1721 


1-36 


0.962 


0.537 
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SEQ ID 


Position 


Maximum score 


Average score 


1722 


1-70 


0.993 


0.825 


1723 


1-33 


0.928 


0.642 


1724 


1-38 


0.967 


0.807 


1725 


1-18 


0.988 


0,897 


1726 


1-16 


0.974 


0.949 


1727 


1-42 


0.951 


0.468 


1728 


1-29 


0.904 


0.595 


1729 


1-41 


0.938 


0.600 


1730 


1-22 


0.829 


0.545 


1731 


1-52 


0.990 


0.466 


1732 


1-26 


0.970 


0.909 


1733 


1-24 


0.971 


0.865 


1734 


1-39 


0.996 


0.867 


1735 


1-45 


0.961 


0.593 


1736 


1-22 


0.991 


0.950 


1737 


1-25 


0.978 


0.623 


1738 


1-18 


0.991 


0.947 


1739 


1-47 


0.926 


0.486 


1740 


1-20 


0.985 


0.959 


1741 


1-32 


0.995 


0.972 


1742 


1-17 


0.958 


0.656 


1743 


1-62 


0.964 


0.638 


1744 


1-18 


0.892 


0.717 


1745 


1-27 


0.992 


0.934 


1746 


1-21 


0.989 


0.962 


1747 


1-67 


0.991 


0.685 


1748 


1-22 


0.898 


0.623 


1749 


1-15 


0.981 


0.945 


1750 


1-19 


0.971 


0.786 


1751 


1-36 


0.983 


0.664 


1752 


1-30 


0.988 


0.830 


1753 


1-28 


0.953 


0.866 


1754 


1-64 


0.967 


0.707 


1755 


1-56 


0.967 


0.554 


1756 


1-35 


0.945 


0.589 


1757 


1-19 


0.922 


0.737 


1758 


1-30 


0.990 


0.739 


1759 


1-25 


0.951 


0.864 


1760 


1-55 


0.990 


0.483 


1761 


1-29 


0.982 


0.792 


1762 


1-27 


0.936 


0.759 


1763 


1-29 


0.960 


0.805 


1764 


1-18 


0.978 


0.906 


1765 


1-22 


0.962 


0.920 


1766 


1-21 


0.997 


0.955 


1767 


1-25 


0.942 


0.847 


1768 


1-32 


0.989 


0.830 


1769 


1-25 


0.880 


0.630 


1770 


1-26 


0.872 


0.514 


1771 


1-29 


0.990 


0.648 


1772 


1-38 


0.901 


0.670 


1773 


1-25 


0.992 


0.955 


1774 


1-22 


0.839 


0.574 


1775 


1-33 


0.970 


0.887 


1776 


1-43 


0.987 


0.716 
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Table 6 
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SEQ ID 


Position 


Maximum score 


Average score 


1777 


1-29 


0.831 


0.548 


1778 


1-42 


0.992 


0.654 


1779 


1-20 


0.936 


0.580 


1780 


1-24 


0.985 


0.941 


1781 


1-48 


0.996 


0.882 


1782 


1-58 


0.973 


0.595 


1783 


1-22 


0.899 


0.602 


1784 


1-46 


0.958 


0.672 


1785 


1-10 


0.881 


0.707 


1786 


1-22 


0.818 


0.577 


1787 


1-53 


0.903 


0.465 


1788 


1-77 


0.976 


0.483 


1789 


1-72 


0.973 


0.475 


1790 


1-79 


0.972 


0.734 


1791 


1-71 


0.958 


0.519 


1792 


1-25 


0.996 


0.966 


1793 


1-75 


0.975 


0.571 


1794 


1-43 


0.853 


0.587 


1795 


1-17 


0.948 


0.513 


1796 


1-26 


0.981 


0.938 


1797 


1-33 


0,986 


0.753 


1798 


1-21 


0.866 


0.619 


1799 


1-31 


0.965 


0.885 


1800 


1-51 


0.989 


0.477 


1801 


1-65 


0.977 


0.706 


1802 


1-21 


0.994 


0.891 


1803 


1-24 


0.986 


0.956 


1804 


1-25 


0.880 


0.602 


1805 


1-24 


0.991 


0.954 


1806 


1-24 


0.914 


0.580 


1807 


1-71 


0.956 


0.528 


1808 


1-38 


0.901 


0.670 


1809 


1-43 


0.975 


0.765 


1810 


1-14 


0.814 


0.590 


1811 


1-14 


0.909 


0.490 


1812 


1-54 


0.990 


0.503 


1813 


1-76 


0.999 


0.992 


1814 


1-61 


0.912 


0.473 


1815 


1-20 


0.975 


0.944 


1816 


1-45 


0.983 


0.465 


1817 


1-24 


0.936 


0.769 


1818 


1-62 


0.997 


0.748 


1819 


1-74 


0.986 


0.539 


1820 


1-59 


0.959 


0.559 


1821 


1-22 


0.860 


0.539 


1822 


1-30 


0.996 


0.822 
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SEQ ID 


Chromsomal location 


1 


13ql2-ql4 


2 


9q33-q34 


3 


2 


4 


13ql2-ql4 


5 


13ql2-ql4 


6 


13ql2-ql4 


7 


13ql2-ql4 


8 


13ql2-ql4 


9 


13ql2-ql4 


10 


17qlL2 


11 


19ql3.3-ql3.4 


12 


2 


13 


13ql2-ql4 


14 


8q 


15 


8q 


16 


Ilql3.1-ql3.3 


17 


20ql2 


18 


20ql2 


19 


20ql2 


20 


4 


21 


4 


22 


9q33-q34 


23 


9q33-q34 


24 


9q33-q34 


25 


9q33-q34 


26 


9q33-q34 


27 


9q33-q34 


28 


9q33-q34 


29 


9q33-q34 


30 


9q33-q34 


31 


9q33-q34 


32 


2 


33 


2 


34 


13ql2-ql4 


35 


13ql2-ql4 


36 


13ql2-ql4 


37 


13ql2-ql4 


38 


13ql2-ql4 


39 


13ql2-ql4 


40 


13ql2-ql4 


41 


13ql2~ql4 


42 


13ql2-ql4 


43 


13ql2-ql4 


44 


13ql2-ql4 


45 


13ql2-ql4 


46 


13ql2-ql4 


47 


13ql2-ql4 


J§ * 


13ql2-ql4 


49 


13ql2-ql4 


50 


13ql2-ql4 


51 


13ql2-ql4 


52 


13ql2-ql4 


53 


13ql2-ql4 


54 


13ql2-ql4 


55 


13ql2-ql4 
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Table 7 
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SEQ ID 


Chromsomal location 


56 


13ql2-ql4 


57 


13ql2-ql4 


58 


13ql2-ql4 


59 


13ql2-ql4 


60 


13ql2-ql4 


61 


13ql2-ql4 


62 


13ql2-ql4 


63 


13ql2-ql4 


64 


13ql2-ql4 


65 


13ql2-ql4 


66 


13ql2-ql4 


67 


13ql2-ql4 


68 


13ql2~ql4 


69 


13ql2-ql4 


70 


13ql2-ql4 


71 


13ql2-ql4 


72 


13ql2-ql4 


73 


13ql2-ql4 


74 


13ql2-ql4 


75 


13ql2-ql4 


76 


13ql2-ql4 


77 


2 


78 


7 


79 


2 


80 


2 


81 


19ql3.3-ql3.4 


82 


19ql3.3-ql3.4 


83 


19ql3.3-ql3.4 


84 


19ql3.3-ql3.4 


85 


19ql3.3~ql3.4 


86 


I9ql3.3-ql3.4 


87 

; 


19ql3.3-ql3.4 


88 


19ql3.3-ql3.4 


89 


19ql33-ql3.4 


90 


19ql3.3-ql3.4 


91 


9q33-q34 


92 


13ql2-ql4 


93 


20ql2 


94 


7 


95 


4 


96 


4 


97 


4 


98 


9q33-q34 


99 


9q33-q34 


100 


9q33-q34 


101 


9q33-q34 


102 


9q33-q34 


103 


9q33-q34 


104 


9q33-q34 


105 


2 


106 


I3ql2-ql4 


107 


13ql2-ql4 


108 


13ql2-ql4 


109 


13ql2-ql4 


110 


13ql2-ql4 
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SEQ ID 


Chromsomal location 


111 


13ql2-ql4 


112 


13ql2-ql4 


113 


13ql2-ql4 


114 


13ql2-ql4 


115 


13ql2-ql4 


116 


13ql2-ql4 


117 


13ql2-ql4 


118 


13ql2-ql4 


119 


13ql2-ql4 


120 


2 


121 


2 


122 


2 


123 


19ql3.3-ql3.4 


124 


19ql3.3-ql3.4 


125 


19ql3.3-ql3.4 


126 


13ql2-ql4 


127 


13ql2-ql4 


128 


20ql2 


129 


4 


130 


4 


131 


9q33-q34 


132 


2 


133 


2 


134 


2 


135 


13ql2-ql4 


136 


13ql2-ql4 


137 


13ql2-ql4 


138 


13ql2-ql4 


139 


13ql2-ql4 


140 


17 


141 


6p21.3 


142 


13ql2-ql4 


143 


13ql2~ql4 


144 


13ql2-ql4 


145 


13ql2-ql4 


146 


13ql2-ql4 


147 


13ql2-ql4 


148 


13ql2-ql4 


149 


13ql2-ql4 


150 


13ql2-ql4 


151 


13ql2-ql4 


152 


13ql2-ql4 


153 


13ql2-ql4 


154 


13ql2-ql4 


155 


13ql2-ql4 


156 


2 


157 


2 


158 


19ql3.3-ql3.4 


159 


19ql3.3-ql3.4 


160 


19ql3.3-ql3.4 


161 


19ql3.3-ql3.4 


162 


4 


163 


4 


164 


9q33-q34 


165 


13ql2-ql4 
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Table 7 
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SEQ ID 


Chromsomal location 


166 


13ql2-ql4 


167 


13ql2-ql4 


168 


13ql2-ql4 


169 


22ql2. 1-12.3 


170 


13ql2-ql4 


171 


13ql2-ql4 


172 


13ql2-ql4 


173 


13ql2-ql4 


174 


13ql2-ql4 


175 


13ql2-ql4 


176 


8q 


177 


20ql2 


178 


20ql2 


179 


4 


180 


4 


181 


4 


182 


9q33-q34 


183 


9q33-q34 


184 


9q33-q34 


185 


3 


186 


9q33-q34 


187 


9q33-q34 


188 


9q33-q34 


189 


9q33-q34 


190 


9q33-q34 


191 


9q33-q34 


192 


9q33-q34 


193 


2 


194 


2 


195 


13ql2-ql4 


196 


13ql2-ql4 


197 


13ql2-ql4 


198 


13ql2-ql4 


199 


13ql2-ql4 


200 


13ql2-ql4 


201 


13ql2-ql4 


202 


13ql2-ql4 


203 


13ql2-ql4 


204 


13ql2-ql4 


205 


13ql2-ql4 


206 


13ql2-ql4 


207 


13ql2-ql4 


208 


13ql2-ql4 


209 


13ql2-ql4 


210 


13ql2-ql4 


211 


13ql2-ql4 


212 


13ql2-ql4 


213 


13ql2-ql4 


214 


19ql3 


215 


13ql2-ql4 


216 


13ql2-ql4 


217 


13ql2-ql4 


218 


13ql2-ql4 


219 


13ql2-ql4 


220 


13ql2-ql4 
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Table 7 
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SEQ ID 


Chromsomal location 


221 


13ql2-ql4 


222 


13ql2-ql4 


223 


13ql2-ql4 


224 


13ql2-ql4 


225 


13ql2-ql4 


226 


13ql2-ql4 


227 


13ql2-ql4 


228 


13ql2-ql4 


229 


13ql2-ql4 


230 


13ql2-ql4 


231 


13ql2-ql4 


232 


13ql2-ql4 


233 


13ql2-ql4 


234 


13ql2-ql4 


235 


13ql2-ql4 


236 


13ql2-ql4 


237 


13ql2-ql4 


238 


5 


239 


5 


240 


19ql3.3-ql3.4 


241 


19ql3.3-ql3.4 


242 


19ql3.3-ql3.4 


243 


19ql3.3-ql3.4 


244 


19ql3.3-ql3.4 


245 


19ql3.3-ql3.4 


246 


19ql3.3-ql3.4 


247 


19ql3.3-ql3.4 


248 


19ql3.3-ql3.4 


250 


6 


256 


lp32 


259 


Ilql2-ql3.1 


268 


18pll.22-pll.21 


269 


7qll.23 


270 


3 


272 


9q34.3 


278 


9q32-q33 


279 


14qlL2 


280 


14qll.2 


283 


2 


285 


13 


298 


20ql2-ql3.1 


303 


1 


305 


19 


306 


19 


310 


4pl6-pl5 


311 


6 


316 


3p22-p21.3 


326 


lp3Ll-p22.3 


335 


llpter-pl5.5 


336 


3 


350 


llq 


353 


3 


355 


8p21-pl2 


356 


Iq23-q25.1 


357 


Iq23-q25.1 | 
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Table 7 
1122 



SEQ ID 


Chromsomal location 


359 


20ql3.2-ql3.33 


360 


13 


361 


13 


362 


22ql3.1 


369 


6 


373 


2p24.3-p24.1 


374 


2p24.3-p24.1 


383 


llql3 


384 


19ql3.2 


388 


17qll-q2Ll 


398 


8p23.2-p23.1 


403 


19 


407 


11 


421 


19ql3.1 


427 


20ql2-ql3.12 


428 


17 


429 


17 


431 


llql3 


432 


7q35 


433 


X 


435 


6q22.2-22.33 


437 


19 


439 


16q24.3 


445 


19ql3 


446 


12 


452 


14 


453 


3 


456 


14 


457 


10 


458 


10 


459 


3p21.1-ql3.13 


460 


lp36 


470 


14q31 


475 


4qll-ql3 


476 


6 


477 


15q21-q22 


478 


15q24-q26 


493 


19 


496 


6q24. 1-24.3 


497 


15 


501 


llql3 


505 


lq21 


506 


6p21.3 


507 


3 


510 


lq213 


512 


14 


513 


9 


515 


1 


521 


7q32-34 


522 


7q32-34 


532 


2 


545 


q2 1.2-2 1.3 


547 


11 


551 


llq22 


556 


10, 
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Table 7 
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SEQ ID 


Chromsomal location 


557 


9p24. 1-24.3, 


559 


16 


560 


18 


561 


9 


562 


22qll.22-12.3. 


565 


6q22.3-23.3 


566 


6q22. 1-22.33, 


567 


1 


568 


11 


569 


6q21 


571 


18, 


572 


15, 


573 


5 


574 


5 


575 


6p2 1.1-2 1.2. 


576 


8, 


577 


10 


578 


1 


579 


12 


580 


1 


581 


6ql6.3-22.1, 


582 


3 


583 


10 


585 


3 


587 


1 


588 


5 


590 


4 


592 


lq24. 


593 


1 


594 


8 


595 


17, 


596 


11 


597 


5 


598 


16 


599 


11 


600 


13, 


601 


Hq 


603 


2 


604 


6q25.2-26, 


606 


1 


607 


15 


608 


9 


609 


lq24 


610 


1 


611 


4 


612 


17, 


613 


8 


614 


17, 


615 


15 


616 


6, 


617 


15 


618 


2 


619 


3 


620 


17 


621 


8 
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Table 7 
1124 



SEQ ID 


Chromsomal location 


622 


14 


623 


6q24.1-25.2, 


624 


2 


625 


9 


626 


8 


627 


12q24 


629 


20, 


630 


NA 


631 


2 


632 


2 


633 


Xpl 1.21-11.22 


634 


17 


636 


2 


637 


22 


638 


1 


639 


3 


640 


19, 


641 


1 


642 


7 


643 


5 


645 


17 


647 


1, 


649 


6 


650 


7 


651 


5 


652 


19, 


653 


20qll. 1-11.23, 


654 


1 


655 


3 


656 


8 


657 


16 


659 


9 


660 


18 


662 


6p21.3, 


665 


X 


666 


5 


667 


1 


669 


2 


670 


13, 


671 


1 


672 


15 


673 


9, 


674 


10 


675 


9, 


676 


7 


678 


16 


679 


X 


680 


11, 


681 


19, 


683 


8 


684 


4 


685 


6ql6.1-16.3, 


686 


10, 


687 


14 


688 


4 
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SEO ID 


Chromsomal location 


689 


6q26-27 s 


690 


lp35.1-36.13, 


692 


6q26-27, 


695 


12 


700 


9, 


701 


5 


702 


2 


703 


8 


704 


10, 


705 


19 


706 


14 


708 


19, 


709 


17 


711 


16 


712 


16 


713 


5 


714 


16 


715 


19 


716 


16 


717 


12 


718 


15 


719 


15 


720 


14 


721 


16 


722 


5 


723 


6, 


724 


12q 


725 


1 


726 


8 


727 


15, 


728 


2 


729 


7 


730 


17, 


731 


21q22.1, 


732 


20pl2 


733 


11 


734 


1 


735 


15 


736 


2 


737 


lp36.21-36.33. 


738 


1 


739 


6q21-22.31, 


740 


15 


742 


12 


743 


UL 


744 


13q33.3-34, 


745 


17, 


746 


12 


747 


2 


749 


16 


752 


17 


754 


16 


755 


20ql2-13.12. 


756 


19 


757 


14 
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SEQ ID 


Chrornsomal location 


758 


8, 


759 


6, 


760 


22 


761 


19 


763 


13, 


764 


lq32.1-41 


765 


8, 


767 


20 


769 


llq, 


770 


1 


771 


6p22.3-24. 1, 


772 


9p23-24.3, 


773 


17 


774 


19 


775 


3 


776 


3 


777 


22ql2 


779 


lp36.13-36.31 5 


780 


17 


781 


7 


783 


4 


784 


6, 


785 


1 


786 


21q22.1, 


787 


6 


788 


1 


789 


15 


790 


20, 


791 


9 


793 


10, 


794 


16 


796 


18 


797 


18 


798 


10 


799 


8, 


800 


16 


801 


UNK 


803 


9, 


807 


20ql3.1. 


808 


10, 


809 


3 


810 


UNK 


815 


20. 


818 


Iq25. 1-31.1, 


820 


10 


821 


20pl2.2-13, 


822 


20 


823 


4, 


824 


1 


826 


5 


827 


8 


829 


7 


830 


lp21. 3-22.3, 


832 


1 


833 


12q 
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1127 



SEQ ID 


Chromsomal location 


834 


6 


836 


6, 


837 


10 


838 


9pl 1.2-21, 


839 


11 


840 


5 


841 


17 


842 


llq 


844 


1 


845 


Hq, 


846 


9 


848 


13, 


849 


12 


852 


1 


853 


16 


854 


15 


856 


6q25.3-26 


857 


6, 


858 


1 


859 


11 


860 


5p, 


861 


11 


862 


6, 


864 


Iq22-q23, 


865 


12p 


866 


lq3Z2~41 


867 


2 


868 


13q33.1-34, 


869 


6p21.3, 


870 


3 


872 


1 


873 


3 


874 


19, 


876 


9, 


877 


16pl3 


878 


5 


879 


16 


880 


5 


881 


16 


882 


15 


883 


15 


884 


9 


885 


Spll.2, 


886 


17 


887 


5 


888 


6, 


889 


13, 


890 


11 


891 


11 


892 


Xq22.3-23, 


894 


9, 


895 


10 


896 


15 


897 


11 


898 


U 
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SEQ ID 


Chromsomal location 


899 


6q25.2-26, 


900 


12 


901 


1 


902 


19 


903 


19, 


904 


4 


905 


5 


906 


5 


908 


19 


909 


4, 


910 


19, 


911 


15 
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1129 



SEQ ID 


Number of 


For each Transmembrane Domain, its Transmembrane 


NO: 


Transmembrane Domains 


Domain Position in SEQ ID NO: and its TM Pred Score 


1037 


1 


100-115:1952 


1038 


1 


732-749:2593 


1039 


1 


181-201:2410 


1040 


2 


53-68:1828 132-149:2533 


1041 


2 


53-69:2959 121-140:2878 


1042 


1 


536-560:2906 


1043 


1 


63-82:2545 


1044 


5 


86-102:1766 189-205:2721 229-244:1878 273-300:1714 
385-405:1946 


1045 


1 


629-645:2430 


1046 


1 


59-75:2149 


1047 


1 


306-332:2773 


1048 


1 


118-136:2329 


1049 


2 


98-113:2861 220-243:2391 


1050 


1 


151-169:2618 


1051 


2 


94-110:2524 124-146:2138 


1052 


1 


73-87:2180 


1053 


1 


206-226:2584 


1054 


1 


402-419:2096 


1055 


1 


343-361:1953 


1056 


1 


132-154:2199 


1057 


1 


590-613:2402 


1058 


2 


89-105:1748 155-173:2433 


1059 


1 


201-222:2190 


1060 


4 


254-277:2256 317-332:1771 442-460:2005 530-544:2110 


1061 


2 


169-186:1866 239-259:2042 


1062 


1 


63-77:1794 


1063 


1 


227-248:3456 


1064 


1 


133-148:2558 


1065 


3 


435-453:1849 505-526:2495 697-712:2057 


1066 


1 


317-340:2214 


1067 


1 


173-192:2637 


1068 


1 


63-79:1933 


1069 


1 


124-146:3384 


1070 


3 


82-102:2213 115-135:1769 160-185:2317 


1071 


1 


312-329:2354 


1072 


2 


116-131:3056 188-209:2254 


1073 


6 


48-71:1708 174-196:2300 237-254:1918 359-378:1887 
413-435:1864 501-518:2625 


1074 


6 


136-159:1708 262-284:2300 325-342:1918 447-466:1887 
501-523:1864 589-606:2625 


1075 


1 


352-376:2946 


1076 


3 


106-125:2854 226-241:1973 277-300:2759 


1077 


4 


85-105:2047 208-225:1907 309-330:2122 454-471:2461 


1079 


1 


60-75:2189 


1081 


3 


192-214:1705 236-259:1933 436-453:2349 


1082 


1 


459-477:1896 


1083 


1 


144-159:3028 


1084 


1 


144-159:3028 


1085 


1 


436-455:2525 


1086 


1 


705-724:2525 


1093 


z 


93-108:2014 249-264:2324 


1094 


5 


145-165:2633 316-331:2180 399-412:1770 481-496:2328 
541-560:2589 
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Table 8 
1130 



SEQ ID 


Number of 


roreacn iraDsmemnr<ine uum<un 9 iv» aiuiimiacihui 411c 


NO: 


Transmembrane Domains 


Lf Omul 11 JrOalllOU ill JUL/ 1^1 \J* «1IIU lla JL It J. riCU Otui c 


1095 


i 


/o-yz. iyi) i 


1 AQ7 

luy / 


i 
l 


OJ-1U.J.Z17J 


1 Ann 


i 


LOU™ 1 / O.ZUo J 


1 i aa 
1100 


1 


9-~£ 979.9A.oc 

ZDO-Z /Z.ZUoO 


11/11 
1101 


1 
1 


7 1 A 77£»7A8C. 
Z1U-ZZ0.ZU03 


1 i no 

1 102 


1 


0;5-y4.3Z3y 


1 1 AO 

1103 


2 


1 Q/1 HftLIIQl 1/1^ 9£9«1R19 
lo4-ZU 1:2 1 o3 Z43-ZOZ. lolZ 


1 1 A /f 

1104 


1 
1 


97£ 7Q<;.1A0A 

Z /0-Zy3.o0oU 


1 1 A*C 

1 106 


1 


i /y-iy4.zozu 


1 1 1 A 

1 110 


1 
1 


111 1 OQ.7^. 1 Q 

1 1 i-izy.zD iy 


1 1 1 O 

1112 


1 
1 


ioy-i.yu.zoou 


1113 


3 


oi-oz.zi-H yy- 1 j*t. l / 13) i iy- ioy.z/o3> 


111/1 
1 1 14 


i 
1 


OC.1711 
O0-53. 1 / 1 j 


1 1 K 
1 1 1 J 


3 


191 117-9/^a 1 on o i &'icnA ^nnj} I A* 9^8 55 
iz j** l j / .zot't iyu-zio.zu/^+ j)uu-o i*+.z joo 


I 1 1 /C 

I I lo 


2 


yo- .z,dD'+ z /u-zyj>.z i^o 


1117 
111/ 


i 

1 


77 09 «1 7Q1 

/ /-yz. i / y i 


1110 

1 1 lo 


i 
1 


00.7£77 
Oo-OO.ZO /Z 


I 1 1 A 

I I iy 


i 
1 


1 C.7/1 1 ^47>9Q1Q 

idz^i— i /.zy^y 


1 1 O 1 

1 121 


i 
1 


AC 1 1 'l.OQCQ 

yo-i i^.zyDo 


1 1 Ol 

1 123 


2 


G7 1A7«1Q91 1 £7 1 70.77/CA 

yz- 1 u / . 1 yzo i oz- 1 / o .z / ou 


1 124 


i 
1 


-1 Q/1.1 OOC 

/ i-y4: loJD 


1 1 oc 

1 123 


i 
1 


17Q /1A1.1991 

^ /y-^vuo.ozzi 


1127 


1 


152-182:1795 


1 1 OA 

1 129 


3 


7A1 on.O/ll? Ilfi 1^1«17A1 AzlQ 

ZUi-Zl /.Z4o / jjo-jJj. 1 /01 4 ii +y--frOO.ZDoy 


linn 

1130 


2 


AA 11/1.17C/1 1 AO 11A.7711 

yy-1 14:1/34 lUo-1 j5U:Z /j> 1 


1 1 O 1 

1131 


1 


1 1 '7*7 1 IQl.iniQ 

l i / /-l iyj.\5U3o 


1132 


1 


QC 111 -71A1 

yD-i 1 1 .Ld\) i 


1133 


2 


7AC 777.171 ^ 1A7 '377-17'J^ 

203-22 /: 1 / 13 3U /-32Z. 1 /33 


1 1 n 1 

1134 


i 
I 


1AO A.7J 1 1 

30o-330:Z4,31 


1136 


3 


AO 1 A7.1 71/1 7QO C1 1 1 -7A/£1 "5 *£1 17Q-1 77 A 1Q9 OQQ. 1QQQ 

y2-lU/. 1 /34 zyo-3 1 i:ZU03 303-3 /o. 1 /ZU JOZ-jyy. lyoo 
433-4 / 1 .ZU4u 


1 1 o7 

1137 


2 


7^.0/101 197 1/lQ«07/^0 
3o-/3:Z4ol iz/-i4o.zzoy 


1 1 1 o 

113o 


1 


ooo 7C1 .1 077 

ZZ0-Z3 1 : 1 oZZ 


1 ion 

i i3y 


4 


Q7 11^-IQft^ 177 1QA-1Q1Q SSQ Q0^-7A^ QS5?_ 1 nDS'9n77 

y/-iiD.iyfj 1 / /-ly^f.iyiy ooy-yuj.zuo^ yoo-iuuo.^vz,/ 


11/11 

1141 


i 
1 


991 9^19. 9071 

zzj-z4z.zy / 1 


1143 


3 


A1 CS'71fiQ 177 1GA-1Q7S 971 95JR-91R^ 

Oo-oo.zioy 1 / /-iy4. 10 /o z / i-zoo.z 1 so 


1 1 /i /i 
1 144 


Z 


11^ 11^-91Qfi 1 7Q 1QA-9^fi 

lio-ijo.zjyu i fv" ly-t-.zj ju 


ii/i^ 
1140 


7 

2 


/c/c 07.77m 1 1 n 19^-17-;-; 

O0-oZ,Z/Ul 1 1 U-1ZO. 1 / jj 


1 1 A 7 

1 14 / 


o 

2 


HA 1H£'9^0A 17Q 1 Q^S 
/4- LUO.ZDoU i->y-l JO. 1 y3o 


1 1 A Q 

1 14o 


i 
1 


J ZZ-3 H -r . ZO^tH- 


1 1 /lO 

1 14V 


7 

2 


Q7*9A9A 9A0-91 ^*997-S 


1 1 <A 

1 13U 


i 

1 


9nn 91 6«997<: 
zuu-zio.zz. /»? 


i i r 1 

1151 


i 
1 


Q9 IflQ-O^OO 

yz-iuy.z3oo 


1 1 -CO 

1 152 


o 

2 


14^ 1 /^Q- 1 ^ 1 7 14/^-1 QQ1 

143-ioy. ioo4 .> 1 /-j'+o. ioy 1 


1 1 f c 

1155 


/i 
4 


AI/1 C5.9Q/1 fi 71 8 717.7H1 < /I ^7 A Qf\* 1 C9Q ^1^_^^\1 • 1 QQQ 

04-o3:zy4o zi o-Z3z:zuio 43Z-4oU. 1 ozy z)33-jj3. lyyy 


1 1 C £Z 

1156 


t 
1 


11 1 11A»9^9/t 
31 1-33U.Z3Z4 




1 


78-1 1 1 '2597 

1 O 111.* i-i-J 7 i 


1159 


1 


163-180:2270 


1330 


1 


264-282:2674 


1331 


1 


105-128:2620 


1332 


1 


141-164:3091 


1333 


1 


394-409:1914 


1334 


1 


390-410:1947 


1335 


1 


67-82:1917 
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Table 8 
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SEQ ID 


Number of 


For each Transmembrane Domain, its iiansmemorane 


NO: 


Transmembrane Domains 


Domain Position in MiiV2 DJ invj. ana its irvi rrcu ocui c 


1336 


1 


301-317:3356 


1337 


1 


301-317:3356 


1338 


1 


427-443:1894 


1339 


2 


103-119:2524 133-155:213o 


1340 


2 


94-110.2524 124-146.2138 


1341 


2 


61-78:1768 212-233:2126 


1342 


1 


139-160:2023 


1343 


1 


331-353:1902 


1344 


1 


99-117:2054 


1345 


1 


52-67:1997 


1346 


1 


67-80:2411 


1347 




367-384:2888 


1348 


1 


457-476:2898 


1349 


2 


202-217:1923 272-288:2760 


1350 


5 


266-284:2563 328-346:2525 390-408:2516 452-4 /U.Z45o 






514-532:2395 


1351 


1 


94-1 16:2763 


1352 




45-62:2164 71-85:1838 


1353 


1 


482-504:3155 


1354 




1-17:2284 75-90:1824 1 45-160. 20 /y 


1355 


1 


12-28:1796 


1356 


1 


332-356:3221 


1357 


1 


9-30:2498 


1358 




202-215:2016 256-281: 1796 2oJ-3 12: lyOU 


1359 


1 


267-284:3050 


1360 




229-245:2437 316-333:1885 429-44o:Z5ey 


1361 


1 


1-17:2914 


1362 


1 


4-21:2014 


1363 


1 


962-980:1762 


1364 


1 


12-31:2460 


1365 


1 


38-54:3201 


1366 


1 


103-1 19:2300 


1367 


1 


1-15:2225 


1368 




ag /■j .ni 1 

48-64:1711 


1369 




AO f A nil 

48-64:1711 


1370 


1 


21-36:2592 


1371 




290-307:2420 419-433.2394 


1372 


1 


66-81:2345 


1373 


1 


3-18:1914 


1374 


1 


1 10-124:2032 


1375 


1 


3-18:1862 


1376 




37-56:2540 102-lZ/:l/0o iy4-ZUy.Z/o4 4J4-4D.3.zujo 


1377 


1 


1-20:2264 


1378 




259-274:2102 330-35o.244o 4 10-4^4. Z/OZ 


1379 


1 


24-43:2348 


1380 


1 


196-211:1704 


1381 




i en i7<.is^ 9^S-?^0 ?fiQ0 ^0^-^19-2565 


1382 




262-288:2789 


1383 




60-89:2283 119-140:1980 


1384 




21-41:2833 


1385 




227-248:3456 


1386 




3-19:2081 


1387 




162-180:3181 


138S 




12-27:2098 
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Table 8 
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SEQ ID 


Number of 


For each Transmembrane Domain, its Transmembrane 


NO: 


Transmembrane Domains 


Domain Position in SEQ ID NO: and its TM Pred Score 


1389 


1 


487-503:2039 


1390 


2 


95-111:2234 155-173:2180 


1391 


2 


87-106:2547 186-206:1903 


1392 


3 


142-162:2149 169-184:1733 265-283:2036 


1393 


3 


83-110:1977 142-159:1729 238-254:2361 


1394 


1 


12-30:2458 


1395 


1 


61-76:2746 


1396 


1 


14-31:1963 


1397 


1 


1060-1079:2742 


1398 




220-244:2016 


1399 


1 


17-36:2664 


1400 


1 


21-41:1851 


1401 


1 


37-69:2282 


1402 


1 


24-60:1926 


1403 




55-75:2438 


1404 


1 


237-252:2493 


1405 


1 


1-20:1866 


1406 


1 


-- am*. ^ ammm -M 

172-190:1781 


1407 


1 


90-111:2881 


1408 




6-23:2294 


1409 


1 


477-499:2659 


1410 


1 


231-249:2797 


1411 




64-84:1993 95-110:1932 149-167:2173 


1412 




166-182:1937 216-239:1819 326-341:1779 


1413 


1 


154-169:2224 


1414 


1 


241-254:1760 


1415 


2 


71-97:1818 234-254:2475 


1416 


1 


_* awm. ^ a^ A ata. Am Am *A 

143-164:2551 


1417 


2 


104-122:1815 147-162:2031 


1418 


2 


121-135:2273 209-234:2186 


1419 


2 


8-27:2032 82-102:1765 


1420 


1 


26-47:2557 


1421 


3 


- - . j am*, am*. Amm, A AMm. At amw A a+*t A*m, A*\_ Mt A+m tA *4 /— V >tf 

18-34:1798 220-243:1742 301-318:1915 


1422 


1 


27-42:3053 


1423 


1 


6-21:1974 


1424 


1 


345-360:2050 


1425 


1 


44-65:2195 


1426 


1 


763-779:1916 


1427 


1 


448-469:2686 


1428 


1 


597-614:2205 


1429 


1 


194-210:2157 


1430 


2 


- — - _ & ama atA. Amf ABA /"^f -Al A^L a^*L J~X 

22-44:2983 52-67:1999 


1431 


2 


22-39:1830 91-106:2145 


1432 


1 


6-26:2422 


1433 


1 


6-26:2422 


1434 


1 


72-89:1942 


1435 


2 


128-146:1770 337-352:3007 


1436 


1 


88-105:1803 


1437 


1 


1013-1035:2346 


1438 


4 


574-589:1938 650-668:2275 720-735:1944 793-810:1729 


1439 


4 


655-670:1938 731-749:2275 801-816:1944 874-891:1729 


1440 


1 


65-81:2339 


1441 


1 


131-151:1772 


1442 


2 


5-22:1872 248-273:2467 
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Table 8 
1133 



SEQ ID 
NO: 


Number of 

Transmembrane Domains 


For each Transmembrane Domain, its Transmembrane 
Domain Position in JSJSQ ID INO: ana us IM Fred bcore 


1443 


3 


179-194:2318 387-403:3014 44^-4o:> . 1 5 / 1 


1444 


2 


58-74:2684 238-264:2576 


1720 


1 


89-107:1832 


1721 


1 


225-240:2165 


1722 




13-29:2422 49-63:2158 


1723 


1 


129-144:1704 


1724 


1 


12-28:1900 


1725 




1-22:1840 143-158:1938 lol-19o:2aoU /4/-2dd:z^o4 


1726 


1 


1-15:1790 


1727 




100-122:2447 199-217:2147 


1728 


1 


12-28:1802 


1729 




21-37:1706 


1730 


1 


328-343:2384 


1731 




33-60:2908 185-211:2596 237-258:1815 


1732 




279-309:1795 


1733 


1 


6-26:2040 


1734 


1 


14-34:1939 


1735 


1 


17-38:1933 


1736 




1/1 '"» C A O A C\C 1 1 A.I TOO 

10-25:2484 95-110:1787 


1737 




5-21:1862 109-125:2714 


1738 


1 


l-2l:l88l 


1739 


1 


393-414:2402 


1740 


1 


3-21:2189 


1741 




15-33:3158 109-123:2786 364-380:2232 


1742 




564-580:1725 


1743 


\ 


40-60:2740 169-189:3006 


1744 


1 


214-228:2582 


1745 


1 


3-19:2099 


1746 




4-22:2753 247-263:2558 


1747 


1 


43-60:2160 


1748 


1 


195-217:2199 


1749 


4 


304-319:1707 427-447:1983 547-562:2389 657-683:2065 


1750 


5 


2-17:2136 753-768:2335 810-831:1715 906-928:1710 
1009-1031:2357 


1751 


1 


337-352:2864 


1752 


1 


1-24:3347 


1753 


1 


621-644:2150 


1754 


1 


88-106:2133 


1755 


4 


67-89:2584 130-148:1837 239-261 :lyz2 i34-J4y:^ozi 


1756 


1 


20-35:1702 


1757 


1 


471-488:1852 


1758 




8-25:2374 247-265:1721 


1759 


1 


10-26:2268 


1760 




37-55:2350 79-102:1867 


1761 


1 


3-16:1751 


1762 




10-29:2708 158-183:2332 413-430:1706 514-532:2104 


1763 




jjL-JJO,Zjo1 j 


1764 




1-15:2095 


1765 




3-20:1765 


1766 




3-27:2376 


1767 




11-39:1868 


1768 




150-169:1896 


1769 




91-112:2655 


1770 




347-362:1792 
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Table 8 
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SEQ ID 


Number of 


H 1 W fill fi ■ ■ • A F ■ 1 . _ _ • ^ 

For each Transmembrane Domain, its Transmembrane 


NO: 


Transmembrane Domains 


Domain Position in ID NO: and its 1M Fred Score 


1 771 

1 /71 


1 


"71 T TOT.OAOA 

717-737:30o0 


1 770 

1772 


1 


O A /t • 1 TOT 

9-24: 1787 


1 /73 


o 

2 


1 1 iC. 1 ATT OC 1 1 A. 0071 

1-16:19/7 85-110:29/1 


1 T~1 A 

1774 


3 


A r\r\ ri A.n/T/: r n COO . 1 tct /C7C £Q/Z- 1 OAO 

492-5 10:22oo 5 1 /-53Z: I /52 o /5-o9o: 1 0U0 


1775 


3 


16-33:2461 94~113:24o5 221-24 1 .zZdZ 


lllo 


l 


r\ a o A A 1 VI A 

24-39:2149 


1777 


l 


108-123:2281 


1778 


2 


23-42:3401 179-201:3489 


1779 


1 


r\ s~ -1 -1 tool 

296-311:1881 


1780 


1 


280-303:2645 


1 T O 1 

1781 


1 


22-37:2170 


1782 


1 


313-332:1963 


1783 


7 


n j/i A 1 A O O O A O O O 1 1 "f A A A /( A1 /OA 1 T A C C AO CTA.IOIA 

243-261:2338 288-311:1992 401-429:1705 502-519:1819 






CC/C C<;0.1 OOO cm /C1 /1 . 1 70A /C7C /CAA.OIAO 

556-568: looo 59/-614: 1 /oO 6 /5-69U:21U2 


1784 


1 


1 /<51 MC7.11AO 

1433-1457:3108 


1 nor 

1785 


l 


OAA /I AO . 1 occ 

390-403:1855 


1 TO /" 

1786 


2 


1 AO 1 A A . 1 TOT TOO IC^.IIIrt 

103-122:1737 738-756:2220 


1 TOT 

1787 


2 


1 1 1 O 1 AO C A 1 TO 1AO 1T/1/T 

116-131:2359 173-193:1746 


1788 


l 


lOO AAA O/l/tO 

128-144:2443 


1 T O f\ 

1789 


1 


TO AT 1 AOO 

73-97:1982 


1790 


l 


103-124:2547 


1791 


l 


CO ^TA-OCO^ 

53-69:2536 


1792 


3 


Oyl AA.T^"OA 1AOO "IAAO.1 TO A 1 1 A T 11 ZTT . TO zTT 

84-99:2680 1083-1098:1729 1147-1167:2267 


1 TAO 

1793 


1 


59-74:2644 


1 T A 

1794 


2 


T yl OA.OATO /T A AO . 1 ATT 

24-39:2073 69-92:1922 


1 r-J C\C 

1795 


1 


CA TO.T/1AA 

59-78:2490 


1 

1796 


t 

2 


1 1A1 1 ITI-TC^T/T 1 TT/1 1T/1/1.TAAA 

1101-1121:2566 1224-1244:2000 


1 TAT 

1797 


3 


r y( TT.1TOA 110 11 1 .1 O/CA 1 OA 1ACO/1AO 

54-73:1789 113-131:1860 182-195:2408 


1 T A O 

1798 


2 


T/T AT.1A/IO IOC 1 C C . O O /I T 

76-92:1943 135-155:3247 


1799 


i 
1 


1 A1 1 CA.O 1 A£T 

141-159:3106 


1 oaa 

1800 


1 


248-266:1883 


1801 


o 
2 


/IIO /I O A . 1 OC1 AC\£L -CT1.TAOT 

413-430:1851 496-521:2037 


1 0/\T 

1802 


1 


TOT O 1 1 . 1 non 

782-811:1739 


1803 


3 


OC 1AO.1O0T 11T 1 OO.n/l^ 1 /T lOH.I^CA 

85-108:1832 117-138:1746 166-187:2659 


1 O A A 

1804 


1 


1 Ol AAA. 1 OOA 

183-200: 1880 


1 OAC 

1805 


1 


C A A C/CO.IOTO 

549-568:1822 


1806 


t 

2 


101 1/1A.TAA1 1A1 A1A.ACOA 

121-142:2091 191-210:2580 


1 CAT 

1807 


3 


CO /TT.AOAO 111 111.nC£ 11C 111.1C1/C 

52-6/:2292 112-127:1756 215-232:2516 


1 OAO 

1808 


l 


A A 1 1 C.I OO /I 

99-115:1824 


1 OAA 

1809 


l 


TjI /I1.10TZ7 

24-41:1826 


1 fl 1 A 

1810 


3 


T1 A TOO.TOOO TZ"A Tf\T 1 TT*0 T/^T OIC.1 C\C\/7 

710-738:2388 769-792:1779 797-815:1896 


"1 O 1 1 

181 1 


•< 

I 


T1 AO TA/TA 

71-93:2069 


1010 

1812 


l 


OT 1 1 T. 1 OOT 

97-117:1891 


1813 


3 


12-27:2868 38-54:1801 59-89:3042 


1814 


1 


404-422:2347 


1 n 1 r 

1815 


■a 

1 ! 


1-19:2068 


1 Q 1 A 

lo lo 


t 

2 


/^o B/ionn 110 in/i. imn 
02-64. ZUIZ llo-134:19/0 


1817 


1 


68-83:2632 


1818 


2 


3-22:249147-61:1754 


1819 


1 


1187-1209:3678 


1820 


6 


4-29:1727 87-110:3204 95-129:1851 165-190:2822 214- 






230:1867 245-261:1911 


1821 


1 


324-342:2300 


1822 


1 


4-24:1911 
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Table 9 
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SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 
No. SEQ ID NO.) * 


1 


912 


1823 


2479 


784 2167 


2 


913 


1824 


2480 


790 12732 


3 


914 


1825 


2481 


787 7952 


4 


915 


1826 


2482 


787 7952 


5 


916 








6 


917 


1827 


2483 


787 5660 


7 


918 


1828 


2484 


790 6350 


8 


919 


1829 


2485 


790 18138 


9 


920 


1830 


2486 


784 2658 


10 


921 


1831 


2487 


788 11684 


11 


922 








12 


923 


1832 


2488 


789 1765 


13 


924 


1833 


2489 


784 6512 


14 


925 


1834 


2490 


789 4745 


15 


926 


1835 


2491 


790 14655 


16 


927 


1836 


2492 


791 2272 


17 


928 


1837 


2493 


791 1348 


18 


929 


1838 


2494 


790 23 


19 


930 


1839 


2495 


784 5039 


20 


931 


1840 


2496 


787 8518 


21 


932 


1841 


2497 


784 5125 


22 


933 








23 


934 








24 


935 


1842 


2498 


790 26893 


25 


936 


1843 


2499 


785 3384 


26 


937 


1844 


2500 


791 2238 


27 


938 


1845 


2501 


784 7881 


28 


939 








29 


940 








30 


941 


1846 


2502 


785 197 


31 


942 


1847 


2503 


785 836 


32 


943 


1848 


2504 


784 8990 


33 


944 


1849 


2505 


792 6041 


34 


945 


1850 


2506 


792 873 


35 


946 








36 


947 


1851 


2507 


784 4339 


37 


948 








38 


949 


1852 


2508 


792 5188 


39 


950 








40 


951 


1853 


2509 


784 6463 


41 


952 


1854 


2510 


784_2235 


42 


953 


1855 


2511 


784_2849 


43 


954 


1856 


2512 


785 206 


A A 

44 


955 


1857 


2513 


787 6368 


45 


956 


1858 


2514 


788_5838 


46 


957 


1859 


2515 


784 7012 


47 


958 








48 


959 


1860 


2516 


785 835 


49 


960 


1861 


2517 


784 7305 


50 


961 








51 


962 


1862 


2518 


784 8815 
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Table 9 
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SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identmcation of Priority 


full-length 


full-length 


* • 

contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 














IVn WO TT1 INTO ^ * 




yoj 


1 RA1 
IOOj 


i y 


IRA 71 

/ Ot / I -7 J 




yo4 


lo04 


ZDZU 


7R7 fno% 

/o / O /ZJ> 


*\ A 
34 


yoD 








JJ 


yoo 


1 R£5 


0^9 1 
ADL 1 


7R7 ^£QR 

/O / JO7O 


jO 


067 

yo / 


1 R66 

lODO 


9^97 
£ oil £ 


700 1957R 


■57 


yoo 


1 867 

1 OD / 


9^9^ 


7R4 

/ 0*T VJ*-T O ^ 


^r 

JO 


yoy 


1 R6R 
loOo 


9S9il 
ZjZ4 


/OH- lO^O 


^o 


Q70 

y /u 


1 R6Q 

i ooy 


9^9*> 

zjZj 


709 1 1 04 


OU 


Q71 

y / 1 


1 R7ft 
lo /U 




7R4 01 ^7 


01 


y /z 








Di 


y /o 


1 571 


9^97 
ZDZ / 


7R4 14HR 
/ o4 14UO 


OD 


OTA 

y /4 


lo /Z 




7R4 9^0R 


04 


y /j 


lo ID 


ZDzy 


7Ri4 1 nAQ 
/o4 lU4y 




OTA 

y /o 








DO 


y / / 








<^7 
0 / 


Q7R 

y /o 








Oo 


Q*7Q 

y /y 








oy 










/V 


QR1 

yo i 


1 Q7A 
1 o /4 




7R7 ^5191 
/ 0 / OlXl 


7 1 


yoz 


1 57^ 
15 / J 


9^1 1 
Z^O 1 




/z 




1 Q7^ 
lo /O 


9^19 
Z3JZ 


7R7 1H1Q7 
(Of LU l¥ f 


/o 


yo4 








HA 
/4 


or*; 
yoj 


1 £77 
16 / / 




7R7 10994 


75 
/D 


QR£ 

yoo 


lo/o 


9^^A 
Zj«54 


7RQ ^ROR 
/oy jOUo 


7/£ 
/O 


QR7 

y© / 








77 


QRR 

yoo 


1 £7Q 
lo/y 


9^^S 
ZjjJ 


7R4 f09H 

/ 0*+ U^O / 


72 
/O 


QRQ 

yoy 


1 ooU 


JO 


7RS ^R9 

/OJ JOZ 


/y 


QQfi 

yyu 








rh 


QQ1 

yy i 








R 1 
o i 


QQ9 

yyz 


1 RR1 
1 oo 1 


95^7 


70O 190^9 
1 y\j 1Z7JZ 


R9 

OZ 


yyj 








R^ 


OQA 

yy4 


l ooz 


9^^R 


7QO 1^RR7 


9/1 


yyj 


1 oo J 


9^^Q 

zjjy 


7R4 709^ 


oD 


yyo 








OO 


QQ7 

yy / 








Q7 
O / 


QQR 

yyo 


1 RR4 
loo4 


9^j.n 

ZD4U 


7on limn 

/7V i 1U1U 


r r 
oo 


QQO 

yyy 


1 RRS 

1 OO J 


9^A1 
ZJ41 


7QO 17^71 
/yu L / D / L 


RQ 


1UUU 








yu 


1UU1 








Q 1 

y i 


1UUZ 


1 OOD 




7R4 l-^^O 
/o4 1«?DU 


09 

yz 


L\J\JD 


1 RR7 




7Rzl 19^0 
/o4 1ZOU 


yo 


1UU4 


1 RRR 
looo 


I^AA 


7QH 9 1 

/yu zijoo 


y4 


1 fl05 
1UUD 


1 RRQ 
i ooy 


9^4^ 
ZD43 


/o4 /yiz 


05 


1 006 

L\J\J\J 








96 


1007 


1890 


2546 


784 8528 


97 


1008 


1891 


2547 


784 8105 


98 


1009 


1892 


2548 


787 5314 


99 


1010 


1893 


2549 


784 7469 


100 


1011 


1894 


2550 


790 11802 


101 


1012 


1895 


2551 


787 5599 


102 


1013 


1896 


2552 


784 5534 
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Table 9 
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SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 










No. SEQ ID NO.) * 


103 


1014 








104 


1015 


1897 


2553 


791 3043 


105 


1016 


1898 


2554 


784 8916 


106 


1017 


1899 


2555 


784 1271 


107 


1018 








108 


1019 








109 


1020 


1900 


2556 


784 6527 


110 


1021 








111 


1022 


1901 


2557 


790 14183 


112 


1023 


1902 


2558 


791 1570 


113 


1024 


1903 


2559 


787 2154 


114 


1025 


1904 


2560 


787 2849 


115 


1026 








116 


1027 








117 


1028 








118 


1029 


1905 


2561 


790 23824 


119 


1030 


1906 


2562 


787 10284 


120 


1031 


1907 


2563 


790 13062 


121 


1032 








122 


1033 








123 


1034 








124 


1035 


1908 


2564 


787 3576 


125 


1036 








126 


1037 








127 


1038 


1909 


2565 


789 2490 


128 


1039 


1910 


2566 


784 2340 


129 


1040 


1911 


2567 


784 3449 


130 


1041 


1912 


2568 


787 9834 


131 


1042 


1913 


2569 


787 7563 


132 


1043 








133 


1044 


1914 


2570 


784 969 


134 


1045 


1915 


2571 


784 9071 


135 


1046 


1916 


2572 


785 1259 


136 


1047 


1917 


2573 


790_19506 


137 


1048 


1918 


2574 


784 3978 


138 


1049 


1919 


2575 


784 3848 


139 


1050 


1920 


2576 


789 6174 


140 


1051 


1921 


2577 


785 1465 


141 


1052 


1922 


2578 


787 7763 


142 


1053 


1923 


2579 


790 25889 


143 


1054 








144 


1055 


1924 


2580 


784 4674 


145 


1056 


1925 


2581 


787 8960 


146 


•4 y\ mmm 

1057 


1926 


2582 


mm mm m± ^ 

787 2258 


147 


1058 


1927 


2583 


787 2290 


148 


1059 


1928 


2584 


784 8266 


149 


1060 


1929 


2585 


784 1397 


150 


1061 


1930 


2586 


784 8164 


151 


1062 


1931 


2587 


785 1003 


152 


1063 








153 


1064 
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Table 9 
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SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 










No. SEQ ID NO.) * 


154 


1065 


1932 


2588 


784 24b>o 


•4 /"* ^* 

155 


1066 


1933 


2589 


787 51o9 


156 


1067 








157 


1068 


1934 


2590 


nnn 1 A1CA 

787 10359 


158 


1069 








•4 ^ /~V 

159 


1070 








160 


1071 


1935 


2591 


790 17261 


161 


1072 


1936 


2592 


784 9629 


162 


1073 


1937 


2593 


784 3405 


163 


1074 


1938 


2594 


784 3405 


164 


1075 


1939 


2595 


790 4043 


165 


4 API 

1076 


1940 


2596 


TOT OOA/A 

787 3900 


166 


1077 


1941 


2597 


787_9753 


■4 *~ 

167 


1078 


1 942 


2598 


787 4766 


168 


1079 


1943 


2599 


'ion 0 ^ <~i 1 

789__3521 


169 


1080 


1 A A 

1944 


2600 


784 8097 


170 


1081 








171 


1082 


1945 


2601 


TO/1 "70 C 

784_735 


172 


1083 


I946 


2602 


7S4_441o 


173 


1084 


1947 


2603 


nn/l /I /f 1 O 

784 4418 


174 


1085 








175 


1086 








176 


1087 


1948 


2604 


784 1006 


177 


1088 


1949 


2605 


TO yl 1 COO 

784 1528 


178 


1089 








179 


■4 /~V /"V **"V 

1090 








180 


1091 


1 ATA 

1950 


2606 


791 2053 


181 


1092 








182 


1093 


1951 


<■» r\'~i 

2607 


TOO /t O /CA 

7oo 600U 


183 


1094 


1952 


2608 


785 1010 


184 


1095 


1953 


2609 


TO /I O C A 

784 6854 


185 


1096 


1954 


2610 


TO A O /T T/~^ 

784 8679 


186 


1097 


1955 


2611 


790 19249 


187 


-4 /"V 1"1 

1098 


1956 


2612 


TO A C C f 

784 5566 


188 


1099 


1957 


2613 


790 3027 


189 


1100 


1958 


yX X" -1 A 

2614 


790 3027 


190 


1101 


1959 


2615 


TAA OAAn 

790 3027 


191 


1102 








-4 /\/V 

192 


I 1 AO 

II 03 


I960 


2616 


Tf\ 1 o^CI 

791 2652 


193 


1104 


1961 


2617 


TO >1 T "1 "1 

784 7116 


194 


H05 


1962 


2618 


Ton 1 £T c 0 

789 lo5o 


195 


1106 


1963 


2619 


TOT T <TTO 

787 7673 


196 


1107 


1964 


2620 


790 3240 


197 


1 1 AO 








198 


1109 


1965 


262 1 


784 6361 


199 


1110 


1966 


2622 


784 297 


200 


1111 


1967 


2623 


790 13930 


201 


1112 








202 


1113 


1968 


2624 


784_8933 


203 


1114 








204 


1115 
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Table 9 
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SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




tueci (Attorney JUocKet 










1M0. oiliV^ W riKJ.) 


203 


1 1 1 /C 

1110 








206 


1 1 1 T 
ill/ 








OAT 

207 


1110 

1118 


1969 


2023 


TO/1 T/CA1 

/o4 /0U3 


20a 


1 1 in 

1119 


t ATA 

1970 


2020 


T5?T ^/l^l 

/o / 3433 


2oy 


1 1 OA 

1 120 


1 AT 1 

iy / 1 


202 / 


TO A /COTT 

/oy 03 / / 


210 


1101 
1 121 


1 mo 

iy /2 


202o 


TO/1 

/o4_/440 


T 1 1 

21 1 


1 1 TO 

1122 


1 ATI 

iy /3 


2629 


/o4 3422 


212 


1 1 OQ 

1 123 


1 AT /I 

iy /4 


203U 


/o4 23JU 


TIT 

21 J 


1 1 O/t 

1 124 


iy /3 


t/:q 1 
2o3 1 


/o / /2j / 


T 1 /I 

214 


1 nr 

1 123 


iy /o 


TAQT 

2032 


TO/1 ^/ITT 

/o4 34/3 


213 


1 120 


1 ATT 

iy / / 


2633 


TO/1 T7A1 

/o4 1 /y3 


216 


1 1 OT 

1 12/ 


1 070 

iy /o 


2034 


/o4 1UU2/ 


217 


1100 

1128 








O 1 o 

21b 


1 1 OA 

1129 


1 ATA 

iy/y 


2033 


TOT TOGO 

/o/ 2©yo 


219 


1130 


1 AOA 

1980 


2636 


TOO AT AO 

7oo y208 


y% r\ 

220 


1131 


1981 


2637 


787 2305 


221 


1100 
1132 


1 AOT 

1982 


T/C1 0 

2638 


TO /t O^ CiC 

/84 8336 


222 


1 1 

1133 








223 


1 134 


1 noa 
1983 


T<C2 A 

2639 


TOT Ctltl/Z 

/o/ 3/o6 


224 


1135 


1984 


2640 


79045 31 


225 


1 1 1 c 
1 136 


1 noc 
1983 


T*C/1 1 

2641 


/o4 0/Uo 


226 


1 1 it 
113/ 


1 AO<C 

iyoo 


2042 


/yu iy3io 


227 


1110 
1138 


1 AOT 

iy87 


2043 


TO/1 1 TO/1 

/o4 1 /o4 


226 


1 1 O A 

1139 


1 AOO 

iyoo 


2644 


TO/1 1 <OC 

/o4 ioyo 


229 


1 1 A A 
1 140 


1 AOA 

lyyy 


2643 


TOT 1 "2 /iCO 

/o / 13oo 


230 


1 1 vl 1 

1 141 


1 AAA 

iyyo 


2040 


TOO ^1 Q7 

/oy oiy2 


23 1 


1142 


1991 


O A 1 "! 

2647 


TO A A A AO 

784 4498 


TOT 

232 


1143 








TOT 

233 


1 t A A 

1144 


1 A AO 

1992 


264o 


TO A /CA/1T 

/c»y 0042 


234 


1 143 








235 


1 1 A H 

1 146 


1 AAO 

1993 


T tZ A A 

2649 


TA A A A C 1 

/90 4461 


236 


1 1 vl T 

1 14/ 








23 / 


1 1 JO 

1 14o 


1 QO/1 

iyy4 


203U 


TO/1 T^T-^ 

/o4 20 /3 


Tin 

23 0 


1 1 /IA 

1 149 


iyy3 


2631 


TOO /I^QI 

/oy 43yi 


Tin 

23y 


1 1 en 
1 150 


iyyo 


2632 


TO A /ICQI 

/oy 43yi 


T/1 A 

24U 


1 1 c 1 
1151 


1 QOT 

iyy / 


2653 


toa m 
/yu 1314j 


241 


1152 








242 


1153 


1 AAO 

199o 


O A 

2654 


TA 1 /1AQT 

/yi 4y 32 


T /! O 

243 


1 1 X? A 

1154 








244 


1155 


1 AAA 

1999 


t fee 

2655 


TO A 1 A 1 A 1 

784 10141 


245 


H56 


O AAA 

2000 


2656 


TO /I 1 ATT C 

784_10223 


246 


1157 


2001 


2657 


TOvl TTTT 

784 / /22 


247 


1 158 








24o 


I 1 £A 

I I j9 








249 


1160 


2002 


2658 


787 3228 


250 


1161 


2003 


2659 


785 3642 


251 


1162 


2004 


2660 


784 6469 


252 


1163 


2005 


2661 


787 6106 


253 


1164 


2006 


2662 


791 2793 


254 


1165 








255 


1166 


2007 


2663 


784 3318 
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Table 9 
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SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 






No. SEQ ID NO.) w 


256 


1167 








257 


1168 i 


2008 


2664 


/oy 62/U 


258 


1169 


2009 


2665 


*70 O 1 "3 O 

/oo lolOo 


259 


1170 


2010 


r\ f f r 

2666 


ion cc^i 

7o7 5541 


260 


1171 [ 








261 


1172 


2011 


2667 




262 


1173 


2012 


2668 


TO A A A 

/o4 44Uj 


263 


1174 


2013 


2669 


7o4 75 


264 


1175 


2014 


2670 


784 75o5 


265 


1176 


2015 


2671 


TOC iiOA 

/o5 Jo9U 


266 


1177 


2016 


L 2672 


/by 62->4 


267 


1178 


2017 


2673 


TO< OATO 


268 


1179 


2018 


2674 




269 


1180 








270 


1181 


2019 


2675 


TAA OIAOT 

/yu ziuy/ 


271 


1182 


2020 


2676 


'ion OA/tQ 


272 


1183 


2021 


2677 


lOA Q/Z^iA 

/o4 ooz4 


273 


1184 








274 


1185 


2022 


2678 


nor oiAH 

/o3 23(Jy 


275 


1186 








276 


1187 


2023 


2679 


784 1834 


277 


1188 


2024 


r\ s" o i"\ 

2680 


TOT OM1 

787 8921 


278 


1189 


2025 


2681 


TO A 

784 7125 


279 


1190 


2026 


2682 


785 2903 


280 


1191 


2027 


2683 


785 2903 


281 


1192 








282 


1193 








283 


1194 


2028 


2684 


TA 1 llOfl 

791 23o9 


284 


1195 








285 


1196 


2029 


2685 


TO A "7£ZAH 

/o4 /o4/ 


286 


1197 


2030 


2686 


to >l iroo 

784 352o 


287 


1198 


2031 


2687 


TO/1 rnrn 

784 5750 


288 


1199 








289 


1200 


2032 


O yTO Ci 

2688 


TOA O/IA/C 

/oy 34yo 


290 


1201 








291 


1202 








292 


1203 


2033 


2689 


TAA 1 /C<QO 

/yu looyy 


293 


1204 


2034 


zoyu 


/OJ 3 i DO 


294 


1205 


2035 


2691 


TO O 1 TOO/1 

788 12224 


295 


1206 


^ /"V <y 

2036 


2692 


tot rno/i 


296 


1207 








297 


1208 


2037 


2693 


TO /l ITA 

784 330 


298 


1209 


2038 


2694 


TA 1 or\TO 

791 2972 


299 


1 O 1 A 

12 1U 


OAO A 






300 


1211 








301 


1212 


2040 


2696 


784 528 


302 


1213 


2041 


2697 


787 1611 


303 


1214 


2042 


2698 


785 1187 


304 


1215 


2043 


2699 


784 8087 


305 


1216 


2044 


2700 


789 4427 


306 


1217 


2045 


2701 


789 4427 
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Table 9 
1141 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: ot 
contig peptide 
sequence 


Identification of Priority 
Application tn at contig 
nucieonae sequence was 

Nn ^IFO TO NO ^ * 


307 


1 o 1 o 








O AO 

308 


inn 
1219 


ZU40 


07fi0 
Z /UZ 


7QC OAQ1 


309 


1220 


OA/IT 

204/ 


Z /U3 


70 C 1 QAf) 


310 


1011 

1221 


OA/t 0 

204o 


0*7 A/1 
Z /U4 


/Ot 1 J L IK) 


311 


1222 


OA/1 A 

2049 


Z /Uj 


/o < + L*tD 1 


312 


1223 


O AC A 

2050 


07A£ 
Z /U0 




313 


1224 


205 1 


Z /U / 


7R7 ^17S 
/o / Jl / J 


314 


1 ooc 

122 j 


OACO 

2052 


07ft8 
Z /Uo 


7Q/t Q101 


315 


1 OO/C 

lZZo 


O AC7 

2053 


07AO 

z /uy 


70c a^QQ 


3lo 


122/ 


O AC/1 

2054 


071 n 

z / 1U 


7Q< OS1R 


317 


1 oos? 
lZZo 








318 


l22y 








319 


1 OIA 

1Z30 


ZUj j 


071 1 
Z / 1 1 


7RJ. 1010*; 


320 


123 1 


ZUjO 


0*71 o 
Z / 1Z 


7SJ. AQ7-d 


321 


1 oo o 

1232 


205 / 


0*71 O 
Z/ 13 


/oj jUaj 


322 


1 Ol 7 

1233 


205 o 


0*71 A 
Z / L*t 


70c inoR 

/oj OVJXo 


323 


1234 


OACO 

2UD9 


0*71 C 
Z / 1 J 


7R^ ^0^1 


324 


nor 
1Z3j 


OA6A 

2000 


0*71 6 
Z / 10 


75? c 710^ 


325 


1 00£ 

lZ3u 


OA61 
ZU01 


0*71 7 
Z / 1 / 


/OJ D\JOZ, 


326 


1 on 
123 / 


ZUOZ 


071 8 
Z / 1 o 


700 746^ 


327 


1238 


2U03 


071 Q 

z / iy 


7SS 9107 

/ O J Zlu/ 


328 


1239 


2004 


070O. 
Z / ZU 


78^ 10'*^ 
/OJ jujJ 


329 


1240 


ZUOj 


O70 1 

Z /Z 1 


7SS ^fl^7 

/ 0 J ->\J J 1 


330 


1 Oyl 1 

1241 


OA/^ 

2000 


z /zz 


/OJ jUjO 


331 


1 O /I o 

1242 


200/ 


^70^ 
Z/Zj 


70^ ^o^r 

/OJ JUJO 


332 


10/10 

1243 


ZUoo 


070A 
Z /ZH- 


/OJ 


■""i o o 

333 


1244 


2009 


070 C 
Z / ZD 


7QC O A/IO 
/OJ jUHj 


334 


1245 


20 /0 


Z /ZO 


7q< ^nj.'S 

/OJ JU'tJ 


O 1 ,C 

335 


1246 


OA*71 

20 /i 


0707 

z /z / 


753^ 91 1 A 
/O J Z 1 1** 


1 O 

336 


124/ 


O A70 

ZU /2 


0708 
Z /Zo 


7R^ A9J\ 
/OJ HOJ 


337 


1 O /l o 

124o 


20/3 


070Q 

z /zy 


/OJ DU7 


338 


1 O/IQ 

1249 


ZU /4 


077 A 
Z / JU 


75?< 1 S09 
/OJ Uua 


<n i f\ 

339 


1 OCA 

1250 


20 /5 


07Q 1 
Z /3 1 


7S*; 91^7 
/oj z 1 j / 


340 


125 1 


ZU /0 


0770 
Z / jZ 


/O J J 1 t/D 


1 A 1 

341 


1252 


O ATO 

ZU / / 


Z /33 


7R^ ISftR 
/OJ JJUO 


342 


1 o *ci 
1253 








343 


1 O C/1 

1254 


O A7Q 
ZU /a 


Z /J4 


70c Q 1 1 A 

/OJ J 1 1*T 


vi -J 

344 


1255 


OA7A 

2U /9 


071 C 

Z / 


7<3^ ^11^ 

/OJ J 1 1 J 


n A C 

345 


1250 


OAOA 

20o0 


Z /30 


78 S 9161 
/OJ Z10I 


346 


125 / 








o /in 

347 


1255 


ZUol 


0717 
Z 15 1 


7Q< ^19^ 
/oJ J 1ZJ 


348 


1 ICQ 

1259 


OACO 

ZUoZ 


Z / jo 


/OJ oJt 


349 


1 0£A 
1200 


ZUo j 


071Q 

z / jy 


78^ 1^04 

/OJ 1JV/H- 




1261 


2084 


2740 


790 3585 


351 


1262 








352 


1263 


2085 


2741 


784 1062 


353 


1264 


2086 


2742 


784 9616 


354 


1265 








355 


1266 


2087 


2743 


788 604 


356 


1267 


2088 


2744 


784 9474 


357 


1268 


2089 


2745 


784 9474 
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Table 9 
1142 



SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 










No. SEQ ID NO.) * 


358 


1269 








359 


1270 








360 


1271 


2090 


2746 


788 6828 


361 


1272 


2091 


2747 


788 6828 


362 


1273 


2092 


2748 


784 8575 


363 


1274 


2093 


2749 


790 21803 


364 


1275 


2094 


2750 


784 5329 


365 


1276 


2095 


2751 


784 10122 


366 


1277 


2096 


2752 


784 3644 


367 


1278 








368 


1279 


2097 


2753 


787 2863 


369 


1280 


2098 


2754 


784 854 


370 


1281 








371 


1282 








372 


1283 


2099 


2755 


787 3968 


373 


1284 


2100 


2756 


785 2864 


374 


1285 


2101 


2757 


785 2864 


375 


1286 








376 


1287 


2102 


2758 


784 4739 


377 


1288 








378 


1289 


2103 


2759 


791 771 


379 


1290 


2104 


2760 


792 4497 


380 


1291 


2105 


2761 


784 4833 


381 


1292 


2106 


2762 


785_3195 


382 


1293 


2107 


2763 


784_6243 


383 


1294 


2108 


2764 


784 1134 


384 


1295 


2109 


2765 


784 10176 


385 


1296 


2110 


2766 


784 3781 


386 


1297 


2111 


2767 


785 2441 


387 


1298 


2112 


2768 


787 4817 


388 


1299 


2113 


2769 


784_3255 


389 


1300 




■ 




390 


1301 


2114 


2770 


784 9436 


391 


1302 


2115 


2771 


784 5232 


392 


1303 


2116 


2772 


784 4604 


393 


1304 








394 


1305 


2117 


2773 


784 7986 


395 


1306 


2118 


2774 


787 10365 


396 


1307 


2119 


2775 


784 4897 


397 


1308 








398 


1309 


2120 


2776 


787 5541 


399 


1310 


2121 


2777 


787 7655 


400 


1311 


2122 


2778 


784 1002 


401 


1312 


2123 


2779 


787 8365 


402 


1313 








403 


1314 


2124 


2780 


784 9956 


404 


1315 


2125 


2781 


787 7927 


405 


1316 


2126 


2782 


787 10138 


406 


1317 


2127 


2783 


787 5878 


407 


1318 


2128 


2784 


784 6962 


408 


1319 


2129 


2785 


784 1341 
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Table 9 
1143 



SEQ ID JNU: oi 

iuli-iengtn 

nucicotiue 

conn on f*t* 


o I? f~\ TTI 7VJO- nf 
ID iWJ, OI 

Aill Inn fW*ri 

iuu-iengtii 

nanfi rl c» 
UcUUUc 

«P fill Oil Of 


OIjy xXr llvy» UI 
pnnf S Cf 

miclentidfi 

11UV1V KM HWV 

seauence 


SEO ID NO: of 
contie peptide 
seauence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
tiled (Attorney Docket 
No. SEQ ID NO.) * 


409 


1320 


2130 


2786 


787 894 


41 0 

TlU 


1321 








41 1 


1322 


2131 


2787 


784 5866 


41 9 

41Z 


1323 


2132 


2788 


784 5734 


41 1 


1324 








414 
tit 


1325 








41 5 


1326 


2133 


2789 


788 11985 


416 


1327 








41 7 


1328 


2134 


2790 


791 1605 


41 8 


1329 








41 9 


1330 


2135 


2791 


784 3232 


490 


1331 


2136 


2792 


784 1670 


491 


1332 








499 


1333 


2137 


2793 


784 2377 


49^ 


1334 


2138 


2794 


788 2631 


494 


1335 


2139 


2795 


784 8666 


495 


1336 


2140 


2796 


785 1109 


496 


1337 


2141 


2797 


785 1109 


497 


1338 


2142 


2798 


784 6176 


49 X 


1339 


2143 


2799 


785 1465 


499 


1340 


2144 


2800 


785 1465 


410 


1341 

x <mj r x 


2145 


2801 


784 5369 


411 


1342 


2146 


2802 


787 8763 


419 


1343 


2147 


2803 


784 5253 


411 


1344 


2148 


2804 


789 2568 


414 


1145 


2149 


2805 


790 3055 


415 


1346 


2150 


2806 


790 10356 


416 


1347 


2151 


2807 


785 2811 


417 


1348 


2152 


2808 


784 2496 


418 


1349 


2153 


2809 


784 2530 


419 


1350 


2154 


2810 


784 2168 


440 


1351 


2155 


2811 


784 7505 


441 


1352 


2156 


2812 


784 6670 


449 


1353 


2157 


2813 


787 2168 


441 


1354 


2158 


2814 


784 3022 


444 


1355 


2159 


2815 


790 1217 


445 


1356 


2160 


2816 


784 5473 


446 


1357 


2161 


2817 


784 7905 


447 


1358 








44 R 


1359 


2162 


2818 


784 9436 


44Q 


1360 


2163 


2819 


787 2898 


450 


1361 








4^1 


1362 








452 


1363 


2164 


2820 


784 4625 


453 


1364 


2165 


2821 


785 2622 


454 


1365 








455 


1366 


2166 


2822 


787 2386 


456 


1367 


2167 


2823 


788 9391 


457 


1368 


2168 


2824 


784 10104 


458 


1369 


2169 


2825 


784 10104 


459 


1370 


2170 


2826 


787 10366 
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Table 9 
1144 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
tiled (Attorney Docket 
No. kHQ ID NO.) 


460 


1 O T 1 

1371 


2171 


Z5Z / 




461 


1372 


O 1 TO 

2172 


Zo28 


7Qf 1 OOT 


462 


1 O T) 

1373 


O 1 TO 

2173 


noon 

2829 


TOT CTTO 


463 


1374 


2174 


O 0"3 A 

2830 


/o4 jUzj 


A S" A 

464 


1 O TC 

1375 








465 


-i <~> >-t 
1376 


O 1 TC 

21 75 


O OO 1 

2831 


TO -C "70A 

785 739 


466 


1377 


O 1 T /"" 

2176 


OOOO 

2832 


TOT 1 O/CA 

/87 1269 


467 


1378 








468 


1379 








469 


1380 








470 


1 O O 1 

1381 


O 1 TT 

2177 


OOOO 

2833 


T O C Ol/T 

/SO 32o 


471 


1382 


O 1 T O 

2178 


O OO A 

2834 


TO /I OO A 1 

784 o341 


472 


1 o oo 

1383 


O 1 TO 

2179 


o o o c 

2835 


TOT T C T 

787 757 


yl TO 

473 


1384 








474 


1 1 o C 

1385 








475 


1386 


O 1 o /\ 

2180 


O OO £Z 

2836 


TOT O /I </1 

/o/ o4j4 


/i t^ 
476 


1 O O T 

1387 


O 1 O 1 

2181 


2837 


7f|A O/iO/1 

/yU o424 


477 


1 o o o 

1388 








478 


1 o on 

1389 


O 1 oo 

2182 


OOOO 

2838 


tot neon 


479 


1 o r\n 

1390 


O 1 oo 

2183 


OOO A 

2839 


/o4 o90o 


480 


1391 


O 1 O A 

2184 


O O A A 

2840 


TOT 1/111 


481 


1 o rto 

1392 


O 1 o c 

2185 


O O A 1 

2841 


taa 1 nnnn 

/yu luyyy 


y4 OT 

482 


1 O 1^1 o 

1393 








oi 

483 


1394 








a a a 

484 


1395 








A Of 

485 


i o 

1396 








486 


1397 


O 1 O £ 

2186 


O O /l o 

2842 


TOT oni£ 

787 2916 


487 


1 1 AO 

1398 


O 1 OT 

2187 


O O /I o 

2843 


TOT f no^ 

787 5y8b 


/l o o 

488 


1399 


O 1 o o 

2188 


O O yi A 

2844 


nor o ATA 

785 zy/y 


489 


1400 


O 1 Ci c\ 

2189 


o o /i c 

2845 


nnr o 1 AC 

785 2105 


490 


1401 








491 


1402 








492 


1403 


o i c\r\ 

2190 


O O A £ 

2846 


TO C O A/1 *C 

7o5 304o 


493 


1404 


o 1 1 

2191 


O O /! T 

2847 


TOO AC /I <C 

788 9546 


494 


1405 


O 1 no 

21 92 


O O /l o 

2848 


TO C 11 CO 

785 2158 


495 


1406 








/I <~\ /f 

496 


1407 


O 1 AO 

2193 


O O A A 

2849 


TO A TC A A 

/o4 /o4y 


>i at 

497 


1 yl AO 

1408 


O 1 A/1 

2194 


Z530 


/o4 lOoo 


498 


1 A f\C\ 

1409 


2195 


AOf 1 

2851 


/o4 ZD /3 


499 


1410 


2196 


O OCO 

2852 


TCC C^"3 

/oj jd3 


500 


1411 








501 


1412 


2197 


2853 


TOO /CO A 

/oy 030 


502 


1413 


2198 


O O C A 

2854 


TO C OCA 

/o5 259 


jUo 


14 14 


O 1 oo 




/OH jOj/ 


504 


1415 


2200 


2856 


790 7396 


505 


1416 








506 


1417 


2201 


2857 


790 11548 


507 


1418 








508 


1419 


2202 


2858 


784 6740 


509 


1420 








510 


1421 


2203 


2859 


785 1007 
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Table 9 
1145 



SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 










No. SEQ ID NO.) * 


511 


1422 








512 


1423 








513 


1424 


2204 


2860 


784 3641 


514 


1425 








515 


1426 


2205 


2861 


789 6066 


516 


1427 


2206 


2862 


788 13582 


517 


1428 


2207 


2863 


784 9762 


518 


1429 


2208 


2864 


787 2052 


519 


1430 


2209 


2865 


790 3055 


520 


1431 








521 


1432 


2210 


2866 


790 23800 


522 


1433 


2211 


2867 


790 23800 


523 


1434 








524 


1435 


2212 


2868 


787 2548 


525 


1436 








526 


1437 


2213 


2869 


784 2565 


527 


1438 


2214 


2870 


784 544 


528 


1439 


2215 


2871 


784 544 


529 


1440 








530 


1441 


2216 


2872 


789 1844 


531 


1442 


2217 


2873 


787 2733 


532 


1443 


2218 


2874 


785 493 


533 


1444 


2219 


2875 


790 18203 


534 


1445 








535 


1446 


2220 


2876 


784 3336 


536 


1447 


2221 


2877 


790_777 


537 


1448 


2222 


2878 


787 4220 


538 


1449 


2223 


2879 


790 26438 


539 


1450 


2224 


2880 


790 13586 


540 


1451 


2225 


2881 


784 270 


541 


1452 








mm m jfm. 

542 


1453 


2226 


2882 


790 14176 


543 


1454 


2227 


2883 


784 3543 


544 


1455 


2228 


2884 


784 3575 


545 


1456 


2229 


2885 


787 5638 


546 


1457 


2230 


2886 


784 3826 


547 


1458 


2231 


2887 


787 6059 


548 


1459 


2232 


2888 


787 2585 


549 


1460 








550 


1461 








551 


1462 


2233 


2889 


787 2965 


552 


1463 








553 


1464 


2234 


2890 


790 13050 


CCA 

554 


1465 








555 


1466 


2235 


2891 


784 3374 


556 


1467 


2236 


2892 


787 8240 


557 


1468 


2237 


2893 


784 4398 


558 


1469 








559 


1470 


2238 


2894 


788 10891 


560 


1471 


2239 


2895 


787 9805 


561 


1472 
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Table 9 
1146 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 
No. SEQ ID NO.) * 


562 


1473 








563 


f A A 

1474 


2240 


2896 


787 7094 


564 


1475 


2241 


2897 


787 5209 


565 


1476 


2242 


A AAA 

2898 


784 582 


e~ f 

566 


1477 


2243 


2899 


*~1C\ A 1 A1 1 1 

790 10111 


567 


1478 


2244 


2900 


790 13563 


568 


1479 








569 


1480 


2245 


2901 


TOO OOO A 

787 8234 


570 


1481 








571 


1482 


2246 


2902 


»7AA 1 <TO /"\ H 

790 16394 


572 


1483 








573 


A A Ci A 

1484 


2247 


2903 


*ir\r\ 1 Oil 

790 18452 


574 


1 a a c 

1485 


2248 


2904 


ann oil 

790 211 


575 


1486 


2249 


2905 


haa 1 /1AOO 

790 14932 


576 


1487 








577 


1488 


2250 


2906 


TOT 1 ,( AO 

787 7498 


578 


1489 


225 1 


2907 


TO T ^ 1 A 

787 610 


579 


1490 


2252 


2908 


790_19096 


580 


1491 


2253 


2909 


no >< ha yo 

784 7363 


581 


1492 


2254 


2910 


«aa oi r r 

792 8355 


582 


1493 


2255 


2911 


TOT "1 1 T/"4 

787 1179 


583 


1494 


2256 


2912 


790 6822 


584 


1495 


2257 


2913 


784 8670 


585 


1496 


2258 


2914 


TO /f O yl /"/A 

784 2469 


586 


1497 








587 


1498 


2259 


2915 


790 15720 


588 


1499 


2260 I 


2916 


790 11300 


589 


1500 


226 1 


2917 


T O /I 1 /~\ /"V 

784 190 


590 


1501 








591 


1502 


2262 


2918 


789 6245 


592 


1503 


2263 


2919 


TOT TO o zr 

787 7385 


593 


1504 


2264 


2920 


Tr>/\ o o 1 

790 28 1 


594 


1505 


2265 


2921 


TO /I ^"1 OO 

784_6l92 


595 


1506 


2266 




TO/\ TT/\ 

790 770 


596 


1507 


2267 


2923 


TO yl OO /T/l 

784 3364 


597 


1508 


2268 


2924 


T O A £ A 1 *! A 

784 5424 


598 


i c r\c\ 

1509 








599 


1 .C 1 A 

1510 


2269 


2925 


TA A 1 A 1 1 O 

790 1U118 


600 


1 XT 1 1 

1511 








601 


1 r i n 

1512 








602 


1513 


2270 


2926 


785 2555 


603 


1514 








604 


1515 








605 


1 ^ 1 #c 
1Mb 




zyz / 


ta A OA 1 °< 

/yu zuioj 


606 


1517 


2272 


2928 


784 333 


607 


1518 


2273 


2929 


792 3164 


608 


1519 








609 


1520 








610 


1521 


2274 


2930 


784 3455 


611 


1522 


2275 


2931 


784 506 


612 


1523 


2276 


2932 


788 7179 
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Table 9 
1147 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 
No. SEQ ID NO.) * 


613 


1524 








614 


1525 








615 


1526 


2277 


2933 


790 13445 


616 


1527 








617 


1528 


2278 


2934 


784 6181 


618 


1529 


2279 


2935 


784 4453 


619 


1530 








620 


1531 


O o o O 

2280 


2936 


784 8140 


621 


1532 








622 


1533 


2281 


2937 


nod O >1 1 

784 3441 


623 


1534 


2282 


2938 


'-JOT £A f\A 

787 6194 


624 


1535 


2283 


2939 


784 1407 


625 


1536 








626 


1537 


2284 


2940 


790 8886 


627 


1538 








628 


1539 


2285 


2941 


"70 <r o^ e c 

785 2555 


629 


1540 


2286 


2942 


784_688 


630 


1541 


2287 


2943 


nn /) C5AO 

784 8302 


63 1 


1542 


2288 


2944 


TOT ICOI 

78/ 25 ol 


632 


1543 


2289 


2945 


r ic\r\ i r\ezi c 

/yu lyoio 


633 


1544 


2290 


2946 


/i54 4663 


634 


1545 


2291 


2947 


790 9138 


635 


1 C A f 

1546 








636 


1 C A T 

1547 


flrt Art 

2292 


2948 


TO /I /IT / O 

784 4743 


637 


"1 C A O 

1548 


2293 


2949 


Tf\ f\ 1 Of /l 1 

790 12541 


638 


1549 








639 


1550 








640 


1551 


2294 I 


2950 


TO A A 1 O O 

784 4128 


641 


1552 








642 


1553 


2295 


295 1 


TO/"\ 1 fVOiT'T 

790 19867 


643 


1554 


2296 


2952 


T O A in iCT 1 

784 7061 


644 


\555 


2297 


2953 


TO ^ 11 A Pi 

785 1399 


645 


-i c r* f 

1556 


rt ^ r\ o 

2298 


2954 


TO A CC41E 

784 5565 


646 


1557 


2299 


2955 


i—ir\r\ i n"OA 

790 10780 


647 


■I ^<-r> 

1558 








648 


1559 








649 


1560 


2300 


2956 


TO /I 11 /JO 

784 2148 


650 


1561 


2301 


2957 


rfrt /\ O A 1 T yl 

790 24174 


651 


1562 


2302 


2958 


TO A A 1 f\T 

784 4307 


652 


1563 


2303 


x^l f\ c f~\ 

2959 


790 20063 


653 


1564 








f E A 

654 


1565 


2304 


2960 


T O /I T/T C /I 

784 7654 


655 


1566 


O O £T 

2305 


2961 


TO A T A C\ A 

784 7454 


ODD 


lDO/ 








657 


1568 


2307 


2963 


784 5860 


658 


1569 








659 


1570 


2308 


2964 


788 11718 


660 


1571 


2309 


2965 


784 2105 


661 


1572 








662 


1573 


2310 


2966 


785 1924 


663 


1574 


2311 


2967 


784 8098 
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SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


j peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 










No. SEQ ID NO.) * 


004 


1 CHC 

15 /5 


on 1 o 

23 12 


2968 


784 2320 


ODJ 


15 /6 


1 O 

2313 


or*^/\ 

2969 


784 7695 


666 


loll 


2314 


r\ r\ r\ 

2970 


784 6847 


66 / 


1578 








ooo 


1579 








669 


1580 


23 15 


2971 


784 8098 


670 


1581 


no 1 
23 16 


2972 


784 4781 


6/1 


1 COO 

1582 








672 


1583 


2317 


2973 


792 8335 


673 


1584 








o /4 


1 roc 

1585 


00 1 O 

23 18 


2974 


789 2488 


6 O 


1 co^ 

1586 


00 1 f\ 

2319 


2975 


785 2433 


0 /o 


1587 








Of / 


1 coo 

1588 


2320 


2976 


787 3115 


678 


1589 








679 


1590 


2321 


2977 


787 5209 


OA 

680 


1591 


n O n n 

2322 


2978 


787 4223 


£TO 1 

681 


1592 


no n O 

2323 


2979 


784 5228 


682 


1593 


2324 


2980 


787 5094 


COT 

683 


1594 


2325 


2981 


790 10116 


684 


1595 








4ZOC 

685 


1596 


2326 


2982 


785 1351 


686 


1597 








68 / 


i cno 

1598 








o 

688 


1 con 

1599 


HO OT 

2327 


2983 


^1 1*\ A A /" * f— * 

784 4288 




1600 


2328 


AAA i 

2984 


784 4816 


6yu 


1601 


2329 


y"\ t~\ C\ ^ 

2985 


791 3535 


6yi 


1602 


2330 


A f\ c\ s~ 

2986 


784 4322 


692 


1603 


0001 
233 1 


n n <n i~t 

2987 


785 2272 


/COO 

693 


1604 


2332 


2988 


790 12959 


694 


1605 


2333 


a a o r\ 

2989 


784 4642 


695 


1606 


2334 


2990 


mm f-\ A m* tm jm. a 

784 3534 


696 


1607 








69/ 


1 AO 

1608 








/-no 
698 


1 A A 

1609 








oyy 


1 /C 1 A 

lo 10 


n o o c 

2335 


2991 


784 2904 


/oo 


1611 


n o o 

2336 


2992 


784 8547 


ta 1 

701 


1612 








702 


1 tz 1 o 

lpl3 








703 


1614 


2337 


2993 


m% mm mm, >\ mmm 

785 2907 


704 


1615 








705 


1616 


2338 


2994 


M «k -mm, MW^. J-M mmW m~* 

789 3856 


>~l e\ 

706 


1617 


2339 


2995 


785 2043 


707 

i\Jf 


1U 1 o 


ZJ4U 




HQA A 111 

/84 432/ 


708 


1619 


2341 


2997 


787 10225 


709 


1620 








710 


1621 


2342 


2998 


790 24136 


711 


1622 


2343 


2999 


785 172 


712 


1623 


2344 


3000 


790 24958 


713 


1624 


2345 


3001 


790 17848 


714 


1625 


2346 


3002 


790 10920 
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SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 










No. SEQ ID NO.) * 


71 ^ 

/ 13 


1 aoa 

1020 


2347 


3003 


790 24477 


T\A 
/ 10 


102/ 


2348 


0 a a /i 
3004 


790 21024 


71 7 
/ 1 / 


1 /COC 

102o 


2349 


3005 


790 3053 


/ lo 


1 ^">A 

iozy 








71 Q 

/ iy 


1 /CO A 

103U 


2350 


3006 


790 13392 


7on 
/zu 


1 A1 1 

103 1 


2351 


3007 


790 28822 


70 1 
/Zl 


1 *CQO 

1032 


O O CO 

2352 


3008 


790 17805 


700 


1 £00 


2333 


3009 


784 1991 


700 


1 A1A 

1034 


Z334 


3010 


790 12778 


/Z4 


1033 


2355 


o a i 1 
3011 


784 5899 


70 ^\ 
/Z3 


1030 








70 £ 
/ZO 


1 /£07 
103 I 


2350 


3012 


784 485 l 


707 
/Z / 


103o 


2357 


3013 


784 6070 


70 Q 

/Zo 


1 /COO 


OO .C O 

235o 


3014 


784 9615 


70Q 

/zy 


104U 


2359 


3015 


790 23719 


7on 


1041 








70 1 

/ 5 I 


104Z 


2300 


OA1/ 

3016 


784 79 18 


700 


1043 


2351 


0 rv 1 t 

3017 


790 16176 


700 


1044 , 








70A 
/34 


1 AA < 

1043 


*CO 

2302 


3018 


784 4042 


70^ 

/33 


1040 








70/^ 
/JO 


1 AAH 

104/ 


2363 


OA1 A 

3019 


790 4484 


707 
/3 / 


1 AA Q. 

104o 








/3o 


1649 


2364 


3020 


790 16690 


/ 39 


1 Azr\ 
1050 


2365 


0 a^ 1 
3021 


784 7703 


7/1 A 

/4U 


1 1 

105 1 


2366 


3022 


785 2626 


7/t 1 

/41 


1052 








7/1 0 

/4Z 


1053 


2367 


3023 


784 1423 


7/10 
/43 


1 aza 
1054 








1AA 
/44 


1053 








7zK 
f *+3 


1 AKA 
1030 


Z3oo 


3024 


784 8178 


7A£ 
/HO 


103 / 


23oy 


3025 


784 6726 


1AH 


105 0 


Ol 7rt 

2370 


3026 


792 7162 


7AS 
/4o 


1 A<Q 
105y 


ooti 
23/1 


3027 I 


TO /I OA A O 

784 2948 


740 


IOOU 








/ JU 


1 AA\ 
1001 


1-711 

2372 


O A^ O 

3028 


784 2988 


7*\ 1 
/3 I 


1 A AO 
100Z 








7^0 

/jZ 


1 AA*\ 
1003 


IO OO 

23 /3 


O AO A 

3029 


784_10047 1 


7^0 


1 AAA 
1004 








7^4 

/34 


1 AA< 
1003 


23 /4 


1 AO A 

3030 


787 3415 


/jj 


1 AAA 
lOOO 








7^£ 
/DO 


1 AA1 
IOO / 


one 

2375 


O AO 1 

3031 


787_10007 


7-^7 
/3 / 


1 AASi. 
iOOo 








758 


1669 








759 


1670 


2376 


3032 


790 12779 


760 


1671 








761 


1672 


2377 


3033 


785_3699 


762 


1673 


2378 


3034 


784 4026 


763 


1674 


2379 


3035 


790 14323 


764 


1675 








765 


1676 
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SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 










No. SEQ ID NO.) * 


t/C/C 
too 


1 HT~1 
LOft 


OO OA 

2380 


O AO £ 


TOO O/CI/C 

/y2 2oio 

- *— 


T/TT 
JO I 


1 /TTO 

1678 


OO O 1 

2381 


O AO T 

3037 


TOT /CCOA 

/8 / 6570 


loo 


1 /~TA 

1679 


OO OO 

2382 


1 AO O 

3038 


to A o cnn 

/84 2590 


T/T A 

769 


1680 


OO OO 

2383 


O AO A 

3039 


TO A CO 1 

784 5216 


TTA 

770 


1 Zf O 1 

1681 








771 


1682 


OO O A 

2384 


3040 


TOO 1 O C 1 A 

790 12514 


TTO 

772 


1 /TOO 

1683 








773 


1 //T O yl 

1684 








774 


1685 


2385 


304 1 


785 3366 


775 


1686 


O O O yy" 

2386 


O O yt O 

3042 


TOO 1 O yl /I 

790 12446 


776 


1 /£■ OT 

1687 








TTT 

111 


1 o o 

1688 








778 


1 /Ton 

1689 








779 


1690 








780 


i /To 1 

1691 


O O OT 

2387 


O O A O 

3043 


TO /I Z"TAO 

784 6708 


781 


1692 


OO o o 

2388 


o o A A 

3044 


TO /t OCO 

784 852 


TOO 

782 


1693 








TOO 

783 


1694 








TO yl 

784 


1695 








T O C 

785 


1696 


o o o o 

2389 


3045 


T O yl O yl /TO 

784 9460 


786 


1 697 


OO OO 

2390 


3046 


787 7895 


TOT 

/87 


1698 


OO A1 

2391 


O A /I T 

3047 


T O yl /TO O /£" 

784 6226 


TOO 

788 


1699 


OO OO 

2392 


O O yf O 

3048 


'TOO £"0 iTO 

789 5263 


TOO 

/oy 


1 TAA 
1 /OO 


OO AO 

2393 


O A/1 A 

3049 


TOyf /tAOO 

784 4933 


/yu 


1 TA 1 
1/01 








TA 1 

/y i 


1 TAO 
1 /02 








t ao 

792 


1 HA') 

1703 








Trio 

793 


1 T A A 

1704 


OO A A 

2394 


O AC A 

3050 


TO yl T 1 T/T 

784 7176 


794 


1 TAC 

1705 








795 


1706 


oo r\c 

2395 


3051 


784 3074 


79b 


1 TAT 

1707 








797 


1 TAO 

1708 


OO A/T 

2396 


O rt CO 

3052 


TO C 1 £ ylO 

785 1642 


798 


1 TAA 

1709 








799 


1 T 1 A 

1710 


OO AT 

2397 


O ACO 

3053 


TOO 1 O /TT 1 

790 12671 


OAA 

800 


1 T 1 1 

171 1 








£ A1 

oOl 


1 T 1 O 

1712 








802 


1 T 1 O 

1713 


OO AO 

2398 


O AC /I 

3054 


TO yl 11 OT 

784 1127 


OAI 

803 


1 T 1 yl 
1714 








804 


1 T 1 C 

1715 








q ac 
805 


1 T 1 £Z 

1716 


OO AA 

2399 


O ACC 

3055 


Trio 1 /I /lOO 

790 14423 


q a<c 
806 


1 T 1 T 

1717 


O A A A 

2400 


O AC /T" 

3056 


TO O O O 1 y£T 

7902 8 166 


OAT 

807 


1 T 1 O 

1718 








O AO 

808 


1 T 1 A 

1719 








RfiQ 

ouy 


1 70 A 


J. 


^ACO 
D\)J 1 


/o4 Dy40 


810 


Mil 


2402 


3058 


784 4850 


811 


1722 


2403 


3059 


784 744 


812 


1723 


2404 


3060 


790 13151 


813 


1724 








814 


1725 


2405 


3061 


787 7172 


815 


1726 


2406 


3062 


790 6384 


816 


1727 
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SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: ol 
contig peptide 
sequence 


laentification 01 r^noriiy 

i\ppilCdtlOIl 111*11 CUllllg 

nifd TAttornev Docket 
No SEO ID NO/> * 


817 


1728 








818 


1 toa 

1729 


9AH7 1 
Z4U / 




785 3645 


819 


1730 


Z4U0 




784 9630 


820 


1 T? 1 

1731 








821 


1 TOO 
1 /3Z 




106S 


784 10027 


822 


1 TOO 

1 /33 


9A1 n 

Z4 1U 




784 10083 


823 


1 TO/1 

1 /34 


941 1 
Z*+ 1 1 


JuU / 


787 2031 


824 


1 TO C 


941 9 
Z4- l<i 


"3068 


784 4592 


825 


1 TO/C 

1 /3o 








826 


1 TOT 

1 /3 / 


941 1 




790 3308 


827 


1 TO O 

1 /3o 








828 


1 TOO 

i / 


Z4 IH- 


^070 


784 4360 


829 


1 T /I A 

1740 


OA! ^ 


^071 

/ 1 


785 2472 


830 


1 T /I 1 

1741 


9A1 £ 
Z4 ID 




784 2060 


831 


1 T >l O 

1742 


9A1 7 
Z41 / 




784 3549 


832 


1 T /1 1 

1743 








n o 

833 


fl A A 

1744 


9A1 C 
Z4 16 


0\J / T 


784 1872 1 


834 


1745 


9/1 1 Q 

Z4 iy 




784 7925 


835 


1746 


9A9fi 
Z4ZU 


Jv / U 


784 5564 


836 


1 T /I T 

1747 








837 


1 T /I O 

1748 








838 


1749 


9A9 1 
Z4Z1 


^077 


785 1472 


839 


1 TC A 

1750 


OA99 
Z4ZZ 


^075s 


784 2232 


840 


1 TC 1 

175 1 


9A9*} 
Z4Z3 


^07Q 

J3 v / y 


790 24132 


841 


i Tiro 
1752 


9 AO A 
Z4Z4 




790 6689 


842 


1 TCO 

1753 


9A9<. 
Z4ZD 




787 3699 


843 


1 TC /I 

1754 


9A9£ 
Z4ZO 


^0S9 


785 2955 


844 


1 TC C 

1 /DD 


9A97 
Z4Z / 




784 4324 


845 




949 K 




784 9382 


O A /" 

846 


1 TCT 
1 /5 / 


949Q 




787 7087 


O A T 

847 


1 TC O 

1 /DO 








O A O 

848 


1 TC A 


94^0 


^086 

JUOU 


784 4912 


849 


1 T/CA 

1 /oU 








850 


1 T/T 1 

1761 


9A1 1 
Z4 J I 


^ns7 

/ 


790 17236 


851 


1762 


9A19 




790 1914 


852 


1 T^TO 

1763 


9A^^ 


j \joy 


784 466 


853 


1 T/C/1 

1764 


9414 


D\jy\j 


790 11164 


854 


1765 








855 


1766 


Z43D 


JU!?1 


787 8100 

/ O / O JL \J\J } 


r\ r f 

856 




9/1 1£i 
Z430 


^nQ9 

D\Jyz. 


784 9062 


C\ IT ^7 

857 


1768 








858 


1 T /TO 

1769 


9417 


jyjyo 


784 4141 


859 


1 TTA 

1770 










1771 


2438 


3094 


784 1365 


861 


1772 


2439 


3095 


784 6642 


862 


1773 








863 


1774 


2440 


3096 


789 3442 


864 


1775 


2441 


3097 


784 10198 


865 


1776 


2442 


3098 


790 11264 


866 


1777 








867 


1778 


2443 


3099 


788 13640 
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SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
! sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
Filed (Attorney Docket 
No. SEQ ID NO.) * 


868 


A mm mm /-v 

1779 








869 


1780 


y*v JAM 

2444 


3100 


785 238 


870 


1781 


2445 


3101 


787 10243 


871 


1782 


2446 


3102 


784 8555 


^^^^^^ 

872 


1783 


2447 


3103 


789 5245 


873 


1784 


2448 


3104 


784 4968 


874 


1785 


2449 


3105 


789 5631 


875 


1786 


2450 


3106 


784_6373 


876 


1787 


2451 


3107 


784 7992 


r—t r—t 

877 


1788 


2452 


3108 


787 6002 


878 


1789 


2453 


3109 


784 1503 


o ^-r r\ 

879 


1790 


2454 


3110 


790 10504 


880 


1791 


2455 


3111 


790 18438 


881 


1792 


^X A mt 

2456 


3112 


790 13790 


882 


1793 


2457 


3113 


790 21890 


883 


4 mmf*± A 

1794 


2458 


3114 


790 17754 


884 


1795 








885 


1796 


2459 


A «■ ■« mr 

3115 


785 1016 


886 


1797 


2460 


3116 


785 14 


887 


1798 


2461 


3117 


790 18472 


888 


1799 


2462 


3118 


784 8233 


889 


1800 


2463 


3119 


784 177 


890 


1801 


2464 


3120 


784 3522 


891 


1802 


2465 


3121 


784 10067 


892 


1803 


2466 


3122 


784 7868 


893 


1804 








894 


1805 


2467 


3123 


787 7085 


895 


1806 


2468 


3124 


784 3739 


V 

896 


1807 


2469 


3125 


784 4168 


f~l ^-T 

897 


1808 


2470 


3126 


784 6642 


898 


1809 








/"x 

899 


1810 


2471 


3127 


789 6290 


900 


1811 


2472 


3128 


791 2423 


901 


1812 








902 


1813 








903 


1814 










1 Q1 f 

lo ID 








905 


1816 


2473 


3129 


790 10106 


906 


1817 


2474 


3130 


787 3075 


907 


1818 


2475 


3131 


790 17762 


908 


1819 


2476 


3132 


784 482 


909 


1820 


2477 


3133 


790 21059 


910 


1821 


2478 


3134 


788 13847 


911 


1822 









*784_XXX = SEQ ID NO: XXX of Attorney Docket No. 784, US Serial No. 09/488,725 
filed 01/21/2000, the entire disclosure of which., including sequence listing, is 
incorporated herein by reference. 
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785JXXX = SEQ ID NO: XXX of Attorney Docket No. 785, US Serial No. 09/491,404 
filed 01/25/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 

787_XXX = SEQ ID NO: XXX of Attorney Docket No. 787, US Serial No. 09/496,914 
filed 02/03/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 

788_XXX = SEQ ID NO: XXX of Attorney Docket No. 788, US Serial No. 09/515,126 
filed 02/28/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 

789_XXX = SEQ ID NO: XXX of Attorney Docket No. 789, US Serial No. 09/519,705 
filed 03/07/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 

790_XXX = SEQ ID NO: XXX of Attorney Docket No. 790, US Serial No. 09/540,217 
filed 03/31/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 

791_XXX = SEQ ID NO: XXX of Attorney Docket No. 791, US Serial No. 09/552,929 
filed 04/18/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 

792_XXX = SEQ ID NO: XXX of Attorney Docket No. 792, US Serial No. 09/577,408 
filed 05/18/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 
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Table 10 
1154 



SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/339,739 


1 


912 


1 


2 


913 


2 


3 


914 


3 


A 

4 


915 


4 


5 


916 


5 


6 


917 


6 


7 


918 


1 


8 


919 


8 


9 


920 


9 


10 


92 1 


10 


11 


922 


11 


12 


923 


12 


13 


924 


13 


-J .4 

14 


925 


14 


15 


926 


15 


16 


927 


16 


17 


928 


17 


18 


929 


18 


19 


930 


19 


20 


93 1 


20 


21 


932 


21 


22 


933 


22 


23 


934 


23 


24 


935 


24 


25 


936 


25 


26 


937 


26 


27 


938 


27 


28 


939 


28 


29 


940 


29 


30 


94 1 


30 


31 


942 


31 


32 


943 


32 


33 


944 


33 


34 


945 


34 


35 


946 


35 


36 


947 


36 


37 


948 


37 


38 


949 


38 


39 


950 


39 


40 


951 


40 


A -1 

41 


952 


41 


42 


953 


42 


43 


954 


43 


44 


955 


A A 

44 


45 




45 


46 


957 


46 


47 


958 


47 


48 


959 


48 


49 


960 


49 


50 


961 


50 


51 


962 


51 


52 


963 


52 
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Table 10 
1155 



SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/339,739 


53 


964 


53 


54 


965 


' 54 


55 


966 


55 


— 

56 


967 


56 


57 


968 


57 


58 


969 


58 


p* y-v 

59 


970 


d9 


60 


971 


60 


61 


972 


61 


62 


973 


62 


63 


974 


63 


64 


975 


y*~ >f 

64 


y** 

65 


976 


65 


66 


977 


66 


67 


978 


67 


68 


979 


y^ 

68 


69 


980 


69 


70 


981 


•-^ y*\ 

70 


71 


982 


71 


72 


y™\ /— V 

983 


72 


73 


984 


73 


74 


985 


74 


75 


/■^ #1 y"" 

986 


75 


76 


987 


76 


^^^^ 

77 


988 


11 


78 


989 


78 


79 


990 


79 


80 


991 


y-\ y\ 

80 


81 


y™v /"\ y% 

992 


81 


82 


JT\ /V /"X 

993 


82 


(— \ y^ 

83 


994 


83 


84 


995 


■O yl 

84 


85 


996 


85 


86 


997 


86 


87 


/-\ /-\ y-* 

998 


87 


88 


999 


88 


89 


1000 


89 


90 


1001 


90 


/-"V -1 

91 


■4 y"v j~\ j*** 

1002 


91 


92 


1003 


92 


93 


1004 


93 


94 


1005 


94 


95 


1006 


95 


y^ ^* 

96 


1007 


96 


97 


1 AAO 

1008 


97 


98 


1009 


98 


99 


1010 


99 


100 


1011 


100 


101 


1012 


101 


102 


1013 


102 


103 


1014 


103 


104 


1015 


104 
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Table 10 
1156 



SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 
Nucleotide 
■ Sequence 


Full-length 
Peptide 

*— ■» 

Sequence 


Priority 
Application USSN 
60/339,739 


105 


1016 


105 


106 


1017 


106 


107 


1018 


107 


108 


1019 


108 


109 


1020 


109 


1 i r\ 

no 


1021 


no 


in 


1022 


-I -i -f 

111 


112 


1023 


112 


1 1 ^ 
113 


1024 


1 1 o 
1 13 


1 14 


1025 


1 1 4 

1 14 


1 15 


1026 


115 


116 


1027 


116 


1 1 T 

1 17 


1028 


117 


1 1 o 

118 


1029 


1 1 o 

118 


119 


1030 


119 


120 


1031 


120 


121 


1032 


121 


122 


1033 


122 


123 


1034 


123 


124 


1035 


124 


125 


1036 


125 




SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 
Nucleotide 
Sequence 


Full-length 
Peptide 
Sequence 


Priority 
Application USSN 
60/339,453 


126 


1037 


1 


127 


1038 


2 


128 


1039 


3 


129 


1040 


4 


130 


1041 


5 


131 


1042 


7 


132 


1 043 


8 


133 


1044 


9 


134 


1045 


10 


135 


1046 


11 


136 


1047 


12 


137 


1048 


13 


138 


1049 


14 


139 


1050 


15 


140 


1051 


16 


141 


1052 


17 


142 


1 053 


18 


143 


1054 


19 


144 


1055 


20 


145 


1056 


21 


146 


1057 


22 


147 


1058 


23 


148 


1059 


24 


149 


1060 


25 


150 


1061 


26 


151 


1062 


27 
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Table 10 
1157 



SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/339,453 


152 


1063 


28 


153 


1064 


29 


ISA 


1065 


30 


155 


1066 


31 


156 


1067 


32 


157 


1068 


33 


158 


1069 


34 


159 


1070 


35 


160 


1071 


36 


161 


1072 


37 


162 


1073 


38 


163 


1074 


39 


164 


1075 


40 


165 


1076 


41 


166 


1077 


42 


167 


1078 


43 


168 


1079 


44 


169 


1080 


45 


170 


1081 


46 


171 


1082 


47 


172 


1083 


48 


173 


1084 


49 


174 


1085 


50 


175 


1086 


51 


176 


1087 


52 


177 


1088 


53 


178 


1089 


54 


179 


1090 


55 


180 


1091 


56 


181 


1092 


57 


182 


1093 


58 


183 


1094 


59 


184 


1095 


60 


185 


1096 


61 


186 


1097 


62 


187 


1098 


63 


188 


1099 


64 


189 


1100 


65 


190 


1101 


66 


191 


1102 


67 


192 


1103 


68 


193 


1104 


69 


194 


1105 


70 


195 


1106 


71 


196 


H07 


72 


197 


H08 


73 


198 


H09 


74 


199 


mo 


75 


200 


mi 


76 


201 


lll2 


77 


202 


lll3 


78 


203 


lll4 


79 
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Table 10 
1158 



SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/339,453 


204 


1115 


80 


205 


1116 


81 


206 


1117 


82 


207 


11 18 


83 


208 


1119 


f\ A 

84 


209 


1120 


85 


210 


1121 


86 


211 


1122 


87 


212 


1123 


88 


213 


1124 


89 


214 


1125 


90 


215 


1126 


91 


216 


1127 


92 


217 


1128 


93 


218 


1129 


94 


219 


1130 


95 


220 


1131 


96 


221 


1132 


97 


222 


1133 


98 


223 


1134 


99 


224 


1135 


100 


225 


1136 


101 


226 


1137 


102 


227 


1138 


103 


228 


1139 


104 


229 


1140 


105 


230 


1141 


106 


231 


1142 


107 


y-^ 

232 


1143 


108 


233 


1144 


109 


234 


1145 


110 


235 


1146 


111 


236 


1147 


112 


237 


1148 


113 


238 


1149 


114 


239 


1150 


115 


240 


H5l 


116 


241 


H52 


117 


242 


1153 


118 


243 


1154 


119 


244 


H55 


120 


245 


H56 


121 


246 


1157 


122 


247 


H58 


123 


248 


1159 


124 




SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/365,384 


249 


1160 


3 


250 


ll6l 


4 
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Table 10 
1159 



SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/365,384 


251 


1162 


5 


252 


H63 


6 


253 


H64 


8 


254 


1165 


1 t 
12 


255 


H66 


14 


256 


H67 


15 


257 


1168 


•1 t 
17 


258 


1169 


18 


259 


"1 1 *7A 

H70 


1 a 

19 


r\ 

260 


1171 


20 


261 


1172 


21 


262 


1173 


22 


263 


1174 


23 


264 


1175 


24 


265 


1176 


25 


266 


1177 


26 


267 


ll 78 


27 


268 


1179 


28 


269 


1 ISO 


O A 

29 


270 


1181 


1 A 

30 


271 


1182 


31 


272 


1183 


32 


273 


1184 


33 


274 


1185 


34 


275 


1186 


35 


276 


1187 


36 


211 


1188 


37 


278 


H89 


38 


279 


1190 


39 


280 


1191 


40 


281 


■f n ao 
1192 


41 


oo 

282 


1193 


42 


283 


1 1 a ^ 
1194 


43 


284 


i 1 a.c 

1195 


44 


285 


1 1 AzT 

1196 


45 


286 


1 1 AT 

1197 


46 


287 


■i 1 a o 

1198 


47 


rx f\ 

288 


1 "i aa 

1199 


48 


289 


1200 


49 


290 


1201 


C A 

50 


291 


■1 T AT 

1202 


51 


292 


"1 T AT 

1203 


52 


293 


1204 


53 


294 


1205 


54 


295 


1 on/; 


rr 
JJ 


296 


1207 


56 


297 


1208 


57 


298 


1209 


58 


299 


1210 


59 


300 


1211 


60 


301 


1212 


61 


302 


1213 


62 
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Table 10 
1160 



SEQ ID NO of 
Full-length 
Nucleotide 
Sequence 


SEQ ID NO of 
Full-length 
Peptide 
Sequence 


SEQ ID NO in 
Priority 
Application USSN 
60/365,384 


303 


1214 


63 


304 


1215 


64 


305 


1216 


65 


306 


1217 


66 


307 


1218 


67 


308 


1219 


68 


309 


1220 


69 


310 


1221 


70 


311 


1222 " 


71 


312 


1223 


72 


313 


1224 


73 


314 


1225 


74 


315 


1226 


75 


316 


1227 


76 


317 


1228 


11 


318 


1229 


78 


319 


1230 


79 


320 


1231 


80 


321 


1232 


81 


322 


1233 


82 


323 


1234 


83 


324 


1235 


C\ A 

84 


325 


1236 


85 


326 


1237 


86 


327 


1238 


87 


328 


1239 


88 


/""V /**X /"V 

329 


1240 


89 


330 


1241 


90 


331 


1242 


91 


332 


1243 


92 


333 


1244 


93 


334 


1245 


94 


335 


1246 


95 


336 


1247 


96 


337 


1248 


97 


338 


1249 


98 


339 


1250 


99 


340 


1251 


100 


341 


1252 


101 


342 


1253 


102 


343 


1254 


103 


344 


1255 


104 


345 


1256 


105 


346 


1257 


106 


347 


1258 


107 


348 


1259 


108 


349 


1260 


109 


350 


1261 


110 


351 


1262 


111 


352 


1263 


112 


353 


1264 


113 


354 


1265 


114 
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Table 10 
1161 



SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/365,384 


355 


1266 


115 


356 


1267 


116 


357 


1268 


117 


358 


1269 


118 


359 


1270 


119 


360 


1271 


120 


361 


1272 


121 


362 


1273 


122 


363 


111 A 


123 


364 


1275 


VIA 


365 


1276 


125 


366 


1277 


126 


367 


1278 


127 


368 


1279 


128 


369 


1280 


129 


370 


1281 


130 


371 


1282 


131 


372 


1283 


132 


373 


1284 


133 


374 


1285 


134 


375 


1286 


135 


376 


1287 


136 


377 


1288 


137 


378 


1289 


138 


379 


1290 


139 


380 


1291 


140 


381 


1292 


141 


382 


1293 


a a r\ 

142 


383 


1294 


143 


384 


1295 


■4 A A 

144 


385 


1296 


145 


386 


1297 


146 


387 


*t /s /"v *~i 

1298 


147 


388 


1299 


148 


389 


1300 


149 


390 


1301 


150 


391 


1302 


I5l 


392 


1303 


152 


393 


1304 


153 


394 


1305 


154 


395 


1306 


155 


396 


1307 


156 


397 


1308 


157 


398 


1309 


158 


399 


1 1 1 A 

1310 


159 


400 


1311 


160 


401 


1312 


161 


402 


1313 


162 


403 


1314 


163 


404 


1315 


164 


405 


1316 


165 


406 


1317 


166 
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Table 10 
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SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO m 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/365,384 


407 


1318 


167 


408 


1319 


169 


409 


1320 


170 


410 


1321 


171 


411 


1322 


1 72 


412 


1323 


173 


413 


1324 


174 


414 


1325 


175 


415 


1326 


176 


416 


1327 


177 


417 


1328 


178 


418 


1329 


179 




SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/365,091 


419 


1330 


2 


420 


1331 


4 


421 


1332 


5 


422 


1333 


0 


423 


1334 


8 


424 


1335 


9 


425 


1336 


10 


426 


1337 


ll 


427 


1338 


12 


428 


1339 


14 


429 


1340 


1 c 
1 J 


430 


1341 


16 


431 


1342 


17 


432 


1343 


18 


433 


1344 


19 


434 


1345 


20 


435 


1346 


21 


436 


1347 


22 


437 


1348 


24 


438 


1349 


25 


439 


1350 


26 


440 


1351 


27 


441 


1352 


28 


442 


1353 


29 


443 


1354 


32 


444 


1355 


33 


445 


1356 


34 


446 


1357 


35 


447 


1358 


36 


448 


1359 


37 


449 


1360 


38 


450 


1361 


39 


451 


1362 


40 


452 


1363 


41 


453 


1364 


42 
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Table 10 
1163 



SEQ ID NO of 
Full-length 
Nucleotide 
Sequence 


SEQ ID NO of 
Full-length 
Peptide 
Sequence 


SEQ ID NO in 
Priority 
Application USSN 
60/365,091 


454 


1365 


A O 

43 


455 


1366 


A A 

44 


456 


1367 


A C 

45 


457 


* A> a" /~\ 

1368 


46 


458 


1369 


A 

47 


459 


1370 


48 


460 


M « mm -» 

1371 


49 


461 


1372 


50 j 


462 


1373 


51 


463 


•4 ."H AT J 

1374 


^ ^H 

52 


464 


1375 


53 


465 


1376 


54 


466 


1377 


55 


467 


1378 


56 


468 


1379 


57 


469 


1380 


/H 

58 


470 


1381 


59 


471 


1382 


,A" /H 

60 


472 


1383 


A* - 1 

61 


473 


1384 


62 


474 


1385 


63 


475 


1386 


64 


476 


1387 


A Z 

65 


477 


1388 


66 


478 


1389 


67 


479 


1390 


A*" O 

68 


480 


1391 


69 


481 


1392 


70 


482 


1393 


71 


483 


-f AH a> J 

1394 


72 


484 


1395 


73 


485 


1396 


74 


486 


1397 


75 


487 


1398 


76 


488 


1399 


77 


489 


1400 


78 


490 


1401 


79 


491 


1402 


80 


A At 

492 


J A\ A^ 

1403 


81 


493 


1404 


82 


494 


1405 


A"\ Al 

83 


495 


1406 


^H J 

84 


496 


1407 


AH A" 

85 


497 


1408 


A*L A* 

86 


498 


1409 


5 / 


499 


1410 


88 


500 


1411 


89 


501 


1412 


90 


502 


1413 


91 


503 


1414 


92 


504 


1415 


93 


505 


1416 


94 
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Table 10 
1164 



SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/365,091 


506 


1417 


95 


507 


•I i 4 A 

1418 


96 


r~ /-v n 

508 


1419 


97 


509 


1420 


98 


510 


1421 


99 


511 


1422 


100 


512 


1423 


101 


513 


•4 A n A 

1424 


102 


514 


1425 


103 


515 


1426 


104 


516 


1427 


105 


517 


1428 


106 


518 


1429 


107 


519 


1430 


108 


520 


1431 


109 


521 


1432 


110 


522 


1433 


111 


523 


1434 


112 


524 


1435 


113 


525 


1436 


114 


526 


1437 


115 


527 


1438 


116 


528 


1439 


117 


529 


1440 


118 


530 


1441 


119 


531 


1442 


120 


532 


1443 


121 


533 


1444 


122 


534 


1445 


123 


535 


1446 


124 


536 


1447 


125 


537 


1448 


126 


538 


1449 


127 


539 


1450 


128 


540 


1451 


129 


A\ *t 

541 


1452 


130 


542 


1453 


131 


543 


1454 


132 


544 


1455 


133 


545 


1456 


135 


546 


•A A ^ 

1457 


136 


547 


1458 


137 


C A O 

548 


1459 


138 


549 


1460 


139 


550 


1461 


140 


551 


1462 


141 


552 


1463 


142 


553 


1464 


143 


554 


1465 


144 


555 


1466 


145 
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Table 10 
1165 



SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/372,615 


556 


1467 


2 


557 


1468 


3 


558 


1469 


4 


559 


1470 


5 


560 


1471 


6 


561 


1472 


7 


562 


1473 


8 


563 


1474 


9 


564 


1475 


10 


565 


1476 


ll 


566 


1477 


12 


567 


1478 


13 


568 


1479 


1 A 

14 


569 


1480 


15 


570 


1481 


16 


571 


1482 


17 


572 


1483 


18 


573 


1484 


19 


574 


1485 


20 


575 


I486 


21 


576 


1487 


22 


577 


1 ^ no 

1488 


23 


578 


1489 


24 


579 


1490 


25 


580 


1491 


26 


581 


1492 


27 


582 


1493 


28 


583 


1494 


29 


584 


1495 


30 


585 


1496 


31 


586 


1497 


32 


587 


1498 


33 


588 


1499 


34 


589 


1500 


35 


590 


1501 


36 


591 


1502 


37 


592 


1503 


38 


593 


1504 


39 


594 


1505 


40 


595 


1506 


41 


596 


1507 


42 


597 


1508 


43 


598 


1509 


A A 

44 


599 


1510 


45 


/"AA 

oUU 


Id 1 1 




601 


1512 


47 


602 


1513 


48 


603 


1514 


49 


604 


1515 


50 


605 


1516 


51 


606 


1517 


52 


607 


1518 


53 
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SEQ ID NO of 
Full-length 
Nucleotide 
Sequence 


SEQ ID NO of 
Full-length 
Peptide 
Sequence 


SEQ ID NO in 
Priority 
Application USSN 
60/372,615 


608 


15 19 


54 


609 


1520 


55 


610 


1521 


56 


611 


1522 


51 


612 


1523 


58 


613 


1524 


59 


614 


1525 


60 


615 


1526 


61 


616 


1527 


62 


617 


1528 


63 


618 


1529 


64 


619 


1530 


65 


620 


1531 


66 


621 


1532 


67 


622 


1533 


68 


623 


1534 


69 


624 


1535 


70 


625 


1536 


71 


626 


1537 


72 


627 


1538 


73 


628 


1539 


74 


629 


1540 


75 


630 


1541 


76 


631 


1542 


77 


632 


1543 


78 


633 


1544 


79 


634 


1545 


80 


635 


1546 


81 


636 


1547 


82 


637 


1548 


rx /—i 

83 


638 


1549 


84 


639 


1550 


85 


640 


1551 


86 


641 


1552 


87 


642 


1553 


88 


643 


1554 


89 


644 


1555 


90 


645 


1556 


91 


646 


1557 


92 


647 


1558 


93 


648 


1559 


94 


649 


1560 


95 


650 


1561 


96 


651 


1562 


97 


652 


1563 


98 


653 


1564 


99 


654 


1565 


100 


655 


1566 


101 


656 


1567 


102 


657 


1568 


103 


658 


1569 


104 


659 


1570 


105 
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SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/372,615 


f *f jr\ 

660 


1571 


106 


661 


1572 


107 


662 


1573 


108 


y y f\ 

663 


1574 


109 


664 


1575 


110 


665 


1570 


111 


666 


1577 


1 12 


667 


1578 


113 


y f o 

668 


1579 


1 -1 A 

114 


669 


1580 


115 


670 


1581 


116 


611 


1582 


117 


J^w 

672 


1583 


118 


673 


1584 


119 


674 


1585 


120 


675 


1586 


121 


676 


1587 


122 


677 


1588 


123 


678 


1589 


124 


679 


1590 


125 


680 


1591 


126 


681 


1592 


127 


682 


1593 


128 


683 


1594 


129 


684 


1595 


130 


685 


1596 


131 


686 


1597 


132 


V ™"T 

687 


1598 


133 


688 


1599 


134 


689 


1600 


135 


690 


1601 


136 


691 


1602 


137 


692 


1603 


138 


693 


1604 


139 


694 


1605 


140 


695 


1606 


141 


696 


1607 


142 


697 


1608 


143 


698 


1609 


H A A 

144 


V* f\ /"V 

699 


1610 


145 


700 


1611 


146 


701 


1612 


147 


702 


1613 


148 


703 


1614 


149 




1615 


150 


705 


1616 


151 


706 


1617 


152 


707 


1618 


153 


708 


1619 


154 


709 


1620 


155 


710 


1621 


157 


711 


1622 


158 
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SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/372,615 


mm M jm 

712 


1623 


159 


713 


1624 


160 


mm -4 M 

714 


1625 


161 


715 


1626 


162 


716 


1627 


163 


717 


1628 


164 


718 


1629 


165 


719 


1630 


166 


720 


1631 


167 


721 


1632 


168 


722 


1633 


169 


, 723 


1634 


170 


724 


1635 


171 


725 


1636 


172 


726 


1637 


173 


727 


1638 


174 


m—> /— » 

728 


1639 


175 


729 


1640 


176 


730 


1641 


177 


731 


1642 


178 


732 


1643 


179 


733 


1644 


180 


734 


1645 


181 


735 


1646 


182 


736 


1647 


183 


737 


1648 


184 


738 


1649 


185 


739 


1650 


186 


740 


1651 


187 


741 


1652 


188 


742 


1653 


189 


743 


1654 


190 


744 


1655 


191 


745 


1656 


192 


mm J 

746 


1657 


193 


747 


1658 


194 


mm A f-\ 

748 


1659 


195 


m-j A /"\ 

749 


1660 


196 


750 


1661 


197 


751 


1662 


198 


752 


1663 


199 


753 


1664 


200 


754 


1665 


201 


755 


1666 


202 


756 


1667 


203 


757 


1668 


204 


758 


1669 


205 


759 


1670 


206 


760 


1671 


207 


761 


1672 


208 


762 


1673 


209 


763 


1674 


210 
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SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/372,615 


764 


1675 


2ll 


765 


1676 


212 


766 


1677 


213 


767 


1678 


214 


768 


1679 


215 


769 


1680 


216 


770 


1681 


217 


77 1 


1682 


218 


772 


1683 


219 | 


773 


1684 


220 


774 


1685 


221 


775 


1686 


222 


776 


1687 


223 


^7 m~f 

777 


1688 


224 


778 


1689 


225 


^^^^ 

779 


1690 


226 


780 


1691 


227 


781 


1692 


228 


782 


1693 


s\ r\ 

229 


783 


1694 


230 


no ^ 

784 


1695 


23 1 


785 


1696 


233 


^7 y 

786 


1697 


Art i 

234 


787 


1698 


235 


788 


1699 


236 


789 


1700 


237 


790 


1701 


238 


791 


1702 


239 


/-j r\ r\ 

792 


1703 


240 


793 


1704 


24 1 


794 


1705 


242 


795 


1706 


243 


796 


1707 


244 


797 


1708 


a mm 

245 


798 


1709 


246 


799 


1710 


247 


800 


1711 


248 


801 


1712 


249 


802 


1713 


250 


803 


1714 


25 1 


804 


1715 


252 


805 


1716 


253 


806 


1717 


254 


807 


1718 


255 


808 


1719 


256 




SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/372,381 


809 


1720 


1 


810 


1721 


2 
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SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/372,381 


811 


1722 


3 


812 


1723 


4 


813 


1724 


5 


814 


1725 


6 


815 


1726 


7 


816 


1727 


8 


817 


1728 


9 


818 


1729 


10 


819 


1730 


ll 


820 


1731 


12 


821 


1732 


13 


y*X r*\ 

822 


1733 


i ,j 
14 


823 


1 TO A 

1734 


15 


824 


1735 


16 


825 


1736 


17 


826 


•j ***** ~t 

1737 


18 


827 


1738 


19 


828 


1739 


20 


829 


1740 


21 


830 


1741 


22 


831 


1742 


23 


832 


1743 


24 


833 


1744 


25 


834 


1745 


26 


835 


1746 


27 


836 


1747 


28 


837 


1748 


29 


838 


1749 


30 


839 


1750 


31 


840 


-M mm m» -% 

1751 


32 


841 


1752 


33 


842 


1753 


34 


843 


1754 


35 


844 


1755 


36 


845 


1756 


37 


846 


1757 


38 


847 


1758 


39 


848 


1759 


40 


849 


1760 


41 


850 


1761 


42 


851 


-t m-t J- m± 

1762 


43 


852 


1763 


A A 

44 


853 


1764 


45 


854 


1765 


46 


855 


1766 


47 


856 


1767 


48 


857 


1768 


49 


858 


1769 


50 


859 


1770 


51 


860 


1771 


52 


861 


1772 


53 


862 


1773 


54 
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SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 
Nucleotide 
Sequence 


Full-length 
Peptide 
Sequence 


Priority 
Application USSN 
60/372,381 


863 


1774 


55 


864 


1775 


56 


865 


1776 


57 


866 


1777 


58 


867 


1778 


59 


868 


1779 


60 


869 


1780 


61 


870 


1781 


62 


871 


1782 


63 


872 


1783 


64 


873 


1784 


65 


874 


1785 


66 


875 


1786 


67 


876 


1787 


68 


877 


1788 


69 


878 


1789 


70 


879 


1790 


71 


880 


1791 


72 


881 


1792 


73 


882 


1793 


74 


883 


1794 


75 


884 


1795 


76 


885 


1796 


77 


886 


1797 


78 


887 


1798 


79 


888 


1799 


80 


889 


1800 


81 


890 


1801 


82 


891 


1802 


83 


892 


1803 


84 


893 


1804 


85 


894 


1805 


86 


895 


1806 


87 


896 


1807 


88 


897 


1808 


89 


898 


1809 


90 


899 


1810 


91 


900 


1811 


92 


901 


1812 


93 


902 


1813 


94 


903 


1814 


95 


904 


1815 


96 


905 


1816 


97 


906 


1817 


98 


907 


1818 


99 


908 


1819 ' 


100 


909 


1820 


101 


910 


1821 


102 


911 


1822 


103 
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WHAT IS CLAIMED IS: 

1 . An isolated polynucleotide comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1-911. 

2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein 
said polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization 
conditions. 

3. An isolated polynucleotide encoding a polypeptide with biological activity, wherein 
said polynucleotide has greater than about 99% sequence identity with the polynucleotide of 
claim 1. 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

6. A vector comprising the polynucleotide of claim 1 . 

7. An expression vector comprising the polynucleotide of claim 1. 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1. 

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 
operatively associated with a regulatory sequence that modulates expression of the 
polynucleotide in the host cell. 

10. An isolated polypeptide, wherein the polypeptide is selected from the group consisting 
of: 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1; 
and 

(b) a polypeptide encoded by a polynucleotide hybridizing under 
stringent conditions with any one of SEQ ID NO: 1-911. 
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11. A composition comprising the polypeptide of claim 10 and a carrier. 

12. An antibody directed against the polypeptide of claim 10. 

13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a 
complex with the polynucleotide of claim 1 for a period sufficient to form the complex; and 

b) detecting the complex, so that if a complex is detected, the 
polynucleotide of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with 
nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions; 

b) amplifying a product comprising at least a portion of the 
polynucleotide of claim 1 ; and 

c) detecting said product and thereby the polynucleotide of claim 1 in the 

sample. 

15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the 
method further comprises reverse transcribing an annealed RNA molecule into a cDNA 
polynucleotide. 

16. A method for detecting the polypeptide of claim 10 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a 
complex with the polypeptide under conditions and for a period sufficient to form the 
complex; and 

b) detecting formation of the complex, so that if a complex formation is 
detected, the polypeptide of claim 10 is detected. 

17. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 
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a) contacting the compound with the polypeptide of claim 10 under 
conditions sufficient to form a polypeptide/compound complex; and 

b) detecting the complex, so that if the polypeptide/compound complex 
is detected, a compound that binds to the polypeptide of claim 10 is identified. 

18. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 

a) contacting the compound with the polypeptide of claim 10, in a cell, 
under conditions sufficient to form a polypeptide/compound complex, wherein the complex 
drives expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence expression, 
so that if the polypeptide/compound complex is detected, a compound that binds to the 
polypeptide of claim 1 0 is identified. 

19. A method of producing the polypeptide of claim 10, comprising, 

a) culturing a host cell comprising a polynucleotide sequence selected 
from the group consisting of any of the polynucleotides from SEQ ID NO: 1-911, under 
conditions sufficient to express the polypeptide in said cell; and 

b) isolating the polypeptide from the cell culture or cells of step (a). 

20. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of any one of the polypeptides SEQ ID NO: 912-1822. 

21. The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide 
array. 

22. A collection of polynucleotides, wherein the collection comprising of at least one of 
SEQ ID NO: 1-911. 

23. The collection of claim 22, wherein the collection is provided on a nucleic acid array. 

24. The collection of claim 23, wherein the array detects full-matches to any one of the 
polynucleotides in the collection. 
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25. The collection of claim 23, wherein the array detects mismatches to any one of the 
polynucleotides in the collection. 

26. The collection of claim 22, wherein the collection is provided in a computer-readable 
format. 



